Commit f5fc733a (parent 09bc9f54), authored by Byzantine

Removing research/community models
![TensorFlow Requirement: 1.x](https://img.shields.io/badge/TensorFlow%20Requirement-1.x-brightgreen)
![TensorFlow 2 Not Supported](https://img.shields.io/badge/TensorFlow%202%20Not%20Supported-%E2%9C%95-red.svg)
# Adversarial Text Classification
Code for [*Adversarial Training Methods for Semi-Supervised Text Classification*](https://arxiv.org/abs/1605.07725) and [*Semi-Supervised Sequence Learning*](https://arxiv.org/abs/1511.01432).
## Requirements
* TensorFlow >= v1.3
## End-to-end IMDB Sentiment Classification
### Fetch data
```bash
$ wget http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz \
-O /tmp/imdb.tar.gz
$ tar -xf /tmp/imdb.tar.gz -C /tmp
```
The directory `/tmp/aclImdb` contains the raw IMDB data.
### Generate vocabulary
```bash
$ IMDB_DATA_DIR=/tmp/imdb
$ python gen_vocab.py \
--output_dir=$IMDB_DATA_DIR \
--dataset=imdb \
--imdb_input_dir=/tmp/aclImdb \
--lowercase=False
```
Vocabulary and frequency files will be generated in `$IMDB_DATA_DIR`.
### Generate training, validation, and test data
```bash
$ python gen_data.py \
--output_dir=$IMDB_DATA_DIR \
--dataset=imdb \
--imdb_input_dir=/tmp/aclImdb \
--lowercase=False \
--label_gain=False
```
`$IMDB_DATA_DIR` contains TFRecords files.
### Pretrain IMDB Language Model
```bash
$ PRETRAIN_DIR=/tmp/models/imdb_pretrain
$ python pretrain.py \
--train_dir=$PRETRAIN_DIR \
--data_dir=$IMDB_DATA_DIR \
--vocab_size=86934 \
--embedding_dims=256 \
--rnn_cell_size=1024 \
--num_candidate_samples=1024 \
--batch_size=256 \
--learning_rate=0.001 \
--learning_rate_decay_factor=0.9999 \
--max_steps=100000 \
--max_grad_norm=1.0 \
--num_timesteps=400 \
--keep_prob_emb=0.5 \
--normalize_embeddings
```
`$PRETRAIN_DIR` contains checkpoints of the pretrained language model.
### Train classifier
Most flags stay the same. The differences are that candidate sampling is
removed, `pretrained_model_dir` is added (the classifier loads the pretrained
embedding and LSTM variables from it), and flags related to adversarial
training and classification are added.
```bash
$ TRAIN_DIR=/tmp/models/imdb_classify
$ python train_classifier.py \
--train_dir=$TRAIN_DIR \
--pretrained_model_dir=$PRETRAIN_DIR \
--data_dir=$IMDB_DATA_DIR \
--vocab_size=86934 \
--embedding_dims=256 \
--rnn_cell_size=1024 \
--cl_num_layers=1 \
--cl_hidden_size=30 \
--batch_size=64 \
--learning_rate=0.0005 \
--learning_rate_decay_factor=0.9998 \
--max_steps=15000 \
--max_grad_norm=1.0 \
--num_timesteps=400 \
--keep_prob_emb=0.5 \
--normalize_embeddings \
--adv_training_method=vat \
--perturb_norm_length=5.0
```
### Evaluate on test data
```bash
$ EVAL_DIR=/tmp/models/imdb_eval
$ python evaluate.py \
--eval_dir=$EVAL_DIR \
--checkpoint_dir=$TRAIN_DIR \
--eval_data=test \
--run_once \
--num_examples=25000 \
--data_dir=$IMDB_DATA_DIR \
--vocab_size=86934 \
--embedding_dims=256 \
--rnn_cell_size=1024 \
--batch_size=256 \
--num_timesteps=400 \
--normalize_embeddings
```
## Code Overview
The main entry points are the binaries listed below. Each training binary builds
a `VatxtModel`, defined in `graphs.py`, which in turn uses graph building blocks
defined in `inputs.py` (defines input data reading and parsing), `layers.py`
(defines core model components), and `adversarial_losses.py` (defines
adversarial training losses). The training loop itself is defined in
`train_utils.py`.
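For orientation, the skeleton shared by the training binaries looks like the
sketch below. It mirrors `pretrain.py` (included later in this commit);
`train_classifier.py` follows the same pattern with its classifier-training
graph call instead of `language_model_training()`.

```python
# Sketch of how a training binary composes the pieces above
# (this mirrors pretrain.py; it adds no functionality of its own).
import tensorflow as tf

import graphs       # builds VatxtModel from inputs.py / layers.py / adversarial_losses.py
import train_utils  # owns the training loop and distributed-training flags

FLAGS = tf.app.flags.FLAGS


def main(_):
  tf.logging.set_verbosity(tf.logging.INFO)
  with tf.device(tf.train.replica_device_setter(FLAGS.ps_tasks)):
    model = graphs.get_model()
    train_op, loss, global_step = model.language_model_training()
    train_utils.run_training(train_op, loss, global_step)


if __name__ == '__main__':
  tf.app.run()
```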
### Binaries
* Pretraining: `pretrain.py`
* Classifier Training: `train_classifier.py`
* Evaluation: `evaluate.py`
### Command-Line Flags
Flags related to distributed training and the training loop itself are defined
in [`train_utils.py`](https://github.com/tensorflow/models/tree/master/research/adversarial_text/train_utils.py).
Flags related to model hyperparameters are defined in [`graphs.py`](https://github.com/tensorflow/models/tree/master/research/adversarial_text/graphs.py).
Flags related to adversarial training are defined in [`adversarial_losses.py`](https://github.com/tensorflow/models/tree/master/research/adversarial_text/adversarial_losses.py).
Flags particular to each job are defined in the main binary files.
### Data Generation
* Vocabulary generation: [`gen_vocab.py`](https://github.com/tensorflow/models/tree/master/research/adversarial_text/gen_vocab.py)
* Data generation: [`gen_data.py`](https://github.com/tensorflow/models/tree/master/research/adversarial_text/gen_data.py)
Command-line flags defined in [`document_generators.py`](https://github.com/tensorflow/models/tree/master/research/adversarial_text/data/document_generators.py)
control which dataset is processed and how.
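As an optional sanity check (not part of the original walkthrough), you can
parse one record from the generated classification split. The filename and the
feature-list keys (`token_id`, `label`, `weight`) come from
`data/data_utils.py` in this commit; the path assumes the
`$IMDB_DATA_DIR=/tmp/imdb` used above.

```python
import os

import tensorflow as tf

data_dir = '/tmp/imdb'  # $IMDB_DATA_DIR from the walkthrough above
path = os.path.join(data_dir, 'train_classification.tfrecords')

# Each record is a tf.train.SequenceExample with feature lists
# 'token_id', 'label', and 'weight' (see data/data_utils.py).
record = next(tf.python_io.tf_record_iterator(path))
seq = tf.train.SequenceExample()
seq.ParseFromString(record)

feature_list = seq.feature_lists.feature_list
tokens = [f.int64_list.value[0] for f in feature_list['token_id'].feature]
labels = [f.int64_list.value[0] for f in feature_list['label'].feature]
weights = [f.float_list.value[0] for f in feature_list['weight'].feature]

print('timesteps:', len(tokens))
# For classification data only the final timestep carries the class label
# (weight 1.0); earlier timesteps keep the default label 0 and weight 0.0.
print('final label, weight:', labels[-1], weights[-1])
```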
## Contact for Issues
* Ryan Sepassi, @rsepassi
* Andrew M. Dai, @a-dai <adai@google.com>
* Takeru Miyato, @takerum (Original implementation)
# Copyright 2017 Google Inc. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""Adversarial losses for text models."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
# Dependency imports
from six.moves import xrange
import tensorflow as tf
flags = tf.app.flags
FLAGS = flags.FLAGS
# Adversarial and virtual adversarial training parameters.
flags.DEFINE_float('perturb_norm_length', 5.0,
'Norm length of the adversarial perturbation; tune on the '
'validation set. 5.0 is optimal on IMDB with virtual '
'adversarial training.')
# Virtual adversarial training parameters
flags.DEFINE_integer('num_power_iteration', 1, 'The number of power iterations.')
flags.DEFINE_float('small_constant_for_finite_diff', 1e-1,
'Small constant for finite difference method')
# Parameters for building the graph
flags.DEFINE_string('adv_training_method', None,
'The flag which specifies training method. '
'"" : non-adversarial training (e.g. for running the '
' semi-supervised sequence learning model) '
'"rp" : random perturbation training '
'"at" : adversarial training '
'"vat" : virtual adversarial training '
'"atvat" : at + vat ')
flags.DEFINE_float('adv_reg_coeff', 1.0,
'Regularization coefficient of adversarial loss.')
def random_perturbation_loss(embedded, length, loss_fn):
"""Adds noise to embeddings and recomputes classification loss."""
noise = tf.random_normal(shape=tf.shape(embedded))
perturb = _scale_l2(_mask_by_length(noise, length), FLAGS.perturb_norm_length)
return loss_fn(embedded + perturb)
def adversarial_loss(embedded, loss, loss_fn):
"""Adds gradient to embedding and recomputes classification loss."""
grad, = tf.gradients(
loss,
embedded,
aggregation_method=tf.AggregationMethod.EXPERIMENTAL_ACCUMULATE_N)
grad = tf.stop_gradient(grad)
perturb = _scale_l2(grad, FLAGS.perturb_norm_length)
return loss_fn(embedded + perturb)
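# Illustrative wiring (not part of the original file; the real graph
# construction lives in graphs.py): a classifier graph would typically add the
# adversarial term to its supervised loss, scaled by FLAGS.adv_reg_coeff, e.g.
#
#   cl_loss = classification_loss(embedded)   # hypothetical loss_fn
#   adv_loss = adversarial_loss(embedded, cl_loss, classification_loss)
#   total_loss = cl_loss + FLAGS.adv_reg_coeff * adv_loss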
def virtual_adversarial_loss(logits, embedded, inputs,
logits_from_embedding_fn):
"""Virtual adversarial loss.
Computes virtual adversarial perturbation by finite difference method and
power iteration, adds it to the embedding, and computes the KL divergence
between the new logits and the original logits.
Args:
logits: 3-D float Tensor, [batch_size, num_timesteps, m], where m=1 if
num_classes=2, otherwise m=num_classes.
embedded: 3-D float Tensor, [batch_size, num_timesteps, embedding_dim].
inputs: VatxtInput.
logits_from_embedding_fn: callable that takes embeddings and returns
classifier logits.
Returns:
kl: float scalar.
"""
# Stop gradient of logits. See https://arxiv.org/abs/1507.00677 for details.
logits = tf.stop_gradient(logits)
# Only care about the KL divergence on the final timestep.
weights = inputs.eos_weights
assert weights is not None
if FLAGS.single_label:
indices = tf.stack([tf.range(FLAGS.batch_size), inputs.length - 1], 1)
weights = tf.expand_dims(tf.gather_nd(inputs.eos_weights, indices), 1)
# Initialize perturbation with random noise.
# shape(embedded) = (batch_size, num_timesteps, embedding_dim)
d = tf.random_normal(shape=tf.shape(embedded))
# Perform finite difference method and power iteration.
# See Eq.(8) in the paper http://arxiv.org/pdf/1507.00677.pdf,
# Adding small noise to input and taking gradient with respect to the noise
# corresponds to 1 power iteration.
for _ in xrange(FLAGS.num_power_iteration):
d = _scale_l2(
_mask_by_length(d, inputs.length), FLAGS.small_constant_for_finite_diff)
d_logits = logits_from_embedding_fn(embedded + d)
kl = _kl_divergence_with_logits(logits, d_logits, weights)
d, = tf.gradients(
kl,
d,
aggregation_method=tf.AggregationMethod.EXPERIMENTAL_ACCUMULATE_N)
d = tf.stop_gradient(d)
perturb = _scale_l2(d, FLAGS.perturb_norm_length)
vadv_logits = logits_from_embedding_fn(embedded + perturb)
return _kl_divergence_with_logits(logits, vadv_logits, weights)
def random_perturbation_loss_bidir(embedded, length, loss_fn):
"""Adds noise to embeddings and recomputes classification loss."""
noise = [tf.random_normal(shape=tf.shape(emb)) for emb in embedded]
masked = [_mask_by_length(n, length) for n in noise]
scaled = [_scale_l2(m, FLAGS.perturb_norm_length) for m in masked]
return loss_fn([e + s for (e, s) in zip(embedded, scaled)])
def adversarial_loss_bidir(embedded, loss, loss_fn):
"""Adds gradient to embeddings and recomputes classification loss."""
grads = tf.gradients(
loss,
embedded,
aggregation_method=tf.AggregationMethod.EXPERIMENTAL_ACCUMULATE_N)
adv_exs = [
emb + _scale_l2(tf.stop_gradient(g), FLAGS.perturb_norm_length)
for emb, g in zip(embedded, grads)
]
return loss_fn(adv_exs)
def virtual_adversarial_loss_bidir(logits, embedded, inputs,
logits_from_embedding_fn):
"""Virtual adversarial loss for bidirectional models."""
logits = tf.stop_gradient(logits)
f_inputs, _ = inputs
weights = f_inputs.eos_weights
if FLAGS.single_label:
indices = tf.stack([tf.range(FLAGS.batch_size), f_inputs.length - 1], 1)
weights = tf.expand_dims(tf.gather_nd(f_inputs.eos_weights, indices), 1)
assert weights is not None
perturbs = [
_mask_by_length(tf.random_normal(shape=tf.shape(emb)), f_inputs.length)
for emb in embedded
]
for _ in xrange(FLAGS.num_power_iteration):
perturbs = [
_scale_l2(d, FLAGS.small_constant_for_finite_diff) for d in perturbs
]
d_logits = logits_from_embedding_fn(
[emb + d for (emb, d) in zip(embedded, perturbs)])
kl = _kl_divergence_with_logits(logits, d_logits, weights)
perturbs = tf.gradients(
kl,
perturbs,
aggregation_method=tf.AggregationMethod.EXPERIMENTAL_ACCUMULATE_N)
perturbs = [tf.stop_gradient(d) for d in perturbs]
perturbs = [_scale_l2(d, FLAGS.perturb_norm_length) for d in perturbs]
vadv_logits = logits_from_embedding_fn(
[emb + d for (emb, d) in zip(embedded, perturbs)])
return _kl_divergence_with_logits(logits, vadv_logits, weights)
def _mask_by_length(t, length):
"""Mask t, 3-D [batch, time, dim], by length, 1-D [batch,]."""
maxlen = t.get_shape().as_list()[1]
# Subtract 1 from length to prevent the perturbation from going on 'eos'
mask = tf.sequence_mask(length - 1, maxlen=maxlen)
mask = tf.expand_dims(tf.cast(mask, tf.float32), -1)
# shape(mask) = (batch, num_timesteps, 1)
return t * mask
def _scale_l2(x, norm_length):
# shape(x) = (batch, num_timesteps, d)
# Divide x by max(abs(x)) for a numerically stable L2 norm.
# 2norm(x) = a * 2norm(x/a)
# Scale over the full sequence, dims (1, 2)
alpha = tf.reduce_max(tf.abs(x), (1, 2), keep_dims=True) + 1e-12
l2_norm = alpha * tf.sqrt(
tf.reduce_sum(tf.pow(x / alpha, 2), (1, 2), keep_dims=True) + 1e-6)
x_unit = x / l2_norm
return norm_length * x_unit
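# Illustrative property (comment only, not part of the original file): for a
# 3-D tensor x, _scale_l2(x, norm_length) rescales each batch element so that
# its L2 norm over the (time, dim) axes is approximately norm_length:
#
#   x = tf.random_normal([2, 10, 3])
#   scaled = _scale_l2(x, 5.0)
#   norms = tf.sqrt(tf.reduce_sum(tf.square(scaled), [1, 2]))  # ~[5.0, 5.0]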
def _kl_divergence_with_logits(q_logits, p_logits, weights):
"""Returns weighted KL divergence between distributions q and p.
Args:
q_logits: logits for 1st argument of KL divergence, with shape
[batch_size, num_timesteps, num_classes] if num_classes > 2, and
[batch_size, num_timesteps, 1] if num_classes == 2.
p_logits: logits for 2nd argument of KL divergence, with the same shape as
q_logits.
weights: 2-D float tensor with shape [batch_size, num_timesteps]
(or [batch_size, 1] when FLAGS.single_label is set).
Elements should be 1.0 only at the end of sequences.
Returns:
KL: float scalar.
"""
# For logistic regression
if FLAGS.num_classes == 2:
q = tf.nn.sigmoid(q_logits)
kl = (-tf.nn.sigmoid_cross_entropy_with_logits(logits=q_logits, labels=q) +
tf.nn.sigmoid_cross_entropy_with_logits(logits=p_logits, labels=q))
kl = tf.squeeze(kl, 2)
# For softmax regression
else:
q = tf.nn.softmax(q_logits)
kl = tf.reduce_sum(
q * (tf.nn.log_softmax(q_logits) - tf.nn.log_softmax(p_logits)), -1)
num_labels = tf.reduce_sum(weights)
num_labels = tf.where(tf.equal(num_labels, 0.), 1., num_labels)
kl.get_shape().assert_has_rank(2)
weights.get_shape().assert_has_rank(2)
loss = tf.identity(tf.reduce_sum(weights * kl) / num_labels, name='kl')
return loss
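# Note on the sigmoid branch above (added comment): with q = sigmoid(q_logits)
# and p = sigmoid(p_logits), the expression
#   -sigmoid_cross_entropy_with_logits(logits=q_logits, labels=q)
#   + sigmoid_cross_entropy_with_logits(logits=p_logits, labels=q)
# reduces to q*log(q/p) + (1-q)*log((1-q)/(1-p)), i.e. the per-timestep
# KL(Bernoulli(q) || Bernoulli(p)), matching the softmax branch.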
# Copyright 2017 Google Inc. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""Utilities for generating/preprocessing data for adversarial text models."""
import operator
import os
import random
import re
# Dependency imports
import tensorflow as tf
EOS_TOKEN = '</s>'
# Data filenames
# Sequence Autoencoder
ALL_SA = 'all_sa.tfrecords'
TRAIN_SA = 'train_sa.tfrecords'
TEST_SA = 'test_sa.tfrecords'
# Language Model
ALL_LM = 'all_lm.tfrecords'
TRAIN_LM = 'train_lm.tfrecords'
TEST_LM = 'test_lm.tfrecords'
# Classification
TRAIN_CLASS = 'train_classification.tfrecords'
TEST_CLASS = 'test_classification.tfrecords'
VALID_CLASS = 'validate_classification.tfrecords'
# LM with bidirectional LSTM
TRAIN_REV_LM = 'train_reverse_lm.tfrecords'
TEST_REV_LM = 'test_reverse_lm.tfrecords'
# Classification with bidirectional LSTM
TRAIN_BD_CLASS = 'train_bidir_classification.tfrecords'
TEST_BD_CLASS = 'test_bidir_classification.tfrecords'
VALID_BD_CLASS = 'validate_bidir_classification.tfrecords'
class ShufflingTFRecordWriter(object):
"""Thin wrapper around TFRecordWriter that shuffles records."""
def __init__(self, path):
self._path = path
self._records = []
self._closed = False
def write(self, record):
assert not self._closed
self._records.append(record)
def close(self):
assert not self._closed
random.shuffle(self._records)
with tf.python_io.TFRecordWriter(self._path) as f:
for record in self._records:
f.write(record)
self._closed = True
def __enter__(self):
return self
def __exit__(self, unused_type, unused_value, unused_traceback):
self.close()
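# Typical usage (illustrative; the data generation binaries in this commit use
# the writer roughly like this):
#
#   with ShufflingTFRecordWriter(os.path.join(output_dir, TRAIN_CLASS)) as w:
#     for seq in labeled_sequences:  # hypothetical iterable of SequenceWrapper
#       w.write(seq.seq.SerializeToString())
#
# All records are buffered in memory and shuffled once when the writer closes.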
class Timestep(object):
"""Represents a single timestep in a SequenceWrapper."""
def __init__(self, token, label, weight, multivalent_tokens=False):
"""Constructs Timestep from empty Features."""
self._token = token
self._label = label
self._weight = weight
self._multivalent_tokens = multivalent_tokens
self._fill_with_defaults()
@property
def token(self):
if self._multivalent_tokens:
raise TypeError('Timestep may contain multiple values; use `tokens`')
return self._token.int64_list.value[0]
@property
def tokens(self):
return self._token.int64_list.value
@property
def label(self):
return self._label.int64_list.value[0]
@property
def weight(self):
return self._weight.float_list.value[0]
def set_token(self, token):
if self._multivalent_tokens:
raise TypeError('Timestep may contain multiple values; use `add_token`')
self._token.int64_list.value[0] = token
return self
def add_token(self, token):
self._token.int64_list.value.append(token)
return self
def set_label(self, label):
self._label.int64_list.value[0] = label
return self
def set_weight(self, weight):
self._weight.float_list.value[0] = weight
return self
def copy_from(self, timestep):
self.set_token(timestep.token).set_label(timestep.label).set_weight(
timestep.weight)
return self
def _fill_with_defaults(self):
if not self._multivalent_tokens:
self._token.int64_list.value.append(0)
self._label.int64_list.value.append(0)
self._weight.float_list.value.append(0.0)
class SequenceWrapper(object):
"""Wrapper around tf.SequenceExample."""
F_TOKEN_ID = 'token_id'
F_LABEL = 'label'
F_WEIGHT = 'weight'
def __init__(self, multivalent_tokens=False):
self._seq = tf.train.SequenceExample()
self._flist = self._seq.feature_lists.feature_list
self._timesteps = []
self._multivalent_tokens = multivalent_tokens
@property
def seq(self):
return self._seq
@property
def multivalent_tokens(self):
return self._multivalent_tokens
@property
def _tokens(self):
return self._flist[SequenceWrapper.F_TOKEN_ID].feature
@property
def _labels(self):
return self._flist[SequenceWrapper.F_LABEL].feature
@property
def _weights(self):
return self._flist[SequenceWrapper.F_WEIGHT].feature
def add_timestep(self):
timestep = Timestep(
self._tokens.add(),
self._labels.add(),
self._weights.add(),
multivalent_tokens=self._multivalent_tokens)
self._timesteps.append(timestep)
return timestep
def __iter__(self):
for timestep in self._timesteps:
yield timestep
def __len__(self):
return len(self._timesteps)
def __getitem__(self, idx):
return self._timesteps[idx]
def build_reverse_sequence(seq):
"""Builds a sequence that is the reverse of the input sequence."""
reverse_seq = SequenceWrapper()
# Copy all but last timestep
for timestep in reversed(seq[:-1]):
reverse_seq.add_timestep().copy_from(timestep)
# Copy final timestep
reverse_seq.add_timestep().copy_from(seq[-1])
return reverse_seq
def build_bidirectional_seq(seq, rev_seq):
bidir_seq = SequenceWrapper(multivalent_tokens=True)
for forward_ts, reverse_ts in zip(seq, rev_seq):
bidir_seq.add_timestep().add_token(forward_ts.token).add_token(
reverse_ts.token)
return bidir_seq
def build_lm_sequence(seq):
"""Builds language model sequence from input sequence.
Args:
seq: SequenceWrapper.
Returns:
SequenceWrapper with `seq` tokens copied over to output sequence tokens and
labels (offset by 1, i.e. predict the next token) with weights set to 1.0,
except for the final <eos> timestep, whose weight is 0.0.
"""
lm_seq = SequenceWrapper()
for i, timestep in enumerate(seq):
if i == len(seq) - 1:
lm_seq.add_timestep().set_token(timestep.token).set_label(
seq[i].token).set_weight(0.0)
else:
lm_seq.add_timestep().set_token(timestep.token).set_label(
seq[i + 1].token).set_weight(1.0)
return lm_seq
def build_seq_ae_sequence(seq):
"""Builds seq_ae sequence from input sequence.
Args:
seq: SequenceWrapper.
Returns:
SequenceWrapper with `seq` inputs copied and concatenated, and with labels
copied in on the right-hand (i.e. decoder) side with weights set to 1.0.
The new sequence will have length `len(seq) * 2 - 1`, as the last timestep
of the encoder section and the first step of the decoder section will
overlap.
"""
seq_ae_seq = SequenceWrapper()
for i in range(len(seq) * 2 - 1):
ts = seq_ae_seq.add_timestep()
if i < len(seq) - 1:
# Encoder
ts.set_token(seq[i].token)
elif i == len(seq) - 1:
# Transition step
ts.set_token(seq[i].token)
ts.set_label(seq[0].token)
ts.set_weight(1.0)
else:
# Decoder
ts.set_token(seq[i % len(seq)].token)
ts.set_label(seq[(i + 1) % len(seq)].token)
ts.set_weight(1.0)
return seq_ae_seq
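# Worked example (comment only; the same pattern is exercised by
# testBuildSAESeq in data/data_utils_test.py): for input tokens [0, 1, 2] the
# output has 2*3 - 1 = 5 timesteps:
#
#   tokens:  0  1  2  0  1   (encoder, then the decoder re-reads the inputs)
#   labels:  0  0  0  1  2   (transition + decoder steps predict the sequence)
#   weights: 0  0  1  1  1   (loss on the transition and decoder steps only)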
def build_labeled_sequence(seq, class_label, label_gain=False):
"""Builds labeled sequence from input sequence.
Args:
seq: SequenceWrapper.
class_label: integer, starting from 0.
label_gain: bool. If True, class_label will be put on every timestep and
weight will increase linearly from 0 to 1.
Returns:
SequenceWrapper with `seq` copied in and `class_label` added as label to
final timestep.
"""
label_seq = SequenceWrapper(multivalent_tokens=seq.multivalent_tokens)
# Copy sequence without labels
seq_len = len(seq)
final_timestep = None
for i, timestep in enumerate(seq):
label_timestep = label_seq.add_timestep()
if seq.multivalent_tokens:
for token in timestep.tokens:
label_timestep.add_token(token)
else:
label_timestep.set_token(timestep.token)
if label_gain:
label_timestep.set_label(int(class_label))
weight = 1.0 if seq_len < 2 else float(i) / (seq_len - 1)
label_timestep.set_weight(weight)
if i == (seq_len - 1):
final_timestep = label_timestep
# Edit final timestep to have class label and weight = 1.
final_timestep.set_label(int(class_label)).set_weight(1.0)
return label_seq
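# Worked example (comment only): for input tokens [0, 1, 2] and class_label=1
# with label_gain=False, only the final timestep carries the label:
#
#   tokens:  0    1    2
#   labels:  0    0    1
#   weights: 0.0  0.0  1.0
#
# With label_gain=True, every timestep gets label 1 and weight i / (len - 1),
# as exercised by testLabelGain in data/data_utils_test.py.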
def split_by_punct(segment):
"""Splits str segment by punctuation, filters our empties and spaces."""
return [s for s in re.split(r'\W+', segment) if s and not s.isspace()]
def sort_vocab_by_frequency(vocab_freq_map):
"""Sorts vocab_freq_map by count.
Args:
vocab_freq_map: dict<str term, int count>, vocabulary terms with counts.
Returns:
list<tuple<str term, int count>> sorted by count, descending.
"""
return sorted(
vocab_freq_map.items(), key=operator.itemgetter(1), reverse=True)
def write_vocab_and_frequency(ordered_vocab_freqs, output_dir):
"""Writes ordered_vocab_freqs into vocab.txt and vocab_freq.txt."""
tf.gfile.MakeDirs(output_dir)
with open(os.path.join(output_dir, 'vocab.txt'), 'w', encoding='utf-8') as vocab_f:
with open(os.path.join(output_dir, 'vocab_freq.txt'), 'w', encoding='utf-8') as freq_f:
for word, freq in ordered_vocab_freqs:
vocab_f.write('{}\n'.format(word))
freq_f.write('{}\n'.format(freq))
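# Illustrative vocab flow (comment only; gen_vocab.py in this commit is the
# real driver): count term frequencies, sort, and write the two files read by
# the training binaries.
#
#   freqs = {'the': 120, 'movie': 40, '</s>': 25}    # toy counts
#   ordered = sort_vocab_by_frequency(freqs)         # [('the', 120), ...]
#   write_vocab_and_frequency(ordered, '/tmp/imdb')  # vocab.txt, vocab_freq.txt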
# Copyright 2017 Google Inc. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""Tests for data_utils."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
# Dependency imports
import tensorflow as tf
from data import data_utils
data = data_utils
class SequenceWrapperTest(tf.test.TestCase):
def testDefaultTimesteps(self):
seq = data.SequenceWrapper()
t1 = seq.add_timestep()
_ = seq.add_timestep()
self.assertEqual(len(seq), 2)
self.assertEqual(t1.weight, 0.0)
self.assertEqual(t1.label, 0)
self.assertEqual(t1.token, 0)
def testSettersAndGetters(self):
ts = data.SequenceWrapper().add_timestep()
ts.set_token(3)
ts.set_label(4)
ts.set_weight(2.0)
self.assertEqual(ts.token, 3)
self.assertEqual(ts.label, 4)
self.assertEqual(ts.weight, 2.0)
def testTimestepIteration(self):
seq = data.SequenceWrapper()
seq.add_timestep().set_token(0)
seq.add_timestep().set_token(1)
seq.add_timestep().set_token(2)
for i, ts in enumerate(seq):
self.assertEqual(ts.token, i)
def testFillsSequenceExampleCorrectly(self):
seq = data.SequenceWrapper()
seq.add_timestep().set_token(1).set_label(2).set_weight(3.0)
seq.add_timestep().set_token(10).set_label(20).set_weight(30.0)
seq_ex = seq.seq
fl = seq_ex.feature_lists.feature_list
fl_token = fl[data.SequenceWrapper.F_TOKEN_ID].feature
fl_label = fl[data.SequenceWrapper.F_LABEL].feature
fl_weight = fl[data.SequenceWrapper.F_WEIGHT].feature
_ = [self.assertEqual(len(f), 2) for f in [fl_token, fl_label, fl_weight]]
self.assertAllEqual([f.int64_list.value[0] for f in fl_token], [1, 10])
self.assertAllEqual([f.int64_list.value[0] for f in fl_label], [2, 20])
self.assertAllEqual([f.float_list.value[0] for f in fl_weight], [3.0, 30.0])
class DataUtilsTest(tf.test.TestCase):
def testSplitByPunct(self):
output = data.split_by_punct(
'hello! world, i\'ve been\nwaiting\tfor\ryou for.a long time')
expected = [
'hello', 'world', 'i', 've', 'been', 'waiting', 'for', 'you', 'for',
'a', 'long', 'time'
]
self.assertListEqual(output, expected)
def _buildDummySequence(self):
seq = data.SequenceWrapper()
for i in range(10):
seq.add_timestep().set_token(i)
return seq
def testBuildLMSeq(self):
seq = self._buildDummySequence()
lm_seq = data.build_lm_sequence(seq)
for i, ts in enumerate(lm_seq):
# For the end of the sequence, the token and label should be the same, and
# the weight should be 0.0.
if i == len(lm_seq) - 1:
self.assertEqual(ts.token, i)
self.assertEqual(ts.label, i)
self.assertEqual(ts.weight, 0.0)
else:
self.assertEqual(ts.token, i)
self.assertEqual(ts.label, i + 1)
self.assertEqual(ts.weight, 1.0)
def testBuildSAESeq(self):
seq = self._buildDummySequence()
sa_seq = data.build_seq_ae_sequence(seq)
self.assertEqual(len(sa_seq), len(seq) * 2 - 1)
# Tokens should be the sequence twice, minus the EOS token at the end
for i, ts in enumerate(sa_seq):
self.assertEqual(ts.token, seq[i % 10].token)
# Weights should be len-1 0.0's and len 1.0's.
for i in range(len(seq) - 1):
self.assertEqual(sa_seq[i].weight, 0.0)
for i in range(len(seq) - 1, len(sa_seq)):
self.assertEqual(sa_seq[i].weight, 1.0)
# Labels should be len-1 0's, and then the sequence
for i in range(len(seq) - 1):
self.assertEqual(sa_seq[i].label, 0)
for i in range(len(seq) - 1, len(sa_seq)):
self.assertEqual(sa_seq[i].label, seq[i - (len(seq) - 1)].token)
def testBuildLabelSeq(self):
seq = self._buildDummySequence()
eos_id = len(seq) - 1
label_seq = data.build_labeled_sequence(seq, True)
for i, ts in enumerate(label_seq[:-1]):
self.assertEqual(ts.token, i)
self.assertEqual(ts.label, 0)
self.assertEqual(ts.weight, 0.0)
final_timestep = label_seq[-1]
self.assertEqual(final_timestep.token, eos_id)
self.assertEqual(final_timestep.label, 1)
self.assertEqual(final_timestep.weight, 1.0)
def testBuildBidirLabelSeq(self):
seq = self._buildDummySequence()
reverse_seq = data.build_reverse_sequence(seq)
bidir_seq = data.build_bidirectional_seq(seq, reverse_seq)
label_seq = data.build_labeled_sequence(bidir_seq, True)
for (i, ts), j in zip(
enumerate(label_seq[:-1]), reversed(range(len(seq) - 1))):
self.assertAllEqual(ts.tokens, [i, j])
self.assertEqual(ts.label, 0)
self.assertEqual(ts.weight, 0.0)
final_timestep = label_seq[-1]
eos_id = len(seq) - 1
self.assertAllEqual(final_timestep.tokens, [eos_id, eos_id])
self.assertEqual(final_timestep.label, 1)
self.assertEqual(final_timestep.weight, 1.0)
def testReverseSeq(self):
seq = self._buildDummySequence()
reverse_seq = data.build_reverse_sequence(seq)
for i, ts in enumerate(reversed(reverse_seq[:-1])):
self.assertEqual(ts.token, i)
self.assertEqual(ts.label, 0)
self.assertEqual(ts.weight, 0.0)
final_timestep = reverse_seq[-1]
eos_id = len(seq) - 1
self.assertEqual(final_timestep.token, eos_id)
self.assertEqual(final_timestep.label, 0)
self.assertEqual(final_timestep.weight, 0.0)
def testBidirSeq(self):
seq = self._buildDummySequence()
reverse_seq = data.build_reverse_sequence(seq)
bidir_seq = data.build_bidirectional_seq(seq, reverse_seq)
for (i, ts), j in zip(
enumerate(bidir_seq[:-1]), reversed(range(len(seq) - 1))):
self.assertAllEqual(ts.tokens, [i, j])
self.assertEqual(ts.label, 0)
self.assertEqual(ts.weight, 0.0)
final_timestep = bidir_seq[-1]
eos_id = len(seq) - 1
self.assertAllEqual(final_timestep.tokens, [eos_id, eos_id])
self.assertEqual(final_timestep.label, 0)
self.assertEqual(final_timestep.weight, 0.0)
def testLabelGain(self):
seq = self._buildDummySequence()
label_seq = data.build_labeled_sequence(seq, True, label_gain=True)
for i, ts in enumerate(label_seq):
self.assertEqual(ts.token, i)
self.assertEqual(ts.label, 1)
self.assertNear(ts.weight, float(i) / (len(seq) - 1), 1e-3)
if __name__ == '__main__':
tf.test.main()
# Copyright 2017 Google Inc. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""Input readers and document/token generators for datasets."""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
from collections import namedtuple
import csv
import os
import random
# Dependency imports
import tensorflow as tf
from data import data_utils
flags = tf.app.flags
FLAGS = flags.FLAGS
flags.DEFINE_string('dataset', '', 'Which dataset to generate data for')
# Preprocessing config
flags.DEFINE_boolean('output_unigrams', True, 'Whether to output unigrams.')
flags.DEFINE_boolean('output_bigrams', False, 'Whether to output bigrams.')
flags.DEFINE_boolean('output_char', False, 'Whether to output characters.')
flags.DEFINE_boolean('lowercase', True, 'Whether to lowercase document terms.')
# IMDB
flags.DEFINE_string('imdb_input_dir', '', 'The input directory containing the '
'IMDB sentiment dataset.')
flags.DEFINE_integer('imdb_validation_pos_start_id', 10621, 'File id of the '
'first file in the pos sentiment validation set.')
flags.DEFINE_integer('imdb_validation_neg_start_id', 10625, 'File id of the '
'first file in the neg sentiment validation set.')
# DBpedia
flags.DEFINE_string('dbpedia_input_dir', '',
'Path to DBpedia directory containing train.csv and '
'test.csv.')
# Reuters Corpus (rcv1)
flags.DEFINE_string('rcv1_input_dir', '',
'Path to rcv1 directory containing train.csv, unlab.csv, '
'and test.csv.')
# Rotten Tomatoes
flags.DEFINE_string('rt_input_dir', '',
'The Rotten Tomatoes dataset input directory.')
# The Amazon reviews input file to use with either the RT or IMDB datasets.
flags.DEFINE_string('amazon_unlabeled_input_file', '',
'The unlabeled Amazon Reviews dataset input file. If set, '
'the input file is used to augment RT and IMDB vocab.')
Document = namedtuple('Document',
'content is_validation is_test label add_tokens')
def documents(dataset='train',
include_unlabeled=False,
include_validation=False):
"""Generates Documents based on FLAGS.dataset.
Args:
dataset: str, identifies folder within IMDB data directory, test or train.
include_unlabeled: bool, whether to include the unsup directory. Only valid
when dataset=train.
include_validation: bool, whether to include validation data.
Yields:
Document
Raises:
ValueError: if include_unlabeled is true but dataset is not 'train'
"""
if include_unlabeled and dataset != 'train':
raise ValueError('If include_unlabeled=True, must use train dataset')
# Set the random seed so that we have the same validation set when running
# gen_data and gen_vocab.
random.seed(302)
ds = FLAGS.dataset
if ds == 'imdb':
docs_gen = imdb_documents
elif ds == 'dbpedia':
docs_gen = dbpedia_documents
elif ds == 'rcv1':
docs_gen = rcv1_documents
elif ds == 'rt':
docs_gen = rt_documents
else:
raise ValueError('Unrecognized dataset %s' % FLAGS.dataset)
for doc in docs_gen(dataset, include_unlabeled, include_validation):
yield doc
def tokens(doc):
"""Given a Document, produces character or word tokens.
Tokens can be either characters, or word-level tokens (unigrams and/or
bigrams).
Args:
doc: Document to produce tokens from.
Yields:
token
Raises:
ValueError: if all FLAGS.{output_unigrams, output_bigrams, output_char}
are False.
"""
if not (FLAGS.output_unigrams or FLAGS.output_bigrams or FLAGS.output_char):
raise ValueError(
'At least one of {FLAGS.output_unigrams, FLAGS.output_bigrams, '
'FLAGS.output_char} must be true')
content = doc.content.strip()
if FLAGS.lowercase:
content = content.lower()
if FLAGS.output_char:
for char in content:
yield char
else:
tokens_ = data_utils.split_by_punct(content)
for i, token in enumerate(tokens_):
if FLAGS.output_unigrams:
yield token
if FLAGS.output_bigrams:
previous_token = (tokens_[i - 1] if i > 0 else data_utils.EOS_TOKEN)
bigram = '_'.join([previous_token, token])
yield bigram
if (i + 1) == len(tokens_):
bigram = '_'.join([token, data_utils.EOS_TOKEN])
yield bigram
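# Worked example (comment only): with output_unigrams and output_bigrams both
# enabled and lowercase=True, a Document whose content is "Good movie!" yields
#
#   'good', '</s>_good', 'movie', 'good_movie', 'movie_</s>'
#
# i.e. each unigram is followed by the bigram ending with it, and EOS_TOKEN
# pads the bigrams at both boundaries.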
def imdb_documents(dataset='train',
include_unlabeled=False,
include_validation=False):
"""Generates Documents for IMDB dataset.
Data from http://ai.stanford.edu/~amaas/data/sentiment/
Args:
dataset: str, identifies folder within IMDB data directory, test or train.
include_unlabeled: bool, whether to include the unsup directory. Only valid
when dataset=train.
include_validation: bool, whether to include validation data.
Yields:
Document
Raises:
ValueError: if FLAGS.imdb_input_dir is empty.
"""
if not FLAGS.imdb_input_dir:
raise ValueError('Must provide FLAGS.imdb_input_dir')
tf.logging.info('Generating IMDB documents...')
def check_is_validation(filename, class_label):
if class_label is None:
return False
file_idx = int(filename.split('_')[0])
is_pos_valid = (class_label and
file_idx >= FLAGS.imdb_validation_pos_start_id)
is_neg_valid = (not class_label and
file_idx >= FLAGS.imdb_validation_neg_start_id)
return is_pos_valid or is_neg_valid
dirs = [(dataset + '/pos', True), (dataset + '/neg', False)]
if include_unlabeled:
dirs.append(('train/unsup', None))
for d, class_label in dirs:
for filename in os.listdir(os.path.join(FLAGS.imdb_input_dir, d)):
is_validation = check_is_validation(filename, class_label)
if is_validation and not include_validation:
continue
with open(os.path.join(FLAGS.imdb_input_dir, d, filename), encoding='utf-8') as imdb_f:
content = imdb_f.read()
yield Document(
content=content,
is_validation=is_validation,
is_test=False,
label=class_label,
add_tokens=True)
if FLAGS.amazon_unlabeled_input_file and include_unlabeled:
with open(FLAGS.amazon_unlabeled_input_file, encoding='utf-8') as rt_f:
for content in rt_f:
yield Document(
content=content,
is_validation=False,
is_test=False,
label=None,
add_tokens=False)
def dbpedia_documents(dataset='train',
include_unlabeled=False,
include_validation=False):
"""Generates Documents for DBpedia dataset.
Dataset linked to at https://github.com/zhangxiangxiao/Crepe.
Args:
dataset: str, identifies the csv file within the DBpedia data directory,
test or train.
include_unlabeled: bool, unused.
include_validation: bool, whether to include validation data, which is a
randomly selected 10% of the data.
Yields:
Document
Raises:
ValueError: if FLAGS.dbpedia_input_dir is empty.
"""
del include_unlabeled
if not FLAGS.dbpedia_input_dir:
raise ValueError('Must provide FLAGS.dbpedia_input_dir')
tf.logging.info('Generating DBpedia documents...')
with open(os.path.join(FLAGS.dbpedia_input_dir, dataset + '.csv')) as db_f:
reader = csv.reader(db_f)
for row in reader:
# 10% of the data is randomly held out
is_validation = random.randint(1, 10) == 1
if is_validation and not include_validation:
continue
content = row[1] + ' ' + row[2]
yield Document(
content=content,
is_validation=is_validation,
is_test=False,
label=int(row[0]) - 1, # Labels should start from 0
add_tokens=True)
def rcv1_documents(dataset='train',
include_unlabeled=True,
include_validation=False):
# pylint:disable=line-too-long
"""Generates Documents for Reuters Corpus (rcv1) dataset.
Dataset described at
http://www.ai.mit.edu/projects/jmlr/papers/volume5/lewis04a/lyrl2004_rcv1v2_README.htm
Args:
dataset: str, identifies the csv file within the rcv1 data directory.
include_unlabeled: bool, whether to include the unlab file. Only valid
when dataset=train.
include_validation: bool, whether to include validation data, which is a
randomly selected 10% of the data.
Yields:
Document
Raises:
ValueError: if FLAGS.rcv1_input_dir is empty.
"""
# pylint:enable=line-too-long
if not FLAGS.rcv1_input_dir:
raise ValueError('Must provide FLAGS.rcv1_input_dir')
tf.logging.info('Generating rcv1 documents...')
datasets = [dataset]
if include_unlabeled:
if dataset == 'train':
datasets.append('unlab')
for dset in datasets:
with open(os.path.join(FLAGS.rcv1_input_dir, dset + '.csv')) as db_f:
reader = csv.reader(db_f)
for row in reader:
# 10% of the data is randomly held out
is_validation = random.randint(1, 10) == 1
if is_validation and not include_validation:
continue
content = row[1]
yield Document(
content=content,
is_validation=is_validation,
is_test=False,
label=int(row[0]),
add_tokens=True)
def rt_documents(dataset='train',
include_unlabeled=True,
include_validation=False):
# pylint:disable=line-too-long
"""Generates Documents for the Rotten Tomatoes dataset.
Dataset available at http://www.cs.cornell.edu/people/pabo/movie-review-data/
In this dataset, Amazon reviews are used for the unlabeled data.
Args:
dataset: str, identifies the data subdirectory.
include_unlabeled: bool, whether to include the unlabeled data. Only valid
when dataset=train.
include_validation: bool, whether to include validation data, which is a
randomly selected 10% of the data.
Yields:
Document
Raises:
ValueError: if FLAGS.rt_input_dir is empty.
"""
# pylint:enable=line-too-long
if not FLAGS.rt_input_dir:
raise ValueError('Must provide FLAGS.rt_input_dir')
tf.logging.info('Generating rt documents...')
data_files = []
input_filenames = os.listdir(FLAGS.rt_input_dir)
for inp_fname in input_filenames:
if inp_fname.endswith('.pos'):
data_files.append((os.path.join(FLAGS.rt_input_dir, inp_fname), True))
elif inp_fname.endswith('.neg'):
data_files.append((os.path.join(FLAGS.rt_input_dir, inp_fname), False))
if include_unlabeled and FLAGS.amazon_unlabeled_input_file:
data_files.append((FLAGS.amazon_unlabeled_input_file, None))
for filename, class_label in data_files:
with open(filename) as rt_f:
for content in rt_f:
if class_label is None:
# Process Amazon Review data for unlabeled dataset
if content.startswith('review/text'):
yield Document(
content=content,
is_validation=False,
is_test=False,
label=None,
add_tokens=False)
else:
# 10% of the data is randomly held out for the validation set and
# another 10% of it is randomly held out for the test set
random_int = random.randint(1, 10)
is_validation = random_int == 1
is_test = random_int == 2
if (is_test and dataset != 'test') or (is_validation and
not include_validation):
continue
yield Document(
content=content,
is_validation=is_validation,
is_test=is_test,
label=class_label,
add_tokens=True)
# Copyright 2017 Google Inc. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""Pretrains a recurrent language model.
Computational time: about 2 days to train 100,000 steps with a 1-layer LSTM
(1024 hidden units), 256-dimensional embeddings, truncated backpropagation
through 400 timesteps, and minibatches of 256, on a single GPU (Pascal
Titan X, cuDNN v5).
"""
from __future__ import absolute_import
from __future__ import division
from __future__ import print_function
# Dependency imports
import tensorflow as tf
import graphs
import train_utils
FLAGS = tf.app.flags.FLAGS
def main(_):
"""Trains Language Model."""
tf.logging.set_verbosity(tf.logging.INFO)
with tf.device(tf.train.replica_device_setter(FLAGS.ps_tasks)):
model = graphs.get_model()
train_op, loss, global_step = model.language_model_training()
train_utils.run_training(train_op, loss, global_step)
if __name__ == '__main__':
tf.app.run()