"But this approach has severly-limited scalability. For larger data it should be streamed off disk.\n",
"But this approach has severly-limited scalability. For larger data it should be streamed off disk.\n",
"the `census_dataset.input_fn` provides an example of how to do this using `tf.decode_csv` and `tf.data.TextLineDataset`: \n",
"The `census_dataset.input_fn` provides an example of how to do this using `tf.decode_csv` and `tf.data.TextLineDataset`: \n",
"\n",
"\n",
"TODO(markdaoust): This `input_fn` should use `tf.contrib.data.make_csv_dataset`"
"<!-- TODO(markdaoust): This `input_fn` should use `tf.contrib.data.make_csv_dataset` -->"
]
]
},
},
{
{
...
@@ -470,7 +525,7 @@
"\n",
"\n",
"Estimators use a system called `feature_columns` to describe how the model\n",
"Estimators use a system called `feature_columns` to describe how the model\n",
"should interpret each of the raw input features. An Estimator exepcts a vector\n",
"should interpret each of the raw input features. An Estimator exepcts a vector\n",
"of numeric inputs, and feature columns describe how the model shoukld convert\n",
"of numeric inputs, and feature columns describe how the model should convert\n",
"each feature.\n",
"each feature.\n",
"\n",
"\n",
"Selecting and crafting the right set of feature columns is key to learning an\n",
"Selecting and crafting the right set of feature columns is key to learning an\n",
...
@@ -752,7 +807,7 @@
},
},
"cell_type": "markdown",
"cell_type": "markdown",
"source": [
"source": [
"if we run `input_layer` with the hashed column we see that the output shape is `(batch_size, hash_bucket_size)`"
"If we run `input_layer` with the hashed column we see that the output shape is `(batch_size, hash_bucket_size)`"
]
]
},
},
{
{
...
@@ -1259,11 +1314,24 @@
"\n",
"\n",
"For more about estimators:\n",
"For more about estimators:\n",
"\n",
"\n",
"- The [TensorFlow Hub transfer-learning tutorial](https://www.tensorflow.org/hub/tutorials/text_classification_with_tf_hub)\n",
"- The [TensorFlow Hub text classification tutorial](https://www.tensorflow.org/hub/tutorials/text_classification_with_tf_hub) uses `hub.text_embedding_column` to easily ingest free form text. \n",
"- The [Gradient-boosted-trees estimator tutorial](https://github.com/tensorflow/models/tree/master/official/boosted_trees)\n",
"- The [Gradient-boosted-trees estimator tutorial](https://github.com/tensorflow/models/tree/master/official/boosted_trees)\n",
"- This [blog post]( https://medium.com/tensorflow/classifying-text-with-tensorflow-estimators) on processing text with `Estimators`\n",
"- This [blog post]( https://medium.com/tensorflow/classifying-text-with-tensorflow-estimators) on processing text with `Estimators`\n",
"- How to [build a custom CNN estimator](https://www.tensorflow.org/tutorials/estimators/cnn)"
"- How to [build a custom CNN estimator](https://www.tensorflow.org/tutorials/estimators/cnn)"