Merge pull request #2167 from alexgorban/UpdateReadme

Update Attention OCR README.md to point to a fixed checkpoint file.

a03b3638 · Martin Wicke · GitHub · 36203f09 · 62c3cab2 · a03b3638
Commit a03b3638 authored Aug 09, 2017 by Martin Wicke; committed by GitHub Aug 09, 2017

Showing with 20 additions and 5 deletions

attention_ocr/README.md +20 -5

--- a/attention_ocr/README.md
+++ b/attention_ocr/README.md
@@ -71,8 +71,8 @@ python train.py --checkpoint_inception=./inception_v3.ckpt
 To fine tune the Attention OCR model using a checkpoint:
 ```
-wget http://download.tensorflow.org/models/attention_ocr_2017_05_17.tar.gz
+wget http://download.tensorflow.org/models/attention_ocr_2017_08_09.tar.gz
-tar xf attention_ocr_2017_05_17.tar.gz
+tar xf attention_ocr_2017_08_09.tar.gz
 python train.py --checkpoint=model.ckpt-399731
 ```
@@ -160,17 +160,32 @@ as the example.
 The inference part was not released yet, but it is pretty straightforward to
 implement one in Python or C++.
-The recommended way is to use the [Serving infrastructure](https://tensorflow.github.io/serving/serving_basic).
+The recommended way is to use the [Serving infrastructure][serving].
 Alternatively you can:
 1. define a placeholder for images (or use directly an numpy array)
 2. [create a graph ](https://github.com/tensorflow/models/blob/master/attention_ocr/python/eval.py#L60)
-`endpoints = model.create_base(images_placeholder, labels_one_hot=None)`
+```
+endpoints = model.create_base(images_placeholder, labels_one_hot=None)
+```
 3. [load a pretrained model](https://github.com/tensorflow/models/blob/master/attention_ocr/python/model.py#L494)
 4. run computations through the graph:
-`predictions = sess.run(endpoints.predicted_chars, feed_dict={images_placeholder:images_actual_data})`
+```
+predictions = sess.run(endpoints.predicted_chars, 
+                       feed_dict={images_placeholder:images_actual_data})
+```
 5. Convert character IDs (predictions) to UTF8 using the provided charset file.
+Please note that tensor names may change overtime and old stored checkpoints can
+become unloadable. In many cases such backward incompatible changes can be
+fixed with a [string substitution][1] to update the checkpoint itself or using a
+custom var_list with [assign_from_checkpoint_fn][2]. For anything
+other than a one time experiment please use the [TensorFlow Serving][serving].
+[1]: https://github.com/tensorflow/tensorflow/blob/aaf7adc/tensorflow/contrib/rnn/python/tools/checkpoint_convert.py
+[2]: https://www.tensorflow.org/api_docs/python/tf/contrib/framework/assign_from_checkpoint_fn
+[serving]: https://tensorflow.github.io/serving/serving_basic
 ## Disclaimer
 This code is a modified version of the internal model we used for our paper.
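
Step 5 of the inference recipe in the hunk above (converting predicted character IDs to text with the charset file) is the only step that needs no TensorFlow, so it can be sketched in isolation. The following is a minimal Python sketch, not the repository's code: `read_charset` and `decode_predictions` are hypothetical helper names, and the charset format (one `id<TAB>character` pair per line, with a `<nul>` token marking padding) is an assumption that should be checked against the charset file actually shipped with the dataset.

```python
import re

# Assumed placeholder used internally for the padding token.
NULL_CHARACTER = "\u2591"


def read_charset(lines, null_character=NULL_CHARACTER):
    """Parse 'id<TAB>char' lines into a dict {id: char}.

    The line format and the '<nul>' token are assumptions about the
    charset file; lines that do not match are skipped.
    """
    pattern = re.compile(r"(\d+)\t(.+)")
    charset = {}
    for line in lines:
        match = pattern.match(line)
        if match is None:
            continue
        code, char = int(match.group(1)), match.group(2)
        if char == "<nul>":
            char = null_character
        charset[code] = char
    return charset


def decode_predictions(predictions, charset, null_character=NULL_CHARACTER):
    """Map each row of character IDs to a string and drop padding."""
    texts = []
    for row in predictions:
        # Unknown IDs are rendered as '?' rather than raising.
        text = "".join(charset.get(int(i), "?") for i in row)
        texts.append(text.replace(null_character, ""))
    return texts


# Toy charset and toy predictions, for illustration only.
charset = read_charset(["0\t<nul>", "1\tR", "2\tu", "3\te"])
decoded = decode_predictions([[1, 2, 3, 0, 0]], charset)
print(decoded)
```

With real model output, `predictions` would be the array returned by `sess.run(endpoints.predicted_chars, ...)` in step 4, and the lines passed to `read_charset` would come from opening the charset file with UTF-8 encoding.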