ModelZoo / ResNet50_tensorflow
Commit e9211fca, authored May 17, 2017 by Alexander Gorban
attention_ocr# Update checkpoint and instructions.
Parent: 71bf3d47
Showing 1 changed file with 20 additions and 7 deletions.
attention_ocr/README.md (+20 -7)
@@ -23,10 +23,22 @@ Pull requests:
 ## Requirements

 1. Installed TensorFlow library ([instructions][TF]).
+
+```
+virtualenv --system-site-packages ~/.tensorflow
+source ~/.tensorflow/bin/activate
+pip install --upgrade pip
+pip install --upgrade tensorflow_gpu
+```
+
 2. At least 158Gb of free disk space to download FSNS dataset:

+Assume your current directory is `models/attention_ocr/python`
+
 ```
-aria2c -c -j 20 -i ../street/python/fsns_urls.txt
+cd datasets
+aria2c -c -j 20 -i ../../../street/python/fsns_urls.txt
+cd -
 ```

 3. 16Gb of RAM or more, 32Gb is recommended.
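
The disk and memory requirements listed above can be checked before starting the FSNS download. The following is a minimal sketch, not part of this commit: the helper names are hypothetical, "Gb" is assumed to mean decimal gigabytes (10^9 bytes), and the RAM probe relies on `os.sysconf`, which is only available on Unix-like systems.

```python
import os
import shutil

REQUIRED_DISK_GB = 158  # requirement 2 in the README
MIN_RAM_GB = 16         # requirement 3 in the README (32 is recommended)


def free_disk_gb(path="."):
    """Free space on the filesystem containing `path`, in GB (assumed 10**9 bytes)."""
    return shutil.disk_usage(path).free / 1e9


def has_enough_disk(path=".", required_gb=REQUIRED_DISK_GB):
    """True if the filesystem holding `path` has at least `required_gb` GB free."""
    return free_disk_gb(path) >= required_gb


def total_ram_gb():
    """Total physical RAM in GB, or None where os.sysconf cannot report it."""
    try:
        return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") / 1e9
    except (AttributeError, ValueError, OSError):
        return None


if __name__ == "__main__":
    print("enough disk for FSNS:", has_enough_disk("."))
    ram = total_ram_gb()
    if ram is None:
        print("RAM size could not be determined on this platform")
    else:
        print("enough RAM:", ram >= MIN_RAM_GB)
```

Run it from the directory that will hold the dataset (the diff uses `datasets`), since free space is reported per filesystem.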
@@ -60,16 +72,17 @@ python train.py --checkpoint_inception=inception_v3.ckpt
 To fine tune the Attention OCR model using a checkpoint:

 ```
-wget http://download.tensorflow.org/models/attention_ocr_2017_05_01.tar.gz
-tar xf attention_ocr_2017_05_01.tar.gz
-python train.py --checkpoint=model.ckpt-232572
+wget http://download.tensorflow.org/models/attention_ocr_2017_05_17.tar.gz
+tar xf attention_ocr_2017_05_17.tar.gz
+python train.py --checkpoint=model.ckpt-399731
 ```

 ## Disclaimer

 This code is a modified version of the internal model we used for our paper.
-Currently it reaches 82.71% full sequence accuracy after 215k steps of training.
+Currently it reaches 83.79% full sequence accuracy after 400k steps of training.
 The main difference between this version and the version used in the paper - for
 the paper we used a distributed training with 50 GPU (K80) workers (asynchronous
-updates), the provided checkpoint was created using this code after ~60 hours of
-training on a single GPU (Titan X).
+updates), the provided checkpoint was created using this code after ~6 days of
+training on a single GPU (Titan X) (it reached 81% after 24 hours of training),
+the coordinate encoding is missing TODO(alexgorban@).
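
Because the checkpoint step number changes with each published archive (232572 before this commit, 399731 after), it can help to confirm which prefix the extracted archive actually contains before passing it to `--checkpoint`. The sketch below is an illustration only: `find_checkpoint_prefixes` is a hypothetical helper, and it assumes the archive stores checkpoints in TensorFlow's V2 format, where each prefix has a matching `.index` file. That layout is not confirmed by the diff.

```python
import glob
import os
import re


def find_checkpoint_prefixes(directory="."):
    """Return (step, prefix) pairs for model.ckpt-<step> checkpoints, sorted by step.

    Assumes a V2-style layout: model.ckpt-<step>.index next to the data shards.
    """
    found = []
    for index_path in glob.glob(os.path.join(directory, "model.ckpt-*.index")):
        prefix = index_path[: -len(".index")]  # value to pass to --checkpoint
        match = re.search(r"model\.ckpt-(\d+)$", prefix)
        if match:
            found.append((int(match.group(1)), prefix))
    return sorted(found)


if __name__ == "__main__":
    for step, prefix in find_checkpoint_prefixes("."):
        print(step, prefix)
```

If the list comes back empty after `tar xf`, the archive probably uses a different layout and the prefix from the README should be used as written.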