This is an implementation of the Transformer translation model as described in the [Attention is All You Need](https://arxiv.org/abs/1706.03762) paper. It is based on the code provided by the authors: [Transformer code](https://github.com/tensorflow/tensor2tensor/blob/master/tensor2tensor/models/transformer.py) from [Tensor2Tensor](https://github.com/tensorflow/tensor2tensor). Also check out the [tutorial](https://www.tensorflow.org/alpha/tutorials/sequences/transformer) on the Transformer in TF 2.0.
The Transformer is a neural network architecture that solves sequence-to-sequence problems using attention mechanisms. Unlike traditional neural seq2seq models, the Transformer has no recurrent connections: an attention mechanism learns the dependencies between tokens in the two sequences directly. Because the attention weights are computed over all tokens in a sequence, the Transformer can capture long-distance dependencies as easily as short ones; see the sketch below.
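To make this concrete, here is a minimal sketch of the scaled dot-product attention at the core of the model, following the paper's formula `softmax(QK^T / sqrt(d_k)) V`. It is for illustration only and is not the code used in this repository; the `scaled_dot_product_attention` helper name is our own.

```python
import tensorflow as tf


def scaled_dot_product_attention(q, k, v):
  """Attend every query position to every key position.

  q, k, v: float tensors of shape [batch, seq_len, depth].
  Returns the attended values and the attention weight matrix.
  """
  # Similarity between each query and each key: shape [batch, seq_q, seq_k].
  logits = tf.matmul(q, k, transpose_b=True)
  # Scale by sqrt(depth) so the softmax stays in a well-conditioned range.
  depth = tf.cast(tf.shape(k)[-1], tf.float32)
  logits = logits / tf.math.sqrt(depth)
  # Every token gets a weight for every other token, so a dependency of any
  # distance is just one weighted sum away -- no recurrence needed.
  weights = tf.nn.softmax(logits, axis=-1)
  return tf.matmul(weights, v), weights


# Toy self-attention example: one batch, four tokens, depth 8.
x = tf.random.normal([1, 4, 8])
output, weights = scaled_dot_product_attention(x, x, x)
print(weights.shape)  # (1, 4, 4): each token attends to all four tokens
```

In the full model this operation runs in parallel over several heads (multi-head attention), with learned projections producing the queries, keys, and values.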