Commit 85b50c88 authored by Chen Chen, committed by A. Unique TensorFlower

Add TalkingHeadsAttention to README.

PiperOrigin-RevId: 307082084
parent f499e880
@@ -14,6 +14,9 @@ If `from_tensor` and `to_tensor` are the same, then this is self-attention.
 * [CachedAttention](attention.py) implements an attention layer with cache used
   for auto-regressive decoding.
+* [TalkingHeadsAttention](talking_heads_attention.py) implements talking-heads
+  attention, as described in ["Talking-Heads Attention"](https://arxiv.org/abs/2003.02436).
 * [Transformer](transformer.py) implements an optionally masked transformer as
   described in ["Attention Is All You Need"](https://arxiv.org/abs/1706.03762).
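The core idea of talking-heads attention, per the linked paper, is to insert learned linear projections that mix information across attention heads both before and after the softmax. The following is a minimal NumPy sketch of that mechanism, not the actual `TalkingHeadsAttention` layer API; the function name, argument layout, and mixing-matrix shapes are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def talking_heads_attention(q, k, v, proj_logits, proj_weights):
    """Sketch of talking-heads attention (hypothetical signature).

    q, k, v: [heads, seq, dim] per-head queries, keys, values.
    proj_logits, proj_weights: [heads, heads] learned mixing matrices
    applied across the head dimension.
    """
    # Standard scaled dot-product attention logits per head.
    logits = np.einsum('hqd,hkd->hqk', q, k) / np.sqrt(q.shape[-1])
    # Talking heads: mix logits across heads before the softmax...
    logits = np.einsum('hqk,hH->Hqk', logits, proj_logits)
    weights = softmax(logits, axis=-1)
    # ...and mix the attention weights across heads after the softmax.
    weights = np.einsum('hqk,hH->Hqk', weights, proj_weights)
    return np.einsum('hqk,hkd->hqd', weights, v)
```

With identity mixing matrices this reduces to ordinary multi-head attention, which is a convenient sanity check on the sketch.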