# TN-BERT (TensorNetwork BERT)
TN-BERT is a modification of the BERT-base architecture that substantially
compresses the original BERT model using tensor networks. The dense feedforward
layers are replaced with Expand/Condense tensor-network (TN) layers tuned to
the TPU architecture.
This work is based on research conducted during the development of the
[TensorNetwork](https://arxiv.org/abs/1905.01330) Library. Check it out on
[github](https://github.com/google/TensorNetwork).
TN-BERT achieves the following improvements:

* 69M parameters, or 37% fewer than the original BERT-base.
* 22% faster inference than the baseline model on TPUs.
* Pre-training time under 8 hours on an 8x8 pod of TPUs.
* 15% less energy consumption by accelerators.
For more information, see the TF Hub model page
[here](https://tfhub.dev/tensorflow/tn_bert/1).
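The published encoder can be loaded like other TF Hub text encoders. Below is a
minimal sketch; the input/output signature (key names, sequence length, output
dict) is an assumption based on the standard TF Hub BERT encoder interface, so
consult the model page above for the authoritative details.

```python
import tensorflow as tf
import tensorflow_hub as hub

# Load the published TN-BERT encoder (URL from the model page above).
encoder = hub.KerasLayer("https://tfhub.dev/tensorflow/tn_bert/1",
                         trainable=True)

# Assumed standard BERT encoder inputs: token ids, attention mask, and
# segment ids, each of shape [batch, seq_len].
seq_len = 128
inputs = dict(
    input_word_ids=tf.keras.layers.Input(shape=(seq_len,), dtype=tf.int32),
    input_mask=tf.keras.layers.Input(shape=(seq_len,), dtype=tf.int32),
    input_type_ids=tf.keras.layers.Input(shape=(seq_len,), dtype=tf.int32),
)
outputs = encoder(inputs)  # typically a dict of pooled/sequence outputs
model = tf.keras.Model(inputs=inputs, outputs=outputs)
```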
### Implementation
The expand_condense and transformer layers are the only components that differ
from the reference BERT implementation. These layers can be viewed at:
* [tn_transformer_expand_condense.py](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/tn_transformer_expand_condense.py)
* [tn_expand_condense.py](https://github.com/tensorflow/models/blob/master/official/nlp/modeling/layers/tn_expand_condense.py)
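To give a feel for the idea without reproducing the exact decomposition, here
is an illustrative Keras layer: the usual dense feedforward pair
(D -> 4D -> D, roughly 8·D² parameters) is replaced by two small cores that
contract against a reshaped feature axis D = d1·d2. The factorization, shapes,
and names below are simplified assumptions for exposition, not the actual
layer in `tn_expand_condense.py`.

```python
import tensorflow as tf

class ExpandCondenseSketch(tf.keras.layers.Layer):
  """Illustrative expand/condense feedforward: small cores contracted
  against a reshaped feature axis instead of full dense kernels."""

  def __init__(self, d1, d2, expand_factor=4, activation="gelu", **kwargs):
    super().__init__(**kwargs)
    self.d1, self.d2, self.f = d1, d2, expand_factor
    self.activation = tf.keras.activations.get(activation)

  def build(self, input_shape):
    assert int(input_shape[-1]) == self.d1 * self.d2
    # "Expand": lift the d2 factor to d2 * f (output dim d1 * d2 * f).
    self.expand_core = self.add_weight(
        name="expand_core", shape=(self.d2, self.d2 * self.f),
        initializer="glorot_uniform")
    # "Condense": contract the expanded factor back down to d2.
    self.condense_core = self.add_weight(
        name="condense_core", shape=(self.d2 * self.f, self.d2),
        initializer="glorot_uniform")
    self.bias = self.add_weight(
        name="bias", shape=(self.d1 * self.d2,), initializer="zeros")

  def call(self, x):
    batch_shape = tf.shape(x)[:-1]
    # Split the feature axis D into two factors (d1, d2).
    h = tf.reshape(x, tf.concat([batch_shape, [self.d1, self.d2]], axis=0))
    h = tf.einsum("...ab,bc->...ac", h, self.expand_core)    # expand
    h = self.activation(h)
    h = tf.einsum("...ac,cb->...ab", h, self.condense_core)  # condense
    h = tf.reshape(h, tf.concat([batch_shape, [self.d1 * self.d2]], axis=0))
    return h + self.bias

# Example: BERT-base hidden size 768 = 12 * 64. The two cores hold about
# 2 * 4 * 64^2 = 33K weights versus ~4.7M for the dense feedforward pair.
layer = ExpandCondenseSketch(d1=12, d2=64)
y = layer(tf.random.normal([2, 128, 768]))  # -> shape [2, 128, 768]
```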