Unverified Commit 0a3b9733 authored by Zhiqi Huang, committed by GitHub

Add model_cards for DynaBERT (#8012)

* Update README.md

* Add dynabert_overview.png

* Update README.md

* Create README.md

* Add dynabert_overview.png

* Update README.md

* Update README.md

* Delete dynabert_overview.png

* Update README.md

* Delete dynabert_overview.png

* Update README.md
## DynaBERT: Dynamic BERT with Adaptive Width and Depth
* DynaBERT can flexibly adjust its size and latency by selecting an adaptive width and depth, and its sub-networks perform competitively with other compressed models of similar size. DynaBERT is trained in two stages: first a width-adaptive BERT is trained, and then both adaptive width and depth are enabled using knowledge distillation (see the sketch below).
* The results in the paper are produced using a single V100 GPU.
* This code is modified from the repository developed by Hugging Face: [Transformers v2.1.1](https://github.com/huggingface/transformers/tree/v2.1.1), and is released on [GitHub](https://github.com/huawei-noah/Pretrained-Language-Model/tree/master/DynaBERT).
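
As a rough illustration of the width/depth selection idea, here is a minimal sketch using the Hugging Face Transformers API. This is not the authors' training or rewiring code (DynaBERT sorts attention heads and neurons by importance before slicing, and also scales the FFN width, which this sketch skips), and it assumes the released checkpoint loads as a standard BERT classifier; the checkpoint id `huawei-noah/DynaBERT_MNLI` and the half-width/half-depth setting are illustrative assumptions.

```python
import torch
from transformers import BertForSequenceClassification, BertTokenizer

# Assumed checkpoint id; substitute the released DynaBERT checkpoint you are using.
model_name = "huawei-noah/DynaBERT_MNLI"
tokenizer = BertTokenizer.from_pretrained(model_name)
model = BertForSequenceClassification.from_pretrained(model_name)

# Adaptive depth: keep only the first k encoder layers.
depth_mult = 0.5
num_layers = len(model.bert.encoder.layer)
keep = int(num_layers * depth_mult)
model.bert.encoder.layer = model.bert.encoder.layer[:keep]
model.config.num_hidden_layers = keep

# Adaptive width: prune a subset of attention heads in each remaining layer
# (here the trailing half of the heads, purely for illustration).
num_heads = model.config.num_attention_heads
heads_to_prune = {i: list(range(num_heads // 2, num_heads)) for i in range(keep)}
model.prune_heads(heads_to_prune)

# Run the reduced sub-network on a sample input.
inputs = tokenizer("DynaBERT adapts width and depth.", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits
```

Slicing the leading layers and heads mirrors how a smaller sub-network shares parameters with the full model; in the paper, network rewiring places the most important heads and neurons first, so a slice of this kind keeps the best-performing components.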
### Reference
Lu Hou, Zhiqi Huang, Lifeng Shang, Xin Jiang, Qun Liu.
[DynaBERT: Dynamic BERT with Adaptive Width and Depth](https://arxiv.org/abs/2004.04037).
```
@inproceedings{hou2020dynabert,
  title     = {DynaBERT: Dynamic BERT with Adaptive Width and Depth},
  author    = {Lu Hou and Zhiqi Huang and Lifeng Shang and Xin Jiang and Qun Liu},
  booktitle = {NeurIPS},
  year      = {2020}
}
```