"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "1cbb32a5428b6a3ad1fbc0de6b0709a63cfda903"
Feed forward chunking (#6024)
* Chunked feed forward for Bert
This is an initial implementation to test applying feed forward chunking for BERT.
Will need additional modifications based on output and benchmark results.
* Black and cleanup
* Feed forward chunking in BertLayer class.
* Isort
* add chunking for all models
* fix docs
* Fix typo
Co-authored-by:
patrickvonplaten <patrick.v.platen@gmail.com>
Showing
src/transformers/configuration_utils.py
100644 → 100755
src/transformers/modeling_bert.py
100644 → 100755
Please register or sign in to comment