Add scripts for working with txt files containing document boundaries
Summary: Pull Request resolved: https://github.com/pytorch/fairseq/pull/736 Differential Revision: D15314626 Pulled By: myleott fbshipit-source-id: 1e0c32529afee57e43fe5d6c7991cd13eb8a52c4
Showing
scripts/count_docs.py
0 → 100644
scripts/shard_docs.py
0 → 100644
Please register or sign in to comment