"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "273617b86dbe5cd15afb795e994dffc44e09e2df"
Allow tokenization of sequences > 512 for caching
For many applications that need randomized data access, it is easier to cache the tokenized representations than the raw words. So why not turn the length check for sequences longer than 512 tokens into a warning instead of an error?
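A minimal sketch of the proposed behavior, not the actual diff: the function name `convert_tokens_to_ids`, the `vocab` dictionary, and the `max_len` parameter are assumptions chosen for illustration. The point is that an over-long sequence is still converted and returned, so it can be cached, and the caller only gets a logged warning.

```python
import logging

logger = logging.getLogger(__name__)


def convert_tokens_to_ids(tokens, vocab, max_len=512):
    """Map tokens to vocabulary ids (hypothetical helper).

    Sequences longer than max_len are NOT rejected: a warning is logged
    and the full id list is returned so that it can still be cached.
    """
    ids = [vocab[token] for token in tokens]
    if len(ids) > max_len:
        # Warn instead of raising, so tokenization for caching still succeeds.
        logger.warning(
            "Token sequence length exceeds the maximum for this model "
            "(%d > %d). Truncate or split it before running the model.",
            len(ids),
            max_len,
        )
    return ids
```

With this approach, a caller that caches token ids is responsible for truncating or windowing each sequence to at most `max_len` ids before passing it to the model.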