add cache_dir for tokenizer verification loading (#14508)
When loading a pretrained tokenizer, a verification step ensures that the actual tokenizer class matches the class it was called from. If the tokenizer is absent, its config file is loaded from the repo. However, cache_dir is not passed to that download, so the user-specified cache_dir is ignored, files end up stored in several places, and incorrect warnings may appear when the default cache_dir is unreachable. This commit fixes that.
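The effect of forwarding cache_dir can be illustrated with a minimal sketch. It does not use the transformers code itself: `fetch_tokenizer_config`, `verify_tokenizer_class`, `DEFAULT_CACHE_DIR`, the `forward_cache_dir` switch, and the repo and path names are all hypothetical stand-ins, chosen only to show where the config file would land with and without the argument.

```python
from typing import Optional, Tuple

# Hypothetical default location, used when no cache_dir reaches the download step.
DEFAULT_CACHE_DIR = "~/.cache/default"


def fetch_tokenizer_config(repo_id: str, cache_dir: Optional[str] = None) -> dict:
    """Hypothetical stand-in for the config download.

    Nothing is downloaded; the function only reports which directory
    the config file would have been stored in.
    """
    used_dir = cache_dir if cache_dir is not None else DEFAULT_CACHE_DIR
    return {"tokenizer_class": "BertTokenizer", "stored_in": used_dir}


def verify_tokenizer_class(
    repo_id: str,
    expected_class: str,
    cache_dir: Optional[str] = None,
    forward_cache_dir: bool = True,
) -> Tuple[bool, str]:
    """Check the configured tokenizer class against the calling class.

    forward_cache_dir=False mimics the behaviour described above,
    where the user's cache_dir never reaches the download.
    """
    config = fetch_tokenizer_config(
        repo_id, cache_dir=cache_dir if forward_cache_dir else None
    )
    return config["tokenizer_class"] == expected_class, config["stored_in"]


# Without forwarding, the user's directory is silently ignored.
before = verify_tokenizer_class(
    "some/repo", "BertTokenizer", cache_dir="/data/hf", forward_cache_dir=False
)
# With forwarding, the config is stored where the user asked.
after = verify_tokenizer_class("some/repo", "BertTokenizer", cache_dir="/data/hf")

print(before)  # (True, '~/.cache/default')
print(after)   # (True, '/data/hf')
```

In both calls the class check itself passes; only the storage location differs, which is why the problem shows up as scattered files and misleading warnings rather than as a failed load.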