[OCR-VQA](https://drive.google.com/drive/folders/1_GYPY5UkUy7HIcR0zq3ZCFgeZN7BAfm_?usp=sharing)(we save all files as .jpg) -> data/MGM-Finetune/ocr_vqa
[TextVQA](https://dl.fbaipublicfiles.com/textvqa/images/train_val_images.zip)(not included for training) -> data/MGM-Finetune/textvqa