======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_selected_gpus', current_value='4', default_value='')
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
=======================================================================
I0727 11:19:01.148553 2865 tcp_utils.cc:107] Retry to connect to 127.0.0.1:52742 while the server is not yet listening.
I0727 11:19:04.148708 2865 tcp_utils.cc:130] Successfully connected to 127.0.0.1:52742
I0727 11:19:04.179193 2865 process_group_nccl.cc:120] ProcessGroupNCCL pg_timeout_ 1800000
W0727 11:19:04.196408 2865 gpu_resources.cc:119] Please NOTE: device: 4, GPU Compute Capability: 90.2, Driver API Version: 50724.2, Runtime API Version: 50724.2
eval model:: 0%| | 0/500 [00:00 >)

----------------------
Error Message Summary:
----------------------
FatalError: `Termination signal` is detected by the operating system.
  [TimeInfo: *** Aborted at 1722504078 (unix time) try "date -d @1722504078" if you are using GNU date ***]
  [SignalInfo: *** SIGTERM (@0x148) received by PID 419 (TID 0x7f8226fff640) from PID 328 ***]

======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
FLAGS(name='FLAGS_selected_gpus', current_value='4', default_value='')
=======================================================================
I0801 17:43:09.950397 746 tcp_utils.cc:107] Retry to connect to 10.8.145.246:62911 while the server is not yet listening.

--------------------------------------
C++ Traceback (most recent call last):
--------------------------------------
0 phi::distributed::CreateOrGetGlobalTCPStore()
1 phi::distributed::TCPStore::TCPStore(std::string, unsigned short, bool, unsigned long, int)
2 phi::distributed::detail::TCPClient::connect(std::string, unsigned short)
3 phi::distributed::tcputils::tcp_connect(std::string, std::string, int, std::chrono::duration >)

----------------------
Error Message Summary:
----------------------
FatalError: `Termination signal` is detected by the operating system.
  [TimeInfo: *** Aborted at 1722505391 (unix time) try "date -d @1722505391" if you are using GNU date ***]
  [SignalInfo: *** SIGTERM (@0x28f) received by PID 746 (TID 0x7f5f2bd8e740) from PID 655 ***]

======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
FLAGS(name='FLAGS_selected_gpus', current_value='4', default_value='')
=======================================================================
I0801 17:45:05.517232 1068 tcp_utils.cc:130] Successfully connected to 10.8.145.246:49295
I0801 17:45:05.558315 1068 process_group_nccl.cc:120] ProcessGroupNCCL pg_timeout_ 1800000
W0801 17:45:05.576678 1068 gpu_resources.cc:119] Please NOTE: device: 4, GPU Compute Capability: 90.2, Driver API Version: 50724.2, Runtime API Version: 50724.2
eval model:: 0%| | 0/500 [00:00
    from ppocr.data import build_dataloader
  File "/root/paddle_dbnet/ppocr/data/__init__.py", line 35, in <module>
    from ppocr.data.imaug import transform, create_operators
  File "/root/paddle_dbnet/ppocr/data/imaug/__init__.py", line 19, in <module>
    from .iaa_augment import IaaAugment
  File "/root/paddle_dbnet/ppocr/data/imaug/iaa_augment.py", line 24, in <module>
    import imgaug
  File "/root/anaconda3/envs/dbnet_test/lib/python3.10/site-packages/imgaug/__init__.py", line 7, in <module>
    from imgaug.imgaug import * # pylint: disable=redefined-builtin
  File "/root/anaconda3/envs/dbnet_test/lib/python3.10/site-packages/imgaug/imgaug.py", line 18, in <module>
    import cv2
ImportError: numpy.core.multiarray failed to import

======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_selected_gpus', current_value='4', default_value='')
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
=======================================================================
I0809 17:16:34.717710 262690 tcp_utils.cc:107] Retry to connect to 127.0.0.1:40029 while the server is not yet listening.
I0809 17:16:37.717845 262690 tcp_utils.cc:130] Successfully connected to 127.0.0.1:40029
I0809 17:16:37.747900 262690 process_group_nccl.cc:120] ProcessGroupNCCL pg_timeout_ 1800000
Traceback (most recent call last):
  File "/root/paddle_dbnet/tools/train.py", line 198, in <module>
    main(config, device, logger, vdl_writer)
  File "/root/paddle_dbnet/tools/train.py", line 53, in main
    train_dataloader = build_dataloader(config, 'Train', device, logger)
  File "/root/paddle_dbnet/ppocr/data/__init__.py", line 65, in build_dataloader
    dataset = eval(module_name)(config, mode, logger, seed)
  File "/root/paddle_dbnet/ppocr/data/simple_dataset.py", line 47, in __init__
    self.data_lines = self.get_image_info_list(label_file_list, ratio_list)
  File "/root/paddle_dbnet/ppocr/data/simple_dataset.py", line 61, in get_image_info_list
    with open(file, "rb") as f:
FileNotFoundError: [Errno 2] No such file or directory: '/datasets/icdar2015/text_localization/train_icdar2015_label.txt'