======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_selected_gpus', current_value='6', default_value='')
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
=======================================================================
I0727 11:19:01.148191 2885 tcp_utils.cc:107] Retry to connect to 127.0.0.1:52742 while the server is not yet listening.
I0727 11:19:04.148348 2885 tcp_utils.cc:130] Successfully connected to 127.0.0.1:52742
I0727 11:19:04.179088 2885 process_group_nccl.cc:120] ProcessGroupNCCL pg_timeout_ 1800000
W0727 11:19:04.198612 2885 gpu_resources.cc:119] Please NOTE: device: 6, GPU Compute Capability: 90.2, Driver API Version: 50724.2, Runtime API Version: 50724.2
eval model:: 0%| | 0/500 [00:00 >)
----------------------
Error Message Summary:
----------------------
FatalError: `Termination signal` is detected by the operating system.
[TimeInfo: *** Aborted at 1722504078 (unix time) try "date -d @1722504078" if you are using GNU date ***]
[SignalInfo: *** SIGTERM (@0x148) received by PID 438 (TID 0x7f5ff6315640) from PID 328 ***]

======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
FLAGS(name='FLAGS_selected_gpus', current_value='6', default_value='')
=======================================================================
I0801 17:43:09.939770 762 tcp_utils.cc:107] Retry to connect to 10.8.145.246:62911 while the server is not yet listening.
--------------------------------------
C++ Traceback (most recent call last):
--------------------------------------
0 phi::distributed::CreateOrGetGlobalTCPStore()
1 phi::distributed::TCPStore::TCPStore(std::string, unsigned short, bool, unsigned long, int)
2 phi::distributed::detail::TCPClient::connect(std::string, unsigned short)
3 phi::distributed::tcputils::tcp_connect(std::string, std::string, int, std::chrono::duration >)

----------------------
Error Message Summary:
----------------------
FatalError: `Termination signal` is detected by the operating system.
[TimeInfo: *** Aborted at 1722505392 (unix time) try "date -d @1722505392" if you are using GNU date ***]
[SignalInfo: *** SIGTERM (@0x28f) received by PID 762 (TID 0x7fb20685a740) from PID 655 ***]

======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
FLAGS(name='FLAGS_selected_gpus', current_value='6', default_value='')
=======================================================================
I0801 17:45:05.447449 1081 tcp_utils.cc:107] Retry to connect to 10.8.145.246:49295 while the server is not yet listening.
I0801 17:45:08.447674 1081 tcp_utils.cc:130] Successfully connected to 10.8.145.246:49295
I0801 17:45:08.505410 1081 process_group_nccl.cc:120] ProcessGroupNCCL pg_timeout_ 1800000
W0801 17:45:08.518016 1081 gpu_resources.cc:119] Please NOTE: device: 6, GPU Compute Capability: 90.2, Driver API Version: 50724.2, Runtime API Version: 50724.2
eval model:: 0%| | 0/500 [00:00
    from ppocr.data import build_dataloader
  File "/root/paddle_dbnet/ppocr/data/__init__.py", line 35, in <module>
    from ppocr.data.imaug import transform, create_operators
  File "/root/paddle_dbnet/ppocr/data/imaug/__init__.py", line 19, in <module>
    from .iaa_augment import IaaAugment
  File "/root/paddle_dbnet/ppocr/data/imaug/iaa_augment.py", line 24, in <module>
    import imgaug
  File "/root/anaconda3/envs/dbnet_test/lib/python3.10/site-packages/imgaug/__init__.py", line 7, in <module>
    from imgaug.imgaug import * # pylint: disable=redefined-builtin
  File "/root/anaconda3/envs/dbnet_test/lib/python3.10/site-packages/imgaug/imgaug.py", line 18, in <module>
    import cv2
ImportError: numpy.core.multiarray failed to import

======================= Modified FLAGS detected =======================
FLAGS(name='FLAGS_selected_gpus', current_value='6', default_value='')
FLAGS(name='FLAGS_cudnn_batchnorm_spatial_persistent', current_value=True, default_value=False)
=======================================================================
I0809 17:16:34.717787 262703 tcp_utils.cc:107] Retry to connect to 127.0.0.1:40029 while the server is not yet listening.
I0809 17:16:37.717911 262703 tcp_utils.cc:130] Successfully connected to 127.0.0.1:40029
I0809 17:16:37.747879 262703 process_group_nccl.cc:120] ProcessGroupNCCL pg_timeout_ 1800000
Traceback (most recent call last):
  File "/root/paddle_dbnet/tools/train.py", line 198, in <module>
    main(config, device, logger, vdl_writer)
  File "/root/paddle_dbnet/tools/train.py", line 53, in main
    train_dataloader = build_dataloader(config, 'Train', device, logger)
  File "/root/paddle_dbnet/ppocr/data/__init__.py", line 65, in build_dataloader
    dataset = eval(module_name)(config, mode, logger, seed)
  File "/root/paddle_dbnet/ppocr/data/simple_dataset.py", line 47, in __init__
    self.data_lines = self.get_image_info_list(label_file_list, ratio_list)
  File "/root/paddle_dbnet/ppocr/data/simple_dataset.py", line 61, in get_image_info_list
    with open(file, "rb") as f:
FileNotFoundError: [Errno 2] No such file or directory: '/datasets/icdar2015/text_localization/train_icdar2015_label.txt'