2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_setup.py:_flush():68] Current SDK version is 0.19.1
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_setup.py:_flush():68] Configure stats pid to 123
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_setup.py:_flush():68] Loading settings from /mnt/xgen-mm/LAVIS/wandb/settings
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_setup.py:_flush():68] Loading settings from environment variables
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_init.py:_log_setup():528] Logging user logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_094524-qoi2bab2/logs/debug.log
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_init.py:_log_setup():529] Logging internal logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_094524-qoi2bab2/logs/debug-internal.log
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_init.py:init():644] calling init triggers
2024-12-16 09:45:24,954 INFO MainThread:123 [wandb_init.py:init():650] wandb.init called with sweep_config: {}
{"time":"2024-12-16T10:12:04.091970789+08:00","level":"ERROR","msg":"sender: upsertRun:","error":"failed to upsert bucket: api: failed sending: context canceled"}
{"time":"2024-12-16T10:12:04.092014628+08:00","level":"WARN","msg":"runwork: ignoring record after close","work":{"Record":{"RecordType":{"Request":{"RequestType":{"Defer":{}}}},"control":{"always_send":true}}}}
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_setup.py:_flush():68] Current SDK version is 0.19.1
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_setup.py:_flush():68] Configure stats pid to 1952
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_setup.py:_flush():68] Loading settings from /mnt/xgen-mm/LAVIS/wandb/settings
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_setup.py:_flush():68] Loading settings from environment variables
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_init.py:_log_setup():528] Logging user logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_101034-bluz2d3p/logs/debug.log
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_init.py:_log_setup():529] Logging internal logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_101034-bluz2d3p/logs/debug-internal.log
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_init.py:init():644] calling init triggers
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_init.py:init():650] wandb.init called with sweep_config: {}
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_init.py:init():680] starting backend
2024-12-16 10:10:34,054 INFO MainThread:1952 [wandb_init.py:init():684] sending inform_init request
2024-12-16 10:10:34,058 INFO MainThread:1952 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
2024-12-16 10:10:34,058 INFO MainThread:1952 [wandb_init.py:init():697] backend started and connected
2024-12-16 10:10:34,061 INFO MainThread:1952 [wandb_init.py:init():790] updated telemetry
2024-12-16 10:10:34,068 INFO MainThread:1952 [wandb_init.py:init():822] communicating run to backend with 90.0 second timeout
2024-12-16 10:11:02,254 INFO Thread-2 (wrapped_target):1952 [retry.py:__call__():172] Retry attempt failed:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 199, in _new_conn
    sock = connection.create_connection(
  File "/usr/local/lib/python3.10/site-packages/urllib3/util/connection.py", line 60, in create_connection
    for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
  File "/usr/local/lib/python3.10/socket.py", line 955, in getaddrinfo
    for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
socket.gaierror: [Errno -3] Temporary failure in name resolution
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 789, in urlopen
    response = self._make_request(
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 490, in _make_request
    raise new_e
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 466, in _make_request
    self._validate_conn(conn)
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 1095, in _validate_conn
    conn.connect()
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 693, in connect
    self.sock = sock = self._new_conn()
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 206, in _new_conn
    raise NameResolutionError(self.host, self, e) from e
urllib3.exceptions.NameResolutionError: <urllib3.connection.HTTPSConnection object at 0x7f5d48759540>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/requests/adapters.py", line 667, in send
    resp = conn.urlopen(
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 843, in urlopen
    retries = retries.increment(
  File "/usr/local/lib/python3.10/site-packages/urllib3/util/retry.py", line 519, in increment
    raise MaxRetryError(_pool, url, reason) from reason # type: ignore[arg-type]
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='api.wandb.ai', port=443): Max retries exceeded with url: /graphql (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f5d48759540>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)"))
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/lib/retry.py", line 131, in __call__
    result = self._call_fn(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/internal/internal_api.py", line 393, in execute
  File "/usr/local/lib/python3.10/site-packages/requests/sessions.py", line 589, in request
    resp = self.send(prep, **send_kwargs)
  File "/usr/local/lib/python3.10/site-packages/requests/sessions.py", line 703, in send
    r = adapter.send(request, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/requests/adapters.py", line 700, in send
    raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='api.wandb.ai', port=443): Max retries exceeded with url: /graphql (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f5d48759540>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)"))
2024-12-16 10:12:04,084 ERROR MainThread:1952 [wandb_init.py:init():849] encountered error: Run initialization has timed out after 90.0 sec. Please try increasing the timeout with the `init_timeout` setting: `wandb.init(settings=wandb.Settings(init_timeout=120))`.
2024-12-16 10:12:04,085 ERROR MainThread:1952 [wandb_init.py:init():1308] error in wandb.init()
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 1298, in init
    return wi.init()
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 855, in init
    raise error
wandb.errors.errors.CommError: Run initialization has timed out after 90.0 sec. Please try increasing the timeout with the `init_timeout` setting: `wandb.init(settings=wandb.Settings(init_timeout=120))`.
2024-12-16 10:12:04,091 WARNING MsgRouterThr:1952 [router.py:message_loop():75] message_loop has been closed
{"time":"2024-12-16T10:17:57.500022511+08:00","level":"ERROR","msg":"sender: upsertRun:","error":"failed to upsert bucket: api: failed sending: context canceled"}
{"time":"2024-12-16T10:17:57.500065863+08:00","level":"WARN","msg":"runwork: ignoring record after close","work":{"Record":{"RecordType":{"Request":{"RequestType":{"Defer":{}}}},"control":{"always_send":true}}}}
2024-12-16 10:16:27,450 INFO MainThread:2689 [wandb_setup.py:_flush():68] Current SDK version is 0.19.1
2024-12-16 10:16:27,450 INFO MainThread:2689 [wandb_setup.py:_flush():68] Configure stats pid to 2689
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_setup.py:_flush():68] Loading settings from /mnt/xgen-mm/LAVIS/wandb/settings
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_setup.py:_flush():68] Loading settings from environment variables
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_init.py:_log_setup():528] Logging user logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_101627-3las4gzs/logs/debug.log
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_init.py:_log_setup():529] Logging internal logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_101627-3las4gzs/logs/debug-internal.log
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_init.py:init():644] calling init triggers
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_init.py:init():650] wandb.init called with sweep_config: {}
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_init.py:init():680] starting backend
2024-12-16 10:16:27,451 INFO MainThread:2689 [wandb_init.py:init():684] sending inform_init request
2024-12-16 10:16:27,455 INFO MainThread:2689 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
2024-12-16 10:16:27,455 INFO MainThread:2689 [wandb_init.py:init():697] backend started and connected
2024-12-16 10:16:27,457 INFO MainThread:2689 [wandb_init.py:init():790] updated telemetry
2024-12-16 10:16:27,464 INFO MainThread:2689 [wandb_init.py:init():822] communicating run to backend with 90.0 second timeout
2024-12-16 10:16:55,651 INFO Thread-2 (wrapped_target):2689 [retry.py:__call__():172] Retry attempt failed:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 199, in _new_conn
    sock = connection.create_connection(
  File "/usr/local/lib/python3.10/site-packages/urllib3/util/connection.py", line 60, in create_connection
    for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
  File "/usr/local/lib/python3.10/socket.py", line 955, in getaddrinfo
    for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
socket.gaierror: [Errno -3] Temporary failure in name resolution
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 789, in urlopen
    response = self._make_request(
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 490, in _make_request
    raise new_e
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 466, in _make_request
    self._validate_conn(conn)
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 1095, in _validate_conn
    conn.connect()
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 693, in connect
    self.sock = sock = self._new_conn()
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 206, in _new_conn
    raise NameResolutionError(self.host, self, e) from e
urllib3.exceptions.NameResolutionError: <urllib3.connection.HTTPSConnection object at 0x7f7fd65755a0>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/requests/adapters.py", line 667, in send
    resp = conn.urlopen(
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 843, in urlopen
    retries = retries.increment(
  File "/usr/local/lib/python3.10/site-packages/urllib3/util/retry.py", line 519, in increment
    raise MaxRetryError(_pool, url, reason) from reason # type: ignore[arg-type]
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='api.wandb.ai', port=443): Max retries exceeded with url: /graphql (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f7fd65755a0>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)"))
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/lib/retry.py", line 131, in __call__
    result = self._call_fn(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/internal/internal_api.py", line 393, in execute
  File "/usr/local/lib/python3.10/site-packages/requests/sessions.py", line 589, in request
    resp = self.send(prep, **send_kwargs)
  File "/usr/local/lib/python3.10/site-packages/requests/sessions.py", line 703, in send
    r = adapter.send(request, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/requests/adapters.py", line 700, in send
    raise ConnectionError(e, request=request)
raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='api.wandb.ai', port=443): Max retries exceeded with url: /graphql (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f7fd65755a0>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)"))
2024-12-16 10:17:57,494 ERROR MainThread:2689 [wandb_init.py:init():849] encountered error: Run initialization has timed out after 90.0 sec. Please try increasing the timeout with the `init_timeout` setting: `wandb.init(settings=wandb.Settings(init_timeout=120))`.
2024-12-16 10:17:57,494 ERROR MainThread:2689 [wandb_init.py:init():1308] error in wandb.init()
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 1298, in init
    return wi.init()
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 855, in init
    raise error
wandb.errors.errors.CommError: Run initialization has timed out after 90.0 sec. Please try increasing the timeout with the `init_timeout` setting: `wandb.init(settings=wandb.Settings(init_timeout=120))`.
2024-12-16 10:17:57,500 WARNING MsgRouterThr:2689 [router.py:message_loop():75] message_loop has been closed
{"time":"2024-12-16T10:38:41.423690712+08:00","level":"WARN","msg":"runwork: ignoring record after close","work":{"Record":{"RecordType":{"Request":{"RequestType":{"Defer":{}}}},"control":{"always_send":true}}}}
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_setup.py:_flush():68] Current SDK version is 0.19.1
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_setup.py:_flush():68] Configure stats pid to 5226
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_setup.py:_flush():68] Loading settings from /mnt/xgen-mm/LAVIS/wandb/settings
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_setup.py:_flush():68] Loading settings from environment variables
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_init.py:_log_setup():528] Logging user logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_103711-tpkv930s/logs/debug.log
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_init.py:_log_setup():529] Logging internal logs to /mnt/xgen-mm/LAVIS/wandb/run-20241216_103711-tpkv930s/logs/debug-internal.log
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_init.py:init():644] calling init triggers
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_init.py:init():650] wandb.init called with sweep_config: {}
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_init.py:init():680] starting backend
2024-12-16 10:37:11,394 INFO MainThread:5226 [wandb_init.py:init():684] sending inform_init request
2024-12-16 10:37:11,397 INFO MainThread:5226 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
2024-12-16 10:37:11,398 INFO MainThread:5226 [wandb_init.py:init():697] backend started and connected
2024-12-16 10:37:11,400 INFO MainThread:5226 [wandb_init.py:init():790] updated telemetry
2024-12-16 10:37:11,405 INFO MainThread:5226 [wandb_init.py:init():822] communicating run to backend with 90.0 second timeout
2024-12-16 10:37:39,595 INFO Thread-2 (wrapped_target):5226 [retry.py:__call__():172] Retry attempt failed:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 199, in _new_conn
    sock = connection.create_connection(
  File "/usr/local/lib/python3.10/site-packages/urllib3/util/connection.py", line 60, in create_connection
    for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
  File "/usr/local/lib/python3.10/socket.py", line 955, in getaddrinfo
    for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
socket.gaierror: [Errno -3] Temporary failure in name resolution
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 789, in urlopen
    response = self._make_request(
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 490, in _make_request
    raise new_e
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 466, in _make_request
    self._validate_conn(conn)
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 1095, in _validate_conn
    conn.connect()
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 693, in connect
    self.sock = sock = self._new_conn()
  File "/usr/local/lib/python3.10/site-packages/urllib3/connection.py", line 206, in _new_conn
    raise NameResolutionError(self.host, self, e) from e
urllib3.exceptions.NameResolutionError: <urllib3.connection.HTTPSConnection object at 0x7f46d5f21510>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/requests/adapters.py", line 667, in send
    resp = conn.urlopen(
  File "/usr/local/lib/python3.10/site-packages/urllib3/connectionpool.py", line 843, in urlopen
    retries = retries.increment(
  File "/usr/local/lib/python3.10/site-packages/urllib3/util/retry.py", line 519, in increment
    raise MaxRetryError(_pool, url, reason) from reason # type: ignore[arg-type]
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='api.wandb.ai', port=443): Max retries exceeded with url: /graphql (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f46d5f21510>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)"))
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/lib/retry.py", line 131, in __call__
    result = self._call_fn(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/internal/internal_api.py", line 393, in execute
  File "/usr/local/lib/python3.10/site-packages/requests/sessions.py", line 589, in request
    resp = self.send(prep, **send_kwargs)
  File "/usr/local/lib/python3.10/site-packages/requests/sessions.py", line 703, in send
    r = adapter.send(request, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/requests/adapters.py", line 700, in send
    raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='api.wandb.ai', port=443): Max retries exceeded with url: /graphql (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f46d5f21510>: Failed to resolve 'api.wandb.ai' ([Errno -3] Temporary failure in name resolution)"))
2024-12-16 10:38:41,418 ERROR MainThread:5226 [wandb_init.py:init():849] encountered error: Run initialization has timed out after 90.0 sec. Please try increasing the timeout with the `init_timeout` setting: `wandb.init(settings=wandb.Settings(init_timeout=120))`.
2024-12-16 10:38:41,418 ERROR MainThread:5226 [wandb_init.py:init():1308] error in wandb.init()
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 1298, in init
    return wi.init()
  File "/usr/local/lib/python3.10/site-packages/wandb/sdk/wandb_init.py", line 855, in init
    raise error
wandb.errors.errors.CommError: Run initialization has timed out after 90.0 sec. Please try increasing the timeout with the `init_timeout` setting: `wandb.init(settings=wandb.Settings(init_timeout=120))`.
2024-12-16 10:38:41,423 WARNING MsgRouterThr:5226 [router.py:message_loop():75] message_loop has been closed