lidavidm commented on pull request #12442: URL: https://github.com/apache/arrow/pull/12442#issuecomment-1080586810
Odd. I'll see if I can replicate. Two things stand out to me: 1. UCX appears to have been compiled without multi-thread support, we do request this: ``` [1648461566.472887] [my-test-host:4147425:0] ucp_worker.c:2118 UCX DIAG multi-threaded worker is requested, but library is built without multi-thread support ``` What is `ucx_info -v`? 2. It appears after initial connect, something is trying to connect an IPv4 source address to an IPv6 destination address (I would think this is not allowed?) ``` [1648462634.576737] [my-test-host:4152820:1] tcp_cm.c:96 UCX DEBUG tcp_ep 0xffff8c567690: CLOSED -> CONNECTING for the [127.0.0.1:37347]<->[::1:35635]:3 connection [-:Rx] [1648462634.576759] [my-test-host:4152820:1] tcp_cm.c:96 UCX DEBUG tcp_ep 0xffff8c567690: CONNECTING -> CONNECTING for the [127.0.0.1:37347]<->[::1:35635]:3 connection [-:Rx] [1648462634.576804] [my-test-host:4152820:1] sock.c:325 UCX ERROR connect(fd=85, dest_addr=::1:35635) failed: Connection refused [1648462634.576828] [my-test-host:4152820:1] wireup_cm.c:1203 UCX WARN server ep 0xffff94561000 failed to connect to remote address on device lo, tl_bitmap 0x40 0x0, status Destination is unreachable ``` Maybe try with `UCX_NET_DEVICES=lo`? (Not sure if that is related.) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org