stan wrote at 10:56 -0400 on Aug 21, 2009: > OK here is the latest on this saga :-) > > On one of the OpenBSD 4.5 machines I have built 2.5.0p1, and was able to > back this machine up successfully (using classic UDP based authentication) > > On another of them, I built 2.5.2p1. The first attempt to back this machine > up failed. I checked the log files, and found they were having issues > because /etc/amdates was missing. I corrected that, and started a 2nd > backup run. (Remember amcheck reports all is well with this machine). I > got the following from amstatus when I attempted to back up this machine. > Also remember, one of the test I ran with a 2.6.1 client was to connect a > test machine directly to the client, using a crossover cable to eliminate > any firewall, or router type issues. > > I am attaching, what I think is, the amadnad debug file associated with this > failure. > > Can anyone suggest what I can do to further troubleshoot this? > > pb48:wd0f 1 dumper: [could not connect DATA stream: > can't connect stream to pb48.meadwestvaco.com port 11996: Connection > refused] (10:37:27) > . . . > amandad: time 30.019: stream_accept: timeout after 30 seconds > amandad: time 30.019: security_stream_seterr(0x86b67000, can't accept new > stream connection: No such file or directory) > amandad: time 30.019: stream 0 accept failed: unknown protocol error > amandad: time 30.019: security_stream_close(0x86b67000) > amandad: time 60.027: stream_accept: timeout after 30 seconds > amandad: time 60.027: security_stream_seterr(0x81212000, can't accept new > stream connection: No such file or directory) > amandad: time 60.027: stream 1 accept failed: unknown protocol error > amandad: time 60.027: security_stream_close(0x81212000) > amandad: time 90.035: stream_accept: timeout after 30 seconds > amandad: time 90.036: security_stream_seterr(0x84877000, can't accept new > stream connection: No such file or directory) > amandad: time 90.036: stream 2 accept failed: unknown protocol error > amandad: time 90.036: security_stream_close(0x84877000) > amandad: time 90.036: security_close(handle=0x81bbf800, driver=0x298a9240 > (BSD)) > amandad: time 120.044: pid 17702 finish time Fri Aug 21 10:39:27 2009
For some reason the socket is not getting marked ready for read. select(2) is timing out waiting. Firewall setup perhaps? This bit of code in 2.5.2p1's common-src/stream.c is where the failure is happening for you... int stream_accept( int server_socket, int timeout, size_t sendsize, size_t recvsize) { SELECT_ARG_TYPE readset; struct timeval tv; int nfound, connected_socket; int save_errno; int ntries = 0; in_port_t port; assert(server_socket >= 0); do { ntries++; memset(&tv, 0, SIZEOF(tv)); tv.tv_sec = timeout; memset(&readset, 0, SIZEOF(readset)); FD_ZERO(&readset); FD_SET(server_socket, &readset); nfound = select(server_socket+1, &readset, NULL, NULL, &tv); if(nfound <= 0 || !FD_ISSET(server_socket, &readset)) { save_errno = errno; if(nfound < 0) { dbprintf(("%s: stream_accept: select() failed: %s\n", debug_prefix_time(NULL), strerror(save_errno))); } else if(nfound == 0) { dbprintf(("%s: stream_accept: timeout after %d second%s\n", debug_prefix_time(NULL), timeout, (timeout == 1) ? "" : "s")); errno = ENOENT; /* ??? */ return -1;