Here is some log output from the client related to the timeout and the file 
descriptors:

distcc[35805] [Tue May  5 15:28:36 2015] (dcc_lock_host) got cpu lock on machine2/8,cpp,lzo slot 0 as fd7
distcc[35805] [Tue May  5 15:28:36 2015] (dcc_lock_host) got cpu lock on localhost slot 1 as fd8
distcc[35805] [Tue May  5 15:28:36 2015] (dcc_connect_by_addr) created socket on fd9
distcc[35805] [Tue May  5 15:28:36 2015] (dcc_select_for_write) select for write on fd9 for 20s
distcc[35805] [Tue May  5 15:28:36 2015] (dcc_get_io_timeout) Using IO timeout value: 600
distcc[35805] [Tue May  5 15:28:36 2015] (dcc_select_for_read) select for read on fd9 for 600s
distcc[35805] [Tue May  5 15:28:37 2015] (dcc_unlock) release lock fd8
distcc[35805] [Tue May  5 15:28:37 2015] (dcc_connect_by_addr) created socket on fd8
distcc[35805] [Tue May  5 15:28:37 2015] (dcc_select_for_write) select for write on fd8 for 20s
distcc[35805] [Tue May  5 15:28:57 2015] (dcc_select_for_write) ERROR: IO timeout
distcc[35805] [Tue May  5 15:28:57 2015] ERROR: timeout while connecting to 172.xx.xx.xx:3632
distcc[35805] [Tue May  5 15:28:57 2015] (dcc_unlock) release lock fd7

I added an extra rs_trace call to print the “created socket on” message so I 
could see where the fd for the timed out select was coming from.

   Michael

On May 5, 2015, at 3:10 PM, Martin Pool <m...@sourcefrog.net> wrote:

I wonder if you have a firewall on the server.
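
One way to test the firewall theory from the client side is to probe the distccd port directly. The following is only a minimal sketch, not distcc code: `probe_tcp` is a hypothetical helper, port 3632 and the 20-second wait are taken from the client log in this thread, and the non-blocking connect followed by a select for write is an approximation of the step the log labels dcc_select_for_write.

```python
import errno
import select
import socket


def probe_tcp(host, port=3632, timeout=20.0):
    """Try a non-blocking TCP connect and classify the outcome.

    Returns "open", "refused", "timeout", or "error: <reason>".
    Hypothetical helper for illustration; not part of distcc.
    """
    try:
        infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        return f"error: {exc}"

    # Only the first resolved address is tried in this sketch.
    family, socktype, proto, _, addr = infos[0]
    sock = socket.socket(family, socktype, proto)
    try:
        sock.setblocking(False)
        rc = sock.connect_ex(addr)
        if rc in (errno.EINPROGRESS, errno.EWOULDBLOCK):
            # Connection is pending: wait until the socket is writable.
            _, writable, _ = select.select([], [sock], [], timeout)
            if not writable:
                return "timeout"
            # Writable means the connect finished; fetch its result.
            rc = sock.getsockopt(socket.SOL_SOCKET, socket.SO_ERROR)
        if rc == 0:
            return "open"
        if rc == errno.ECONNREFUSED:
            return "refused"
        return f"error: {errno.errorcode.get(rc, rc)}"
    finally:
        sock.close()
```

Calling `probe_tcp("machine2")` from the client would then give a rough hint: "timeout" usually means the packets are being dropped silently, which is typical of a firewall, while "refused" usually means the host is reachable but nothing is listening on that port.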

On Tue, May 5, 2015 at 3:08 PM, Morrell, Michael <michael.morr...@intel.com> wrote:
Nothing is in the server’s logs.  I started it with:

  distccd --daemon -a xxx.xx.xx.xx --log-file ~/distccd.log --verbose

The last line is “(dcc_create_kids) up to 10 children” from its initialization.

   Michael

On May 5, 2015, at 2:40 PM, Martin Pool <m...@sourcefrog.net> wrote:

What's in the server's logs?

On Tue, May 5, 2015 at 2:31 PM Morrell, Michael <michael.morr...@intel.com> wrote:
I’m new to distcc.  I downloaded 3.2rc1 and compiled it on OS X recently and 
began experimenting.

I’m getting a lot of “IO timeout” errors and I don’t know what could be causing 
them.

I am using two servers running distccd, each with 8 processors.

My DISTCC_HOSTS is “machine1/8,cpp,lzo machine2/8,cpp,lzo”.

I have DISTCC_FALLBACK set to 0.

I’m running the command “pump make -j16”.

Usually, I get no compilation done at all.  Every one that is attempted gets a 
timeout.

I even increased dcc_connect_timeout from 4 to 20 seconds and it still occurs.

From the logs, I see 31 object files being tried, and 17 get the error 107 
(EXIT_IO_ERROR) and another 14 get error 116 (EXIT_NO_HOSTS).
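
A quick way to see which calls the errors come from is to tally the ERROR lines in the client log by function name. This is only a rough sketch: it assumes each entry sits on a single line in the form `distcc[pid] [timestamp] (function) message` (with any mail-wrapped lines rejoined first), and `tally_errors` is a hypothetical helper, not part of distcc.

```python
import re
from collections import Counter

# Assumed entry layout: distcc[pid] [timestamp] (function) message
# The "(function)" part is optional; some entries omit it.
LINE_RE = re.compile(
    r"^distcc\[(?P<pid>\d+)\] \[(?P<when>[^\]]+)\] "
    r"(?:\((?P<func>\w+)\) )?(?P<msg>.*)$"
)


def tally_errors(lines):
    """Count log entries whose message starts with 'ERROR:', per function."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if match and match.group("msg").startswith("ERROR:"):
            counts[match.group("func") or "(no function)"] += 1
    return counts
```

For example, `tally_errors(open("distcc.log"))` would return a Counter keyed by function name; the file name here is a placeholder for wherever the client log is written.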

I’d appreciate any ideas on what to do here.

Thanks,

   Michael


__
distcc mailing list            http://distcc.samba.org/
To unsubscribe or change options:
https://lists.samba.org/mailman/listinfo/distcc




--
Martin
