I am seeing the same issue on CentOS 6 with pyOpenSSL-0.10-2.el6.x86_64.

On Wed, Oct 19, 2011 at 6:19 PM, 马新成|Jackie Ma <[email protected]> wrote:

> Make sure PyOpenSSL's version. UPGRADE pyOpenSSL of all machines' to
> pyOpenSSL-0.6-2.el5.
> Check the hosts. Make sure all machine can resolve each other.
>
> On Thu, Oct 20, 2011 at 12:07 AM, Alison Young 
> <[email protected]>wrote:
>
>>  Hello,
>>
>> We are seeing an occasional problem where restarts of funcd on the minions
>> are not successful and the func daemon is stopped but not able to start
>> again.
>>
>> Checking func.log gives:
>>
>> 2011-10-02 04:02:04,321 - INFO - Exception occured: socket.error
>> 2011-10-02 04:02:04,321 - INFO - Exception value: (98, 'Address already in
>> use')
>> 2011-10-02 04:02:04,322 - INFO - Exception Info:
>>   File "/usr/bin/funcd", line 23, in ?
>>     server.main(sys.argv)
>>    File "/usr/lib/python2.4/site-packages/func/minion/server.py", line
>> 413, in main
>>     serve()
>>    File "/usr/lib/python2.4/site-packages/func/minion/server.py", line
>> 225, in serve
>>     server = setup_server()
>>    File "/usr/lib/python2.4/site-packages/func/minion/server.py", line
>> 220, in setup_server
>>     server = FuncSSLXMLRPCServer((listen_addr, listen_port),
>> config.module_list)
>>    File "/usr/lib/python2.4/site-packages/func/minion/server.py", line
>> 279, in __init__
>>     self.ca)
>>    File
>> "/usr/lib/python2.4/site-packages/func/minion/AuthedXMLRPCServer.py", line
>> 74, in __init__
>>     SimpleXMLRPCServer.SimpleXMLRPCServer.__init__(self, address,
>> AuthedSimpleXMLRPCRequestHandler)
>>    File "/usr/lib64/python2.4/SimpleXMLRPCServer.py", line 473, in
>> __init__
>>     SocketServer.TCPServer.__init__(self, addr, requestHandler)
>>    File "/usr/lib64/python2.4/SocketServer.py", line 330, in __init__
>>     self.server_bind()
>>    File "/usr/lib64/python2.4/SocketServer.py", line 341, in server_bind
>>     self.socket.bind(self.server_address)
>>    File "<string>", line 1, in bind
>>
>>
>> As you may guess from the timestamp we are seeing this problem most often
>> at 4:02am on Sundays, i.e. as part of the logrotate of func logs. Logging in
>> to the server and starting the func service once we spot it is stopped has
>> always worked so far without needing manual removal of any pid or lock file.
>>
>> One theory is that this problem occurred when the func minion was
>> processing a command and told to restart part way through. From watching
>> netstat, it looks like the func daemon stops listening on the minion port to
>> allow the spawned process to communicate with the master. If the daemon
>> stops, the spawned process blocks a new daemon from starting ('Address
>> already in use') but that spawned process then exits and we're left with no
>> daemons.
>>
>> Does this ring any bells with anyone? Is this a known bug?
>>
>> We've already added monit to mop up after this, but it'd be much
>> preferable to find a proper fix.
>>
>> Alison
>>
>> _______________________________________________
>> Func-list mailing list
>> [email protected]
>> https://www.redhat.com/mailman/listinfo/func-list
>>
>
>
>
> --
> --------------------------
>
> 马新成 | Jackie Ma
>
> MSN: [email protected]   QQ: 2252339967
> Twitter: @JackieMa2   G+:  Jackie Ma
> My_web: http://jackiema.blog.chinaunix.net
>
>               http://cn.linkedin.com/in/jacknet
>
> 使IT运维简单,方便,智能,提高运维效率,节省人力
>
> _______________________________________________
> Func-list mailing list
> [email protected]
> https://www.redhat.com/mailman/listinfo/func-list
>
_______________________________________________
Func-list mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/func-list

Reply via email to