[ 
https://issues.apache.org/jira/browse/AVRO-943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13137626#comment-13137626
 ] 

James Baldassari commented on AVRO-943:
---------------------------------------

The root of the problem seems to be the unbounded waiting on the connect future 
(channelFuture.awaitUninterruptibly()) when NettyTransceiver attempts to 
connect to the remote peer.  Under some circumstances this call never returns.  
I have a patch which addresses the issue by making sure that we always supply a 
connection timeout.  I added new NettyTransceiver constructors for specifying 
the connection timeout.  Backward compatibility is preserved by using a default 
connection timeout if none is specified.  I picked 60 seconds for the default 
timeout, but let me know if you think a different value is more appropriate.  
In the unit tests I've set the connection timeout to 2 seconds.  I ran these in 
a loop as you did, and I haven't seen any more hanging.  Please try out the 
patch and verify when you get a chance.

I've also addressed the concerns in AVRO-848 by removing some log statements 
and changing others to debug level.
                
> TestNettyServerWithCallbacks sometimes hangs
> --------------------------------------------
>
>                 Key: AVRO-943
>                 URL: https://issues.apache.org/jira/browse/AVRO-943
>             Project: Avro
>          Issue Type: Bug
>          Components: java
>            Reporter: Doug Cutting
>
> I'm periodically seeing tests hang in TestNettyServerWithCallbacks.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to