I don't know how I missed that :) QA said this is reproducible, so I'll try commenting this line out. Thanks Flavio!
On Wed, Jul 9, 2014 at 6:18 AM, Flavio Junqueira <[email protected]> wrote: > I wonder if this is the culprit: > > // OK to wait until socket disconnects while reading. > sock.setSoTimeout(0); > > > On Wednesday, July 9, 2014 5:55 AM, Michi Mutsuzaki <[email protected]> > wrote: > > > > Hi, > > I'm using ZooKeeper 3.4.5 (over IPSec!), and I saw a case where the > quorum connection manager takes a long time to shut down. It looks > like one of the receiver threads didn't exit for ~14 minutes. > > https://paste.apache.org/2wFN?action=download > > The tickTime is set to 3000 and initLimit is set to 5, so readInt() > should have gotten a socket timeout exception after 15 seconds. > Instead, it got an eof exception after 14 minutes. I didn't get a > chance to do a thread dump when this happened, but has anybody seen > something similiar? > >
