Hi,

Has anybody seen KMeans leak TCP connections when it runs, eventually
causing the OS to report a "Too many open files" error? In other words,
it looks like a file descriptor leak.

I'm running a four-machine Hadoop 0.18.3 cluster, with the current version
of Mahout from trunk. Each k-means iteration creates an extra 40 or so TCP
connections that are then left orphaned in the CLOSE_WAIT state. I've
attached the output of lsof from the middle of a run.

Any help would be appreciated!

Cheers,
Mark
