[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-10-13 Thread Suresh Srinivas (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13127034#comment-13127034
 ] 

Suresh Srinivas commented on HDFS-1836:
---

This patch is not required for trunk as it has equivalent code.

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
Assignee: Bharath Mundlapudi
 Fix For: 0.20.3, 0.20.205.0

 Attachments: hdfs-1836-0.20.205.txt, hdfs-1836-0.20.txt, 
 hdfs-1836-0.20.txt, patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-07-26 Thread Nathan Roberts (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13071171#comment-13071171
 ] 

Nathan Roberts commented on HDFS-1836:
--

@Matt, I think this should be marked as fixed in 0.20.205.0. Is that correct?

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
Assignee: Bharath Mundlapudi
 Fix For: 0.20.3, 0.20.204.0

 Attachments: hdfs-1836-0.20.205.txt, hdfs-1836-0.20.txt, 
 hdfs-1836-0.20.txt, patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-16 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034554#comment-13034554
 ] 

Hadoop QA commented on HDFS-1836:
-

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12479218/hdfs-1836-0.20.txt
  against trunk revision 1103987.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/536//console

This message is automatically generated.

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: hdfs-1836-0.20.txt, hdfs-1836-0.20.txt, 
 patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-13 Thread Bharath Mundlapudi (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033466#comment-13033466
 ] 

Bharath Mundlapudi commented on HDFS-1836:
--

Thats correct. This code is part of trunk already.

Todd, One minor comment.

1. Can we also pass LOG object to this method? Users who wants to debug can 
enable debug option.
IOUtils.cleanup(LOG, blockStream, blockReplyStream);

Otherwise, patch looks good. Thank you. 

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: hdfs-1836-0.20.txt, patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-06 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13029794#comment-13029794
 ] 

Hadoop QA commented on HDFS-1836:
-

-1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12478378/patch-draft-1836.patch
  against trunk revision 1100054.

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

-1 patch.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/hudson/job/PreCommit-HDFS-Build/463//console

This message is automatically generated.

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-06 Thread Bharath Mundlapudi (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13029802#comment-13029802
 ] 

Bharath Mundlapudi commented on HDFS-1836:
--

Dennis,

For me, the following code seems like an issue.

  try {
blockStream.close();
blockReplyStream.close();
  } catch (IOException e) {
  }

Reason: if blockStream throws an exception, blockReplyStream will not be closed.

Can we replace all the places (2 places) in DFSClient with the following and 
try?


  try {
blockStream.close();
  } catch (IOException e) {
  }
  try {
blockReplyStream.close();
  } catch (IOException e) {
  }

Can you just try this change in your environment? 

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-06 Thread Bharath Mundlapudi (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13029805#comment-13029805
 ] 

Bharath Mundlapudi commented on HDFS-1836:
--

Retyping with format for better readability.

Change the following code
{code}
try { 
blockStream.close(); 
blockReplyStream.close(); 
} catch (IOException e) {
}
{code}

to this:

{code}
try { 
   blockStream.close(); 
} catch (IOException e) {
}
try { 
   blockReplyStream.close(); 
} catch (IOException e) {
}
{code}

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-06 Thread Todd Lipcon (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13029913#comment-13029913
 ] 

Todd Lipcon commented on HDFS-1836:
---

How about using IOUtils.cleanup, which does the above but also logs errors, etc?

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-05-06 Thread Bharath Mundlapudi (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13029950#comment-13029950
 ] 

Bharath Mundlapudi commented on HDFS-1836:
--

+1 to Todd's suggestion.

Though current code logs errors in debug mode only.



 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung
 Attachments: patch-draft-1836.patch


 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-04-26 Thread Dennis Cheung (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13025084#comment-13025084
 ] 

Dennis Cheung commented on HDFS-1836:
-

It creates a singleton FileSystem object which is connected to a HDFS cloud.
And it does nothing with hadoop but one simple thing, copy local files into 
hadoop.


We use only simple FileSystem.create(path, true) and  IOUtils.copyLarge(). And 
already applied IOUtils.closeQuietly(out); in the finally block.

Nothing more than that.

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung

 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-04-26 Thread Dennis Cheung (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13025592#comment-13025592
 ] 

Dennis Cheung commented on HDFS-1836:
-

Two new finding after reading the hadoop source.

1. If there were an exception between lines DFSClient.java:2883-2901. The 
reference to blockReplyStream may lost and remain unclosed condition.

2. processDatanodeError() will not close the previous Socket, if any.


 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung

 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-04-25 Thread Dennis Cheung (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13025030#comment-13025030
 ] 

Dennis Cheung commented on HDFS-1836:
-

@Allen Wittenauer
Update.

None.  `netstat -ano` shown NO any connection to the client. But only a few 
ESTABLISHED connection from some other datanode / clients

Recheck on the client side, the connection may still remain CLOSE_WAIT.

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung

 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-04-25 Thread Bharath Mundlapudi (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13025061#comment-13025061
 ] 

Bharath Mundlapudi commented on HDFS-1836:
--

Hi Dennis,

What is this client code doing? Is this your program which uses HDFS APIs to 
talk to datanode and namenode? 

-Bharath

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung

 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-04-17 Thread Dennis Cheung (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13020736#comment-13020736
 ] 

Dennis Cheung commented on HDFS-1836:
-

@Allen Wittenauer
Unknown, I've no access to the remote system

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung

 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HDFS-1836) Thousand of CLOSE_WAIT socket

2011-04-14 Thread Allen Wittenauer (JIRA)

[ 
https://issues.apache.org/jira/browse/HDFS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13019927#comment-13019927
 ] 

Allen Wittenauer commented on HDFS-1836:


What is the state of the other side of the socket connection?

 Thousand of CLOSE_WAIT socket 
 --

 Key: HDFS-1836
 URL: https://issues.apache.org/jira/browse/HDFS-1836
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.20.2
 Environment: Linux 2.6.18-194.32.1.el5 #1 SMP Wed Jan 5 17:52:25 EST 
 2011 x86_64 x86_64 x86_64 GNU/Linux
 java version 1.6.0_23
 Java(TM) SE Runtime Environment (build 1.6.0_23-b05)
 Java HotSpot(TM) 64-Bit Server VM (build 19.0-b09, mixed mode)
Reporter: Dennis Cheung

 $ /usr/sbin/lsof -i TCP:50010 | grep -c CLOSE_WAIT
 4471
 It is better if everything runs normal. 
 However, from time to time there are some DataStreamer Exception: 
 java.net.SocketTimeoutException and DFSClient.processDatanodeError(2507) | 
 Error Recovery for can be found from log file and the number of CLOSE_WAIT 
 socket just keep increasing
 The CLOSE_WAIT handles may remain for hours and days; then Too many open 
 file some day.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira