[jira] [Commented] (HDFS-1605) Convert DFSInputStream synchronized sections to a ReadWrite lock
[ https://issues.apache.org/jira/browse/HDFS-1605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837462#comment-13837462 ]

Liang Xie commented on HDFS-1605:
---------------------------------

We observed this hotspot in our production cluster these days. Most of the threads waiting on the lock look like the following:

{noformat}
"IPC Server handler 27 on 12600" daemon prio=10 tid=0x7f82fc1e5750 nid=0x4d9b waiting for monitor entry [0x7f821fe78000]
   java.lang.Thread.State: BLOCKED (on object monitor)
        at org.apache.hadoop.hdfs.DFSInputStream.getFileLength(DFSInputStream.java:242)
        - waiting to lock 0x00044e20d238 (a org.apache.hadoop.hdfs.DFSInputStream)
        at org.apache.hadoop.hdfs.DFSInputStream.read(DFSInputStream.java:982)
        at org.apache.hadoop.fs.FSDataInputStream.read(FSDataInputStream.java:73)
        at org.apache.hadoop.hbase.io.hfile.HFileBlock$AbstractFSReader.readAtOffset(HFileBlock.java:1393)
{noformat}

and the lock holder is doing legitimate read work at that time.

> Convert DFSInputStream synchronized sections to a ReadWrite lock
> ----------------------------------------------------------------
>
> Key: HDFS-1605
> URL: https://issues.apache.org/jira/browse/HDFS-1605
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: hdfs-client
> Reporter: dhruba borthakur
> Assignee: dhruba borthakur
> Attachments: DFSClientRWlock.1.txt, DFSClientRWlock.3.txt
>
> HBase does concurrent preads from multiple threads to different blocks of the same HDFS file. Each of these pread calls invokes DFSInputStream.getFileLength() and DFSInputStream.getBlockAt(). These methods are synchronized, thus causing all the concurrent threads to serialize. It would help performance to convert this to a read/write lock.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
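The conversion the issue proposes can be sketched in miniature: read-mostly accessors take a shared read lock so concurrent preads no longer serialize, while mutating methods take the exclusive write lock. This is an illustrative stand-in, not the actual DFSInputStream code; the class and field names here are hypothetical.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch of the locking change: getFileLength()-style accessors
// take the read lock (many readers at once), while state updates take the
// write lock (exclusive). Names are stand-ins, not the real HDFS fields.
class StreamState {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private long fileLength = 0;

    long getFileLength() {            // called concurrently by pread threads
        lock.readLock().lock();
        try {
            return fileLength;        // shared access: no serialization
        } finally {
            lock.readLock().unlock();
        }
    }

    void updateFileLength(long newLength) {  // rare metadata update
        lock.writeLock().lock();
        try {
            fileLength = newLength;   // exclusive access
        } finally {
            lock.writeLock().unlock();
        }
    }
}
```

With a plain `synchronized` method, every pread thread queues behind the monitor even though none of them mutate state; with the read/write lock, only writers block readers.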
[jira] [Commented] (HDFS-5536) Implement HTTP policy for Namenode and DataNode
[ https://issues.apache.org/jira/browse/HDFS-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837461#comment-13837461 ]

Hadoop QA commented on HDFS-5536:
---------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12616702/HDFS-5536.007.patch
against trunk revision.

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:green}+1 tests included{color}. The patch appears to include 4 new or modified test files.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs:

    org.apache.hadoop.hdfs.web.TestHttpsFileSystem

{color:green}+1 contrib tests{color}. The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5620//testReport/
Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5620//console

This message is automatically generated.

> Implement HTTP policy for Namenode and DataNode
> -----------------------------------------------
>
> Key: HDFS-5536
> URL: https://issues.apache.org/jira/browse/HDFS-5536
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Reporter: Haohui Mai
> Assignee: Haohui Mai
> Attachments: HDFS-5536.000.patch, HDFS-5536.001.patch, HDFS-5536.002.patch, HDFS-5536.003.patch, HDFS-5536.004.patch, HDFS-5536.005.patch, HDFS-5536.006.patch, HDFS-5536.007.patch, HDFS-5536.008.patch
>
> This jira implements the HTTP and HTTPS policy in the namenode and the datanode.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-2882) DN continues to start up, even if block pool fails to initialize
[ https://issues.apache.org/jira/browse/HDFS-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837477#comment-13837477 ]

Vinay commented on HDFS-2882:
-----------------------------

OK Colin. Could anyone else review the patch? Thanks.

> DN continues to start up, even if block pool fails to initialize
> ----------------------------------------------------------------
>
> Key: HDFS-2882
> URL: https://issues.apache.org/jira/browse/HDFS-2882
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: datanode
> Affects Versions: 2.0.2-alpha
> Reporter: Todd Lipcon
> Assignee: Vinay
> Attachments: HDFS-2882.patch, HDFS-2882.patch, HDFS-2882.patch, HDFS-2882.patch, HDFS-2882.patch, HDFS-2882.patch, hdfs-2882.txt
>
> I started a DN on a machine that was completely out of space on one of its drives. I saw the following:
>
> {noformat}
> 2012-02-02 09:56:50,499 FATAL org.apache.hadoop.hdfs.server.datanode.DataNode: Initialization failed for block pool Block pool BP-448349972-172.29.5.192-1323816762969 (storage id DS-507718931-172.29.5.194-11072-1297842002148) service to styx01.sf.cloudera.com/172.29.5.192:8021
> java.io.IOException: Mkdirs failed to create /data/1/scratch/todd/styx-datadir/current/BP-448349972-172.29.5.192-1323816762969/tmp
>         at org.apache.hadoop.hdfs.server.datanode.FSDataset$BlockPoolSlice.init(FSDataset.java:335)
> {noformat}
>
> but the DN continued to run, spewing NPEs when it tried to do block reports, etc. This was on the HDFS-1623 branch but may affect trunk as well.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-3405) Checkpointing should use HTTP POST or PUT instead of GET-GET to send merged fsimages
[ https://issues.apache.org/jira/browse/HDFS-3405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837478#comment-13837478 ]

Vinay commented on HDFS-3405:
-----------------------------

Hi all, could someone take a look at the changes? Thanks in advance.

> Checkpointing should use HTTP POST or PUT instead of GET-GET to send merged fsimages
> ------------------------------------------------------------------------------------
>
> Key: HDFS-3405
> URL: https://issues.apache.org/jira/browse/HDFS-3405
> Project: Hadoop HDFS
> Issue Type: Improvement
> Affects Versions: 1.0.0, 3.0.0, 2.0.5-alpha
> Reporter: Aaron T. Myers
> Assignee: Vinay
> Attachments: HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch
>
> As Todd points out in [this comment|https://issues.apache.org/jira/browse/HDFS-3404?focusedCommentId=13272986&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13272986], the current scheme for a checkpointing daemon to upload a merged fsimage file to an NN is to issue an HTTP GET request to tell the target NN to issue another GET request back to the checkpointing daemon to retrieve the merged fsimage file. There's no fundamental reason the checkpointing daemon can't just use an HTTP POST or PUT to send back the merged fsimage file, rather than the double-GET scheme.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
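The single-request alternative the issue describes can be sketched as follows. This is only an illustration of pushing a file with one HTTP PUT; the servlet path used in the usage example is hypothetical, not the actual NameNode image-transfer endpoint.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;

// Illustrative sketch only: a checkpointing daemon pushing a merged fsimage
// to the NN with a single HTTP PUT, instead of a GET that triggers a GET
// back. The endpoint URL is an assumption for the example.
class ImageUploader {
    static int putImage(URL imageServlet, InputStream fsimage) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) imageServlet.openConnection();
        conn.setRequestMethod("PUT");
        conn.setDoOutput(true);                    // request body carries the image
        conn.setChunkedStreamingMode(64 * 1024);   // avoid buffering the whole file
        try (OutputStream out = conn.getOutputStream()) {
            byte[] buf = new byte[8192];
            int n;
            while ((n = fsimage.read(buf)) > 0) {
                out.write(buf, 0, n);
            }
        }
        return conn.getResponseCode();             // receiver acks the upload directly
    }
}
```

Compared with the double-GET scheme, the receiver never has to open a connection back to the uploader, which also simplifies firewall and authentication handling.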
[jira] [Commented] (HDFS-5570) Deprecate hftp / hsftp and replace them with webhdfs / swebhdfs
[ https://issues.apache.org/jira/browse/HDFS-5570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837485#comment-13837485 ]

Hadoop QA commented on HDFS-5570:
---------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12616637/HDFS-5570.000.patch
against trunk revision.

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:green}+1 tests included{color}. The patch appears to include 10 new or modified test files.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.
{color:red}-1 findbugs{color}. The patch appears to introduce 1 new Findbugs (version 1.3.9) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient hadoop-tools/hadoop-distcp hadoop-tools/hadoop-extras:

    org.apache.hadoop.mapreduce.lib.input.TestFixedLengthInputFormat
    org.apache.hadoop.mapred.TestFixedLengthInputFormat
    org.apache.hadoop.mapreduce.security.TestJHSSecurity

The following test timeouts occurred in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-hs hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-jobclient hadoop-tools/hadoop-distcp hadoop-tools/hadoop-extras:

    org.apache.hadoop.tools.TestDelegationTokenFetcher

{color:green}+1 contrib tests{color}. The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5619//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/5619//artifact/trunk/patchprocess/newPatchFindbugsWarningshadoop-hdfs.html
Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5619//console

This message is automatically generated.

> Deprecate hftp / hsftp and replace them with webhdfs / swebhdfs
> ---------------------------------------------------------------
>
> Key: HDFS-5570
> URL: https://issues.apache.org/jira/browse/HDFS-5570
> Project: Hadoop HDFS
> Issue Type: Bug
> Reporter: Haohui Mai
> Assignee: Haohui Mai
> Attachments: HDFS-5570.000.patch
>
> Currently hftp / hsftp only provide a strict subset of the functionality that webhdfs / swebhdfs offer. Notably, hftp / hsftp do not support writes and HA namenodes. Maintaining two pieces of code with similar functionality introduces unnecessary work. Webhdfs has been around since Hadoop 1.0, therefore moving forward with webhdfs does not seem to cause any significant migration issues. This jira proposes to deprecate hftp / hsftp in branch-2 and remove them in trunk.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Created] (HDFS-5592) DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
Vinay created HDFS-5592:
----------------------------

Summary: DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
Key: HDFS-5592
URL: https://issues.apache.org/jira/browse/HDFS-5592
Project: Hadoop HDFS
Issue Type: Bug
Reporter: Vinay
Assignee: Vinay

The following log message in {{FSNamesystem#completeFile(..)}} should be logged only if the file is closed:

{code}
getEditLog().logSync();
NameNode.stateChangeLog.info("DIR* completeFile: " + src + " is closed by " + holder);
return success;
{code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
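The proposed fix amounts to making the log statement conditional on the result. A minimal sketch, with the surrounding FSNamesystem context stripped away and the names taken from the snippet in the report:

```java
// Minimal sketch of the proposed change: the "is closed by" message is
// emitted only when completeFile actually succeeded, instead of
// unconditionally. This is a stand-in, not the real FSNamesystem code.
class CompleteFileLogging {
    static String logLine(boolean success, String src, String holder) {
        if (success) {
            return "DIR* completeFile: " + src + " is closed by " + holder;
        }
        return null;  // an unsuccessful attempt logs nothing at this level
    }
}
```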
[jira] [Updated] (HDFS-5592) DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
[ https://issues.apache.org/jira/browse/HDFS-5592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinay updated HDFS-5592:
------------------------

Attachment: HDFS-5592.patch

Attached the patch.

> DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
> ----------------------------------------------------------------------------------------------------------
>
> Key: HDFS-5592
> URL: https://issues.apache.org/jira/browse/HDFS-5592
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 3.0.0, 2.2.0
> Reporter: Vinay
> Assignee: Vinay
> Attachments: HDFS-5592.patch
>
> The following log message in {{FSNamesystem#completeFile(..)}} should be logged only if the file is closed:
>
> {code}
> getEditLog().logSync();
> NameNode.stateChangeLog.info("DIR* completeFile: " + src + " is closed by " + holder);
> return success;
> {code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Updated] (HDFS-5592) DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
[ https://issues.apache.org/jira/browse/HDFS-5592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinay updated HDFS-5592:
------------------------

Affects Version/s: 3.0.0
                   2.2.0
           Status: Patch Available  (was: Open)

> DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
> ----------------------------------------------------------------------------------------------------------
>
> Key: HDFS-5592
> URL: https://issues.apache.org/jira/browse/HDFS-5592
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 2.2.0, 3.0.0
> Reporter: Vinay
> Assignee: Vinay
> Attachments: HDFS-5592.patch
>
> The following log message in {{FSNamesystem#completeFile(..)}} should be logged only if the file is closed:
>
> {code}
> getEditLog().logSync();
> NameNode.stateChangeLog.info("DIR* completeFile: " + src + " is closed by " + holder);
> return success;
> {code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Updated] (HDFS-5592) DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
[ https://issues.apache.org/jira/browse/HDFS-5592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinay updated HDFS-5592:
------------------------

Affects Version/s: 2.3.0  (was: 2.2.0)

> DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
> ----------------------------------------------------------------------------------------------------------
>
> Key: HDFS-5592
> URL: https://issues.apache.org/jira/browse/HDFS-5592
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 3.0.0, 2.3.0
> Reporter: Vinay
> Assignee: Vinay
> Attachments: HDFS-5592.patch
>
> The following log message in {{FSNamesystem#completeFile(..)}} should be logged only if the file is closed:
>
> {code}
> getEditLog().logSync();
> NameNode.stateChangeLog.info("DIR* completeFile: " + src + " is closed by " + holder);
> return success;
> {code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5536) Implement HTTP policy for Namenode and DataNode
[ https://issues.apache.org/jira/browse/HDFS-5536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837528#comment-13837528 ]

Hadoop QA commented on HDFS-5536:
---------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12616705/HDFS-5536.008.patch
against trunk revision.

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:green}+1 tests included{color}. The patch appears to include 4 new or modified test files.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs:

    org.apache.hadoop.hdfs.web.TestHttpsFileSystem

The test build failed in hadoop-common-project/hadoop-common.

{color:green}+1 contrib tests{color}. The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5621//testReport/
Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5621//console

This message is automatically generated.

> Implement HTTP policy for Namenode and DataNode
> -----------------------------------------------
>
> Key: HDFS-5536
> URL: https://issues.apache.org/jira/browse/HDFS-5536
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Reporter: Haohui Mai
> Assignee: Haohui Mai
> Attachments: HDFS-5536.000.patch, HDFS-5536.001.patch, HDFS-5536.002.patch, HDFS-5536.003.patch, HDFS-5536.004.patch, HDFS-5536.005.patch, HDFS-5536.006.patch, HDFS-5536.007.patch, HDFS-5536.008.patch
>
> This jira implements the HTTP and HTTPS policy in the namenode and the datanode.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5581) NameNodeFsck should use only one instance of BlockPlacementPolicy
[ https://issues.apache.org/jira/browse/HDFS-5581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837577#comment-13837577 ]

Hudson commented on HDFS-5581:
------------------------------

FAILURE: Integrated in Hadoop-Yarn-trunk #410 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/410/])

move HDFS-5581 to 2.3 (cmccabe: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547094)
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt

HDFS-5581. NameNodeFsck should use only one instance of BlockPlacementPolicy (vinay via cmccabe) (cmccabe: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547088)
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/NamenodeFsck.java

> NameNodeFsck should use only one instance of BlockPlacementPolicy
> -----------------------------------------------------------------
>
> Key: HDFS-5581
> URL: https://issues.apache.org/jira/browse/HDFS-5581
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: namenode
> Reporter: Vinay
> Assignee: Vinay
> Fix For: 2.4.0
> Attachments: HDFS-5581.patch, HDFS-5581.patch
>
> While going through NameNodeFsck I found that the following code creates a new instance of BlockPlacementPolicy for every block:
>
> {code}
> // verify block placement policy
> BlockPlacementStatus blockPlacementStatus =
>     BlockPlacementPolicy.getInstance(conf, null, networktopology).
>         verifyBlockPlacement(path, lBlk, targetFileReplication);
> {code}
>
> It would be better to use the namenode's BPP itself instead of creating a new one.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
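The hoisting the issue asks for is a standard refactoring: construct the policy once and reuse it across the per-block loop. A sketch with hypothetical stand-in types (not the real HDFS classes), assuming the policy object is safe to share across calls as the issue implies:

```java
import java.util.List;

// Sketch of reusing one placement-policy instance per fsck run instead of
// calling getInstance(...) once per block. The interface and method names
// here are stand-ins for illustration only.
class FsckSketch {
    interface PlacementPolicy {
        boolean verify(String path, String block, int replication);
    }

    static int countMisplaced(PlacementPolicy policy, String path,
                              List<String> blocks, int replication) {
        int misplaced = 0;
        for (String blk : blocks) {               // one shared policy instance
            if (!policy.verify(path, blk, replication)) {
                misplaced++;
            }
        }
        return misplaced;
    }
}
```

Besides avoiding repeated construction cost, reusing the instance means fsck judges placement with exactly the policy object the rest of the system configured, rather than a freshly built one per block.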
[jira] [Commented] (HDFS-5557) Write pipeline recovery for the last packet in the block may cause rejection of valid replicas
[ https://issues.apache.org/jira/browse/HDFS-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837576#comment-13837576 ]

Hudson commented on HDFS-5557:
------------------------------

FAILURE: Integrated in Hadoop-Yarn-trunk #410 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/410/])

HDFS-5557. Write pipeline recovery for the last packet in the block may cause rejection of valid replicas. Contributed by Kihwal Lee. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547173)
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSOutputStream.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockInfoUnderConstruction.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestClientProtocolForPipelineRecovery.java
* /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestReplicationPolicy.java

> Write pipeline recovery for the last packet in the block may cause rejection of valid replicas
> ----------------------------------------------------------------------------------------------
>
> Key: HDFS-5557
> URL: https://issues.apache.org/jira/browse/HDFS-5557
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 0.23.9, 2.4.0
> Reporter: Kihwal Lee
> Assignee: Kihwal Lee
> Priority: Critical
> Fix For: 3.0.0, 2.4.0, 0.23.10
> Attachments: HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch
>
> When a block is reported from a data node while the block is under construction (i.e. not committed or completed), BlockManager calls BlockInfoUnderConstruction.addReplicaIfNotPresent() to update the reported replica state. But BlockManager is calling it with the stored block, not the reported block. This causes the recorded replicas' gen stamp to be that of BlockInfoUnderConstruction itself, not the one from the reported replica.
>
> When a pipeline recovery is done for the last packet of a block, the incremental block reports with the new gen stamp may come before the client calls updatePipeline(). If this happens, these replicas will be incorrectly recorded with the old gen stamp and get removed later. The result is a close or addAdditionalBlock failure.
>
> If the last block is completed but the penultimate block is not because of this issue, the file won't be closed. If this file is not cleared but the client goes away, the lease manager will try to recover the lease/block, at which point it will crash. I will file a separate jira for this shortly.
>
> The worst case is rejecting all the good replicas and accepting a bad one. In this case, the block will get completed, but the data cannot be read until the next full block report containing one of the valid replicas is received.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5560) Trash configuration log statements prints incorrect units
[ https://issues.apache.org/jira/browse/HDFS-5560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837579#comment-13837579 ]

Hudson commented on HDFS-5560:
------------------------------

FAILURE: Integrated in Hadoop-Yarn-trunk #410 (See [https://builds.apache.org/job/Hadoop-Yarn-trunk/410/])

HDFS-5560. Trash configuration log statements prints incorrect units. Contributed by Josh Elser. (wang: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547266)
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/TrashPolicyDefault.java

> Trash configuration log statements prints incorrect units
> ---------------------------------------------------------
>
> Key: HDFS-5560
> URL: https://issues.apache.org/jira/browse/HDFS-5560
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 2.2.0
> Reporter: Josh Elser
> Assignee: Josh Elser
> Fix For: 2.3.0
> Attachments: HDFS-5560.patch
>
> I ran `hdfs dfs -expunge` on a 2.2.0 system, and noticed the following message printed out on the console:
>
> {noformat}
> $ hdfs dfs -expunge
> 13/11/23 22:12:17 INFO fs.TrashPolicyDefault: Namenode trash configuration: Deletion interval = 180 minutes, Emptier interval = 0 minutes.
> {noformat}
>
> The configuration for both the deletion interval and emptier interval is given in minutes, converted to milliseconds, and then logged as milliseconds but with a label of minutes. It looks like this was introduced in HDFS-4903.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
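The bug class here is simple but common: a value converted for internal use (minutes to milliseconds) is then printed with the original unit's label. A minimal sketch of the corrected formatting, converting back before printing; the class and constant names are stand-ins, not the actual TrashPolicyDefault code:

```java
// Sketch of unit-consistent logging: the intervals are stored internally in
// milliseconds, so convert back to minutes before attaching the "minutes"
// label. Names are illustrative stand-ins.
class TrashLogSketch {
    static final long MSECS_PER_MINUTE = 60 * 1000;

    static String describe(long deletionIntervalMs, long emptierIntervalMs) {
        return "Namenode trash configuration: Deletion interval = "
                + (deletionIntervalMs / MSECS_PER_MINUTE)
                + " minutes, Emptier interval = "
                + (emptierIntervalMs / MSECS_PER_MINUTE) + " minutes.";
    }
}
```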
[jira] [Commented] (HDFS-5592) DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
[ https://issues.apache.org/jira/browse/HDFS-5592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837585#comment-13837585 ]

Uma Maheswara Rao G commented on HDFS-5592:
-------------------------------------------

+1

> DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
> ----------------------------------------------------------------------------------------------------------
>
> Key: HDFS-5592
> URL: https://issues.apache.org/jira/browse/HDFS-5592
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 3.0.0, 2.3.0
> Reporter: Vinay
> Assignee: Vinay
> Attachments: HDFS-5592.patch
>
> The following log message in {{FSNamesystem#completeFile(..)}} should be logged only if the file is closed:
>
> {code}
> getEditLog().logSync();
> NameNode.stateChangeLog.info("DIR* completeFile: " + src + " is closed by " + holder);
> return success;
> {code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5557) Write pipeline recovery for the last packet in the block may cause rejection of valid replicas
[ https://issues.apache.org/jira/browse/HDFS-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837598#comment-13837598 ]

Hudson commented on HDFS-5557:
------------------------------

FAILURE: Integrated in Hadoop-Hdfs-0.23-Build #809 (See [https://builds.apache.org/job/Hadoop-Hdfs-0.23-Build/809/])

svn merge -c 1547173 merging from trunk to branch-0.23 to fix: HDFS-5557. Write pipeline recovery for the last packet in the block may cause rejection of valid replicas. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547181)
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSOutputStream.java
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockInfoUnderConstruction.java
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestClientProtocolForPipelineRecovery.java

> Write pipeline recovery for the last packet in the block may cause rejection of valid replicas
> ----------------------------------------------------------------------------------------------
>
> Key: HDFS-5557
> URL: https://issues.apache.org/jira/browse/HDFS-5557
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 0.23.9, 2.4.0
> Reporter: Kihwal Lee
> Assignee: Kihwal Lee
> Priority: Critical
> Fix For: 3.0.0, 2.4.0, 0.23.10
> Attachments: HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch
>
> When a block is reported from a data node while the block is under construction (i.e. not committed or completed), BlockManager calls BlockInfoUnderConstruction.addReplicaIfNotPresent() to update the reported replica state. But BlockManager is calling it with the stored block, not the reported block. This causes the recorded replicas' gen stamp to be that of BlockInfoUnderConstruction itself, not the one from the reported replica.
>
> When a pipeline recovery is done for the last packet of a block, the incremental block reports with the new gen stamp may come before the client calls updatePipeline(). If this happens, these replicas will be incorrectly recorded with the old gen stamp and get removed later. The result is a close or addAdditionalBlock failure.
>
> If the last block is completed but the penultimate block is not because of this issue, the file won't be closed. If this file is not cleared but the client goes away, the lease manager will try to recover the lease/block, at which point it will crash. I will file a separate jira for this shortly.
>
> The worst case is rejecting all the good replicas and accepting a bad one. In this case, the block will get completed, but the data cannot be read until the next full block report containing one of the valid replicas is received.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5558) LeaseManager monitor thread can crash if the last block is complete but another block is not.
[ https://issues.apache.org/jira/browse/HDFS-5558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837599#comment-13837599 ]

Hudson commented on HDFS-5558:
------------------------------

FAILURE: Integrated in Hadoop-Hdfs-0.23-Build #809 (See [https://builds.apache.org/job/Hadoop-Hdfs-0.23-Build/809/])

HDFS-5558. LeaseManager monitor thread can crash if the last block is complete but another block is not. Contributed by Kihwal Lee. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547197)
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt
* /hadoop/common/branches/branch-0.23/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java

> LeaseManager monitor thread can crash if the last block is complete but another block is not.
> ---------------------------------------------------------------------------------------------
>
> Key: HDFS-5558
> URL: https://issues.apache.org/jira/browse/HDFS-5558
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 0.23.9, 2.4.0
> Reporter: Kihwal Lee
> Assignee: Kihwal Lee
> Attachments: HDFS-5558.branch-023.patch, HDFS-5558.branch-023.patch, HDFS-5558.patch, HDFS-5558.patch
>
> As mentioned in HDFS-5557, if a file has its last and penultimate block not completed and the file is being closed, the last block may be completed but the penultimate one might not. If this condition lasts long and the file is abandoned, LeaseManager will try to recover the lease and the block. But {{internalReleaseLease()}} will fail with an invalid cast exception for this kind of file.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5592) DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
[ https://issues.apache.org/jira/browse/HDFS-5592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837630#comment-13837630 ]

Hadoop QA commented on HDFS-5592:
---------------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12616735/HDFS-5592.patch
against trunk revision.

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:green}+1 core tests{color}. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs.
{color:green}+1 contrib tests{color}. The patch passed contrib unit tests.

Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5622//testReport/
Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5622//console

This message is automatically generated.

> DIR* completeFile: /file is closed by DFSClient_ should be logged only for successful closure of the file.
> ----------------------------------------------------------------------------------------------------------
>
> Key: HDFS-5592
> URL: https://issues.apache.org/jira/browse/HDFS-5592
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 3.0.0, 2.3.0
> Reporter: Vinay
> Assignee: Vinay
> Attachments: HDFS-5592.patch
>
> The following log message in {{FSNamesystem#completeFile(..)}} should be logged only if the file is closed:
>
> {code}
> getEditLog().logSync();
> NameNode.stateChangeLog.info("DIR* completeFile: " + src + " is closed by " + holder);
> return success;
> {code}

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5560) Trash configuration log statements prints incorrect units
[ https://issues.apache.org/jira/browse/HDFS-5560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13837676#comment-13837676 ]

Hudson commented on HDFS-5560:
------------------------------

FAILURE: Integrated in Hadoop-Mapreduce-trunk #1627 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1627/])

HDFS-5560. Trash configuration log statements prints incorrect units. Contributed by Josh Elser. (wang: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1547266)
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt
* /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/TrashPolicyDefault.java

> Trash configuration log statements prints incorrect units
> ---------------------------------------------------------
>
> Key: HDFS-5560
> URL: https://issues.apache.org/jira/browse/HDFS-5560
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 2.2.0
> Reporter: Josh Elser
> Assignee: Josh Elser
> Fix For: 2.3.0
> Attachments: HDFS-5560.patch
>
> I ran `hdfs dfs -expunge` on a 2.2.0 system, and noticed the following message printed out on the console:
>
> {noformat}
> $ hdfs dfs -expunge
> 13/11/23 22:12:17 INFO fs.TrashPolicyDefault: Namenode trash configuration: Deletion interval = 180 minutes, Emptier interval = 0 minutes.
> {noformat}
>
> The configuration for both the deletion interval and emptier interval is given in minutes, converted to milliseconds, and then logged as milliseconds but with a label of minutes. It looks like this was introduced in HDFS-4903.

--
This message was sent by Atlassian JIRA
(v6.1#6144)
[jira] [Commented] (HDFS-5581) NameNodeFsck should use only one instance of BlockPlacementPolicy
[ https://issues.apache.org/jira/browse/HDFS-5581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837675#comment-13837675 ] Hudson commented on HDFS-5581: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1627 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1627/]) move HDFS-5581 to 2.3 (cmccabe: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547094) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt HDFS-5581. NameNodeFsck should use only one instance of BlockPlacementPolicy (vinay via cmccabe) (cmccabe: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547088) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/NamenodeFsck.java NameNodeFsck should use only one instance of BlockPlacementPolicy - Key: HDFS-5581 URL: https://issues.apache.org/jira/browse/HDFS-5581 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Vinay Assignee: Vinay Fix For: 2.4.0 Attachments: HDFS-5581.patch, HDFS-5581.patch While going through NameNodeFsck I found that following code creates the new instance of BlockPlacementPolicy for every block. {code} // verify block placement policy BlockPlacementStatus blockPlacementStatus = BlockPlacementPolicy.getInstance(conf, null, networktopology). verifyBlockPlacement(path, lBlk, targetFileReplication);{code} It would be better to use the namenode's BPP itself instead of creating a new one. -- This message was sent by Atlassian JIRA (v6.1#6144)
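The cost Vinay points out can be illustrated with a self-contained sketch (the `BlockPlacementPolicy` class below is a stand-in with the same shape, not the real Hadoop type): hoisting the instance out of the per-block loop is the whole fix.

```java
// Sketch of the HDFS-5581 change: create one policy instance and reuse it
// for every block, instead of calling the factory per block.
public class PolicyReuseSketch {
    static int instancesCreated = 0;

    // Stand-in for BlockPlacementPolicy.getInstance(...): the real factory
    // builds a new policy object (plus wiring) on every call.
    static class BlockPlacementPolicy {
        BlockPlacementPolicy() { instancesCreated++; }
        boolean verifyBlockPlacement(String path) { return true; }
    }

    // Before: fsck-style loop instantiating a policy per block.
    static int checkPerBlock(int blocks) {
        instancesCreated = 0;
        for (int i = 0; i < blocks; i++) {
            new BlockPlacementPolicy().verifyBlockPlacement("/file");
        }
        return instancesCreated;
    }

    // After: one shared instance, mirroring reuse of the namenode's own BPP.
    static int checkShared(int blocks) {
        instancesCreated = 0;
        BlockPlacementPolicy bpp = new BlockPlacementPolicy();
        for (int i = 0; i < blocks; i++) {
            bpp.verifyBlockPlacement("/file");
        }
        return instancesCreated;
    }

    public static void main(String[] args) {
        System.out.println(checkPerBlock(1000) + " vs " + checkShared(1000)); // 1000 vs 1
    }
}
```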
[jira] [Commented] (HDFS-5557) Write pipeline recovery for the last packet in the block may cause rejection of valid replicas
[ https://issues.apache.org/jira/browse/HDFS-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837674#comment-13837674 ] Hudson commented on HDFS-5557: -- FAILURE: Integrated in Hadoop-Mapreduce-trunk #1627 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1627/]) HDFS-5557. Write pipeline recovery for the last packet in the block may cause rejection of valid replicas. Contributed by Kihwal Lee. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547173) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSOutputStream.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockInfoUnderConstruction.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestClientProtocolForPipelineRecovery.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestReplicationPolicy.java Write pipeline recovery for the last packet in the block may cause rejection of valid replicas -- Key: HDFS-5557 URL: https://issues.apache.org/jira/browse/HDFS-5557 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.9, 2.4.0 Reporter: Kihwal Lee Assignee: Kihwal Lee Priority: Critical Fix For: 3.0.0, 2.4.0, 0.23.10 Attachments: HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch When a block is reported from a data node while the block is under construction (i.e. not committed or completed), BlockManager calls BlockInfoUnderConstruction.addReplicaIfNotPresent() to update the reported replica state. But BlockManager is calling it with the stored block, not reported block. 
This causes the recorded replicas' gen stamp to be that of the BlockInfoUnderConstruction itself, not the one from the reported replica. When a pipeline recovery is done for the last packet of a block, the incremental block reports with the new gen stamp may arrive before the client calls updatePipeline(). If this happens, these replicas will be incorrectly recorded with the old gen stamp and get removed later. The result is a close or addAdditionalBlock failure. If the last block is completed but the penultimate block is not because of this issue, the file won't be closed. If this file is not cleared but the client goes away, the lease manager will try to recover the lease/block, at which point it will crash. I will file a separate jira for this shortly. The worst case is rejecting all the good replicas and accepting a bad one. In that case, the block will get completed, but the data cannot be read until the next full block report containing one of the valid replicas is received. -- This message was sent by Atlassian JIRA (v6.1#6144)
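The stored-vs-reported distinction reduces to a toy sketch (`Block` here is a stand-in, not the Hadoop class, and the gen stamp values are illustrative): the bug records the NameNode's stale gen stamp; the fix keeps the one the datanode actually reported.

```java
// Sketch of the HDFS-5557 root cause: addReplicaIfNotPresent was effectively
// given the stored (under-construction) block rather than the reported one.
public class ReportedGenStampSketch {
    static class Block {
        final long genStamp;
        Block(long genStamp) { this.genStamp = genStamp; }
    }

    // Buggy form: the replica is recorded with the stored block's gen stamp,
    // discarding the gen stamp the datanode reported after pipeline recovery.
    static long recordBuggy(Block storedBlock, Block reportedBlock) {
        return storedBlock.genStamp;
    }

    // Fixed form: keep the reported replica's gen stamp, so a replica written
    // after pipeline recovery is not mistaken for a stale one and removed.
    static long recordFixed(Block storedBlock, Block reportedBlock) {
        return reportedBlock.genStamp;
    }

    public static void main(String[] args) {
        Block stored = new Block(1001);   // old gen stamp held by the NameNode
        Block reported = new Block(1002); // bumped by pipeline recovery
        System.out.println(recordBuggy(stored, reported) + " vs " + recordFixed(stored, reported));
    }
}
```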
[jira] [Commented] (HDFS-5581) NameNodeFsck should use only one instance of BlockPlacementPolicy
[ https://issues.apache.org/jira/browse/HDFS-5581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837689#comment-13837689 ] Hudson commented on HDFS-5581: -- SUCCESS: Integrated in Hadoop-Hdfs-trunk #1601 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1601/]) move HDFS-5581 to 2.3 (cmccabe: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547094) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt HDFS-5581. NameNodeFsck should use only one instance of BlockPlacementPolicy (vinay via cmccabe) (cmccabe: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547088) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/NamenodeFsck.java NameNodeFsck should use only one instance of BlockPlacementPolicy - Key: HDFS-5581 URL: https://issues.apache.org/jira/browse/HDFS-5581 Project: Hadoop HDFS Issue Type: Bug Components: namenode Reporter: Vinay Assignee: Vinay Fix For: 2.4.0 Attachments: HDFS-5581.patch, HDFS-5581.patch While going through NameNodeFsck I found that following code creates the new instance of BlockPlacementPolicy for every block. {code} // verify block placement policy BlockPlacementStatus blockPlacementStatus = BlockPlacementPolicy.getInstance(conf, null, networktopology). verifyBlockPlacement(path, lBlk, targetFileReplication);{code} It would be better to use the namenode's BPP itself instead of creating a new one. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5557) Write pipeline recovery for the last packet in the block may cause rejection of valid replicas
[ https://issues.apache.org/jira/browse/HDFS-5557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837688#comment-13837688 ] Hudson commented on HDFS-5557: -- SUCCESS: Integrated in Hadoop-Hdfs-trunk #1601 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1601/]) HDFS-5557. Write pipeline recovery for the last packet in the block may cause rejection of valid replicas. Contributed by Kihwal Lee. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547173) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/DFSOutputStream.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockInfoUnderConstruction.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/blockmanagement/BlockManager.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/TestClientProtocolForPipelineRecovery.java * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/blockmanagement/TestReplicationPolicy.java Write pipeline recovery for the last packet in the block may cause rejection of valid replicas -- Key: HDFS-5557 URL: https://issues.apache.org/jira/browse/HDFS-5557 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.9, 2.4.0 Reporter: Kihwal Lee Assignee: Kihwal Lee Priority: Critical Fix For: 3.0.0, 2.4.0, 0.23.10 Attachments: HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch, HDFS-5557.patch When a block is reported from a data node while the block is under construction (i.e. not committed or completed), BlockManager calls BlockInfoUnderConstruction.addReplicaIfNotPresent() to update the reported replica state. But BlockManager is calling it with the stored block, not reported block. 
This causes the recorded replicas' gen stamp to be that of the BlockInfoUnderConstruction itself, not the one from the reported replica. When a pipeline recovery is done for the last packet of a block, the incremental block reports with the new gen stamp may arrive before the client calls updatePipeline(). If this happens, these replicas will be incorrectly recorded with the old gen stamp and get removed later. The result is a close or addAdditionalBlock failure. If the last block is completed but the penultimate block is not because of this issue, the file won't be closed. If this file is not cleared but the client goes away, the lease manager will try to recover the lease/block, at which point it will crash. I will file a separate jira for this shortly. The worst case is rejecting all the good replicas and accepting a bad one. In that case, the block will get completed, but the data cannot be read until the next full block report containing one of the valid replicas is received. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5560) Trash configuration log statements prints incorrect units
[ https://issues.apache.org/jira/browse/HDFS-5560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837690#comment-13837690 ] Hudson commented on HDFS-5560: -- SUCCESS: Integrated in Hadoop-Hdfs-trunk #1601 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/1601/]) HDFS-5560. Trash configuration log statements prints incorrect units. Contributed by Josh Elser. (wang: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547266) * /hadoop/common/trunk/hadoop-common-project/hadoop-common/CHANGES.txt * /hadoop/common/trunk/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/TrashPolicyDefault.java Trash configuration log statements prints incorrect units - Key: HDFS-5560 URL: https://issues.apache.org/jira/browse/HDFS-5560 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.2.0 Reporter: Josh Elser Assignee: Josh Elser Fix For: 2.3.0 Attachments: HDFS-5560.patch I ran `hdfs dfs -expunge` on a 2.2.0 system, and noticed the following message printed out on the console: {noformat} $ hdfs dfs -expunge 13/11/23 22:12:17 INFO fs.TrashPolicyDefault: Namenode trash configuration: Deletion interval = 180 minutes, Emptier interval = 0 minutes. {noformat} The deletion interval and emptier interval are both configured in minutes, converted to milliseconds, and then logged as milliseconds but labeled as minutes. It looks like this was introduced in HDFS-4903. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5558) LeaseManager monitor thread can crash if the last block is complete but another block is not.
[ https://issues.apache.org/jira/browse/HDFS-5558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kihwal Lee updated HDFS-5558: - Resolution: Fixed Fix Version/s: 0.23.10 2.4.0 3.0.0 Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Thanks for the reviews. I've committed this to trunk, branch-2 and branch-0.23. LeaseManager monitor thread can crash if the last block is complete but another block is not. - Key: HDFS-5558 URL: https://issues.apache.org/jira/browse/HDFS-5558 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.9, 2.4.0 Reporter: Kihwal Lee Assignee: Kihwal Lee Fix For: 3.0.0, 2.4.0, 0.23.10 Attachments: HDFS-5558.branch-023.patch, HDFS-5558.branch-023.patch, HDFS-5558.patch, HDFS-5558.patch As mentioned in HDFS-5557, if a file has its last and penultimate block not completed and the file is being closed, the last block may be completed but the penultimate one might not. If this condition lasts long and the file is abandoned, LeaseManager will try to recover the lease and the block. But {{internalReleaseLease()}} will fail with invalid cast exception with this kind of file. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5558) LeaseManager monitor thread can crash if the last block is complete but another block is not.
[ https://issues.apache.org/jira/browse/HDFS-5558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837730#comment-13837730 ] Hudson commented on HDFS-5558: -- SUCCESS: Integrated in Hadoop-trunk-Commit #4819 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/4819/]) HDFS-5558. LeaseManager monitor thread can crash if the last block is complete but another block is not. Contributed by Kihwal Lee. (kihwal: http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1547393) * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/CHANGES.txt * /hadoop/common/trunk/hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/namenode/FSNamesystem.java LeaseManager monitor thread can crash if the last block is complete but another block is not. - Key: HDFS-5558 URL: https://issues.apache.org/jira/browse/HDFS-5558 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.9, 2.4.0 Reporter: Kihwal Lee Assignee: Kihwal Lee Fix For: 3.0.0, 2.4.0, 0.23.10 Attachments: HDFS-5558.branch-023.patch, HDFS-5558.branch-023.patch, HDFS-5558.patch, HDFS-5558.patch As mentioned in HDFS-5557, if a file has its last and penultimate block not completed and the file is being closed, the last block may be completed but the penultimate one might not. If this condition lasts long and the file is abandoned, LeaseManager will try to recover the lease and the block. But {{internalReleaseLease()}} will fail with invalid cast exception with this kind of file. -- This message was sent by Atlassian JIRA (v6.1#6144)
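The crash described here is a blind downcast of a block that is no longer under construction. A hedged, self-contained sketch of the kind of guard involved (stand-in classes with hypothetical methods, not the real FSNamesystem/BlockInfo code):

```java
// Sketch of the HDFS-5558 failure mode: check block state before downcasting,
// since the last block can be COMPLETE while the penultimate one is not.
public class LastBlockGuardSketch {
    static class BlockInfo {
        private final boolean complete;
        BlockInfo(boolean complete) { this.complete = complete; }
        boolean isComplete() { return complete; }
    }
    static class BlockInfoUnderConstruction extends BlockInfo {
        BlockInfoUnderConstruction() { super(false); }
        void initializeRecovery() { /* kick off block recovery */ }
    }

    // Guarded form: an unconditional cast of a completed block here would
    // throw ClassCastException and kill the LeaseManager monitor thread.
    static boolean tryRecover(BlockInfo lastBlock) {
        if (lastBlock.isComplete()) {
            return false; // nothing to recover for a completed block
        }
        BlockInfoUnderConstruction uc = (BlockInfoUnderConstruction) lastBlock; // safe here
        uc.initializeRecovery();
        return true;
    }

    public static void main(String[] args) {
        System.out.println(tryRecover(new BlockInfo(true)));              // false
        System.out.println(tryRecover(new BlockInfoUnderConstruction())); // true
    }
}
```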
[jira] [Resolved] (HDFS-5484) StorageType and State in DatanodeStorageInfo in NameNode is not accurate
[ https://issues.apache.org/jira/browse/HDFS-5484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal resolved HDFS-5484. - Resolution: Fixed Fix Version/s: Heterogeneous Storage (HDFS-2832) Hadoop Flags: Reviewed +1 for the updated patch. I committed it to branch HDFS-2832. I agree that the test for this will take some work, but we will need it once we start exposing Storage Types to applications. I will make a note in the test plan. Thanks Eric! StorageType and State in DatanodeStorageInfo in NameNode is not accurate Key: HDFS-5484 URL: https://issues.apache.org/jira/browse/HDFS-5484 Project: Hadoop HDFS Issue Type: Sub-task Components: datanode Affects Versions: Heterogeneous Storage (HDFS-2832) Reporter: Eric Sirianni Fix For: Heterogeneous Storage (HDFS-2832) Attachments: HDFS-5484-HDFS-2832--2.patch, HDFS-5484-HDFS-2832.patch The fields in DatanodeStorageInfo are updated from two distinct paths:
# block reports
# storage reports (via heartbeats)
The {{state}} and {{storageType}} fields are updated via the block report. However, as seen in the code below, these fields are populated from a dummy {{DatanodeStorage}} object constructed in the DataNode:
{code}
BPServiceActor.blockReport() {
  //...
  // Dummy DatanodeStorage object just for sending the block report.
  DatanodeStorage dnStorage = new DatanodeStorage(storageID);
  //...
}
{code}
The net effect is that the {{state}} and {{storageType}} fields are always the defaults of {{NORMAL}} and {{DISK}} in the NameNode. The recommended fix is to change {{FsDatasetSpi.getBlockReports()}} from:
{code}
public Map<String, BlockListAsLongs> getBlockReports(String bpid);
{code}
to:
{code}
public Map<DatanodeStorage, BlockListAsLongs> getBlockReports(String bpid);
{code}
thereby allowing {{BPServiceActor}} to send the real {{DatanodeStorage}} object with the block report. -- This message was sent by Atlassian JIRA (v6.1#6144)
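The signature change can be mimicked in a self-contained sketch (stand-in types; `long[]` stands in for `BlockListAsLongs`): keying the report by the full storage object, rather than a bare ID string, is what lets the receiver see the real state and storage type.

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of the HDFS-5484 fix: report blocks keyed by a DatanodeStorage
// stand-in that carries state and type, not just a storage-ID string.
public class BlockReportKeySketch {
    enum State { NORMAL, READ_ONLY }
    enum StorageType { DISK, SSD }

    static class DatanodeStorage {
        final String storageID; final State state; final StorageType type;
        DatanodeStorage(String id, State state, StorageType type) {
            this.storageID = id; this.state = state; this.type = type;
        }
    }

    static Map<DatanodeStorage, long[]> getBlockReports() {
        Map<DatanodeStorage, long[]> reports = new HashMap<DatanodeStorage, long[]>();
        reports.put(new DatanodeStorage("DS-1", State.NORMAL, StorageType.SSD),
                    new long[] { 1L, 2L });
        return reports;
    }

    public static void main(String[] args) {
        for (DatanodeStorage s : getBlockReports().keySet()) {
            // The receiver no longer has to assume NORMAL/DISK defaults.
            System.out.println(s.storageID + " " + s.state + " " + s.type);
        }
    }
}
```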
[jira] [Commented] (HDFS-5453) Support fine grain locking in FSNamesystem
[ https://issues.apache.org/jira/browse/HDFS-5453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837875#comment-13837875 ] Daryn Sharp commented on HDFS-5453: --- [~sureshms] Sorry for the delay in response, I've been on vacation. The initial simplistic implementation, w/o any fsdir lock changes and with direct fsn access (no RPC), has a modest throughput improvement of ~2-15% depending on various read/write workloads of only listStatus/mkdir/delete, with an ideal scenario of low path contention in the namesystem. In practice, other subsystems unnecessarily write-locking the namesystem will probably negate most gains until they are addressed too. A gain is achieved if handler threads have passed through the fsn lock(s), resolved their path, checked permissions, and are blocked on the fsdir lock - as opposed to all read/write handlers being blocked on the global fsn lock during any write op. I don't have the numbers handy, but complete removal of the fsdir lock (not yet feasible due to the non-thread-safe data structures it protects) and desync of a few other methods such as UGI.getCurrentUser produced a multi-fold throughput improvement. At the moment, I only intend to lay the groundwork for larger changes. Support fine grain locking in FSNamesystem -- Key: HDFS-5453 URL: https://issues.apache.org/jira/browse/HDFS-5453 Project: Hadoop HDFS Issue Type: New Feature Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp The namesystem currently uses a coarse-grained lock to control access. This prevents concurrent writers in different branches of the tree, and prevents readers from accessing branches that writers aren't using. Features that introduce latency to namesystem operations, such as cold storage of inodes, will need fine-grained locking to avoid degrading the entire namesystem's throughput. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-2832) Enable support for heterogeneous storages in HDFS
[ https://issues.apache.org/jira/browse/HDFS-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-2832: Attachment: 20131203-HeterogeneousStorage-TestPlan.pdf Enable support for heterogeneous storages in HDFS - Key: HDFS-2832 URL: https://issues.apache.org/jira/browse/HDFS-2832 Project: Hadoop HDFS Issue Type: New Feature Affects Versions: 0.24.0 Reporter: Suresh Srinivas Assignee: Suresh Srinivas Attachments: 20130813-HeterogeneousStorage.pdf, 20131125-HeterogeneousStorage-TestPlan.pdf, 20131125-HeterogeneousStorage.pdf, 20131202-HeterogeneousStorage-TestPlan.pdf, 20131203-HeterogeneousStorage-TestPlan.pdf, H2832_20131107.patch, editsStored, h2832_20131023.patch, h2832_20131023b.patch, h2832_20131025.patch, h2832_20131028.patch, h2832_20131028b.patch, h2832_20131029.patch, h2832_20131103.patch, h2832_20131104.patch, h2832_20131105.patch, h2832_20131107b.patch, h2832_20131108.patch, h2832_20131110.patch, h2832_20131110b.patch, h2832_2013.patch, h2832_20131112.patch, h2832_20131112b.patch, h2832_20131114.patch, h2832_20131118.patch, h2832_20131119.patch, h2832_20131119b.patch, h2832_20131121.patch, h2832_20131122.patch, h2832_20131122b.patch, h2832_20131123.patch, h2832_20131124.patch, h2832_20131202.patch HDFS currently supports configuration where storages are a list of directories. Typically each of these directories correspond to a volume with its own file system. All these directories are homogeneous and therefore identified as a single storage at the namenode. I propose, change to the current model where Datanode * is a * storage, to Datanode * is a collection * of strorages. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-2832) Enable support for heterogeneous storages in HDFS
[ https://issues.apache.org/jira/browse/HDFS-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837894#comment-13837894 ] Hadoop QA commented on HDFS-2832: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12616810/20131203-HeterogeneousStorage-TestPlan.pdf against trunk revision . {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5623//console This message is automatically generated. Enable support for heterogeneous storages in HDFS - Key: HDFS-2832 URL: https://issues.apache.org/jira/browse/HDFS-2832 Project: Hadoop HDFS Issue Type: New Feature Affects Versions: 0.24.0 Reporter: Suresh Srinivas Assignee: Suresh Srinivas Attachments: 20130813-HeterogeneousStorage.pdf, 20131125-HeterogeneousStorage-TestPlan.pdf, 20131125-HeterogeneousStorage.pdf, 20131202-HeterogeneousStorage-TestPlan.pdf, 20131203-HeterogeneousStorage-TestPlan.pdf, H2832_20131107.patch, editsStored, h2832_20131023.patch, h2832_20131023b.patch, h2832_20131025.patch, h2832_20131028.patch, h2832_20131028b.patch, h2832_20131029.patch, h2832_20131103.patch, h2832_20131104.patch, h2832_20131105.patch, h2832_20131107b.patch, h2832_20131108.patch, h2832_20131110.patch, h2832_20131110b.patch, h2832_2013.patch, h2832_20131112.patch, h2832_20131112b.patch, h2832_20131114.patch, h2832_20131118.patch, h2832_20131119.patch, h2832_20131119b.patch, h2832_20131121.patch, h2832_20131122.patch, h2832_20131122b.patch, h2832_20131123.patch, h2832_20131124.patch, h2832_20131202.patch HDFS currently supports configuration where storages are a list of directories. Typically each of these directories correspond to a volume with its own file system. All these directories are homogeneous and therefore identified as a single storage at the namenode. 
I propose changing the current model, where a Datanode *is a* storage, to one where a Datanode *is a collection of* storages. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-2832) Enable support for heterogeneous storages in HDFS
[ https://issues.apache.org/jira/browse/HDFS-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arpit Agarwal updated HDFS-2832: Attachment: h2832_20131203.patch Updated merge patch to resolve recent conflicts in trunk. Enable support for heterogeneous storages in HDFS - Key: HDFS-2832 URL: https://issues.apache.org/jira/browse/HDFS-2832 Project: Hadoop HDFS Issue Type: New Feature Affects Versions: 0.24.0 Reporter: Suresh Srinivas Assignee: Suresh Srinivas Attachments: 20130813-HeterogeneousStorage.pdf, 20131125-HeterogeneousStorage-TestPlan.pdf, 20131125-HeterogeneousStorage.pdf, 20131202-HeterogeneousStorage-TestPlan.pdf, 20131203-HeterogeneousStorage-TestPlan.pdf, H2832_20131107.patch, editsStored, h2832_20131023.patch, h2832_20131023b.patch, h2832_20131025.patch, h2832_20131028.patch, h2832_20131028b.patch, h2832_20131029.patch, h2832_20131103.patch, h2832_20131104.patch, h2832_20131105.patch, h2832_20131107b.patch, h2832_20131108.patch, h2832_20131110.patch, h2832_20131110b.patch, h2832_2013.patch, h2832_20131112.patch, h2832_20131112b.patch, h2832_20131114.patch, h2832_20131118.patch, h2832_20131119.patch, h2832_20131119b.patch, h2832_20131121.patch, h2832_20131122.patch, h2832_20131122b.patch, h2832_20131123.patch, h2832_20131124.patch, h2832_20131202.patch, h2832_20131203.patch HDFS currently supports configuration where storages are a list of directories. Typically each of these directories correspond to a volume with its own file system. All these directories are homogeneous and therefore identified as a single storage at the namenode. I propose, change to the current model where Datanode * is a * storage, to Datanode * is a collection * of strorages. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837952#comment-13837952 ] Daryn Sharp commented on HDFS-5514: --- Yes, I meant coarse, nice catch. :) I think I can quickly explain the basic approach w/o a couple page doc, if not, let me know and I'll write something up. The reason for creating a separate class is to move towards providing a pluggable lock context for ops to use. The default will be coarse as it is today to not destabilize the NN. The general principle is changing the pattern of: {code} readLock(); try { ... } finally { readUnlock(); } {code} to something more like: {code} LockContext lockContext = fsLock.getLockContext(LockState.READ); try { ... lockContext.readLock(path); lockContext.writeLock(path); lockContext.writeLockParent(path); ... } finally { lockContext.unlock(); } {code} Use of the context is optional. The existing fsn readLock/writeLock methods will continue to exist to avoid changing anything but {{FSNamesystem}}. This also means not every fsn method needs to be converted immediately. For a coarse lock context, the initial lock state is applied to the global rw lock now wrapped by the {{FSNamesystemLock}} in this patch. The path locking methods are no-ops. Existing fsn readLock/writeLock methods will use the same global lock - which is how I localize the changes to the fsn. For the initial finer grain lock context, which I'll post in yet another future subtask, the context's initial lock state is effectively always a read lock on the global lock to prevent safemode/HA transitions. The path locking methods associate unique rw locks with inodes on a demand basis (the NN won't create or maintain a lock for every single inode). These locks are tracked in the context. However, these are only implementation details of a specific context for finer grain locking. Other implementations may do something completely different. 
I hope to use a lock-free scoreboard in the future to control/schedule which handlers are allowed to execute - but again just an implementation detail. The initial prototype I'm hoping to implement via the parent jira will not require lock changes to classes other than {{FSNamesystem}} as described above. I just need the {{FSNameSystemLock}} abstraction in this jira, and the addition of the aforementioned path locking apis I'll post in another jira. FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5569) WebHDFS should support a deny/allow list for data access
[ https://issues.apache.org/jira/browse/HDFS-5569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837959#comment-13837959 ] Adam Faris commented on HDFS-5569: -- I'll attempt to answer the outstanding questions. Please let me know if I missed anything. Haohui Mai: What is authorization? Authorization is a function that specifies access rights to resources. http://en.wikipedia.org/wiki/Authorization For example, I have a US passport that has my name and photo and is not expired. I walk into a bank and hand my passport to the bank teller, who looks at the photo, recognizes my face, and verifies the date and watermarks. Everything checks out and I am now authenticated to the bank. I now ask the teller to withdraw $100 million from Doug Cutting's bank account. The bank teller checks to see if I have access and says I am not authorized to make the withdrawal. Kerberos only provides authentication, not authorization. In this example my passport is the TGT and the bank teller is WebHDFS. WebHDFS needs to have better authorization built into it. How about using a transparent proxy? Using nginx or traffic server is an interesting idea, but it's not a good solution. One needs to deploy the proxy software and configs to all nodes. Then how would the URL mappings work? One asks the namenode for file locations, and the 307 response would point to the wrong port on the datanode. What about troubleshooting GSSAPI errors? Is it the client or the proxy? Having personally supported the CDN at Yahoo!, I know firsthand the issues of troubleshooting web applications that use proxies. Alejandro Abdelnur: Reverse DNS lookup penalties? Assuming we are filtering by hostname and not IP networks, reverse DNS lookups being a blocker for this request is where we will have to agree to disagree. While theoretically true, in practice one more DNS query is not going to make a difference to an individual datanode.
Even if attempting to DDOS the cluster with client connections, there will be other problems before reverse lookup resolution becomes the blocker. Why not use HttpFS/Hoop? I'm unable to find references to HttpFS/Hoop in the 1.2.1 (stable) source tree, so it appears to be a 2.x feature? If HttpFS/Hoop is compatible with hadoop 1.2.x, it's going to have the above-mentioned proxy issues. Troubleshooting client requests is going to be more complicated, and configuration and deployment are going to be more complicated, as we now have to securely manage tomcat. Using a proxy comes with a lot of overhead and is not a good solution for this request. Alejandro's comment on using tomcat to support my request is almost spot on. But instead of tomcat supporting the access control feature, it should be jetty, as jetty offers the ability to block by source IP and is already included with Hadoop. This is why I opened this JIRA: WebHDFS needs to be updated to offer the ability to block or grant access by IP. Thanks. WebHDFS should support a deny/allow list for data access Key: HDFS-5569 URL: https://issues.apache.org/jira/browse/HDFS-5569 Project: Hadoop HDFS Issue Type: Improvement Components: webhdfs Reporter: Adam Faris Labels: features Currently we can't restrict what networks are allowed to transfer data using WebHDFS. Obviously we can use firewalls to block ports, but this can be complicated and problematic to maintain. Additionally, because all the jetty servlets run inside the same container, blocking access to jetty to prevent WebHDFS transfers also blocks the other servlets running inside that same jetty container. I am requesting a deny/allow feature be added to WebHDFS. This is already done with the Apache HTTPD server, and is what I'd like to see the deny/allow list modeled after. Thanks. -- This message was sent by Atlassian JIRA (v6.1#6144)
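Apache-httpd-style deny/allow semantics of the kind requested could be sketched as follows. This is a hypothetical helper, not an existing WebHDFS or Jetty API; deny rules win over allow rules, and an empty allow list means "allow anything not denied". Real IP matching would use CIDR masks rather than string prefixes.

```java
import java.util.Arrays;
import java.util.List;

// Sketch of deny/allow filtering for incoming WebHDFS requests.
public class WebHdfsIpFilterSketch {
    static boolean permitted(String remoteAddr, List<String> deny, List<String> allow) {
        for (String prefix : deny) {
            if (remoteAddr.startsWith(prefix)) return false; // deny rules checked first
        }
        if (allow.isEmpty()) return true; // no allow list: default-allow
        for (String prefix : allow) {
            if (remoteAddr.startsWith(prefix)) return true;
        }
        return false; // allow list present but no match
    }

    public static void main(String[] args) {
        List<String> deny = Arrays.asList("10.1.2.");
        List<String> allow = Arrays.asList("10.1.");
        System.out.println(permitted("10.1.3.4", deny, allow));     // in allowed range
        System.out.println(permitted("10.1.2.9", deny, allow));     // explicitly denied
        System.out.println(permitted("192.168.0.1", deny, allow));  // not in allow list
    }
}
```

A servlet filter in the datanode's jetty container could call a check like this against the request's remote address before serving data.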
[jira] [Commented] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837962#comment-13837962 ] Daryn Sharp commented on HDFS-5514: --- [~cnauroth] Regarding #2, that's technically a pre-existing gap in test coverage. Actually, those two methods were recently added by Kihwal's getContentSummary change and don't appear to be used by anything else. I'll probably discard those methods when I adapt the content summary to work with a lock context, so there's likely not much value in adding transient tests, but I will certainly add them if you like. FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837980#comment-13837980 ] Chris Nauroth commented on HDFS-5514: - Thanks, Daryn. On #2, I think verifying that the lock implementation supports reentrancy is more significant than the counts returned from {{getReadHoldCount}} and {{getWriteHoldCount}}. (i.e. Locking twice in a row is expected to succeed with no exception thrown.) I'll defer to you on whether or not you think that's a valuable test here. If not, then I think the only remaining thing is the {{coarseLock}} rename. FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. -- This message was sent by Atlassian JIRA (v6.1#6144)
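The reentrancy behavior under discussion can be demonstrated directly against the JDK's {{ReentrantReadWriteLock}}; this is a minimal standalone sketch of the property being tested, not code from the patch:

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

public class ReentrancyCheck {
    // Returns {readHoldCount, writeHoldCount} observed after acquiring each
    // lock twice from the same thread; a reentrant lock must not block here.
    public static int[] holdCounts() {
        ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
        lock.readLock().lock();
        lock.readLock().lock();       // reentrant re-acquire: succeeds, no exception
        int reads = lock.getReadHoldCount();
        lock.readLock().unlock();
        lock.readLock().unlock();

        lock.writeLock().lock();
        lock.writeLock().lock();      // reentrant write re-acquire
        int writes = lock.getWriteHoldCount();
        lock.writeLock().unlock();
        lock.writeLock().unlock();
        return new int[] { reads, writes };
    }

    public static void main(String[] args) {
        int[] c = holdCounts();
        System.out.println(c[0] + " " + c[1]);  // prints "2 2"
    }
}
```

Any wrapper class delegating to the rwLock should preserve these counts for its callers.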
[jira] [Commented] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13837991#comment-13837991 ] Daryn Sharp commented on HDFS-5514: --- I completely agree that reentrancy testing is critical for a new lock implementation. In this jira, there is no change in existing functionality other than the fsn lock becomes a new class that delegates to a lock. Although, the patch has gone stale and I need to update it to add the hold count methods to the fsnl. FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daryn Sharp updated HDFS-5514: -- Attachment: HDFS-5514.patch I stand corrected; the patch already delegated those methods. I made the following changes per feedback: # Corrected misspelling of coarse # Enhanced hold count tests for reentrancy since they were simple to do - and I forgot my tests are already using the hold counts! # Removed unnecessary import in test file FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch, HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5514: Hadoop Flags: Reviewed +1 pending Jenkins. Thanks, Daryn! FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch, HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-3405) Checkpointing should use HTTP POST or PUT instead of GET-GET to send merged fsimages
[ https://issues.apache.org/jira/browse/HDFS-3405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838014#comment-13838014 ] Haohui Mai commented on HDFS-3405: -- I'll take a more detailed look this week. After a quick skim, it seems that you're using Apache http client to make the put request -- this won't work if it is going through HTTPS channels, since it does not load the certificates. The recommended way is to open the connection through URLConnectionFactory and to change the connConfigurator. Here is the sketch:
{code}
class PutConnConfigurator implements ConnectionConfigurator {
  private final ConnectionConfigurator prev;

  PutConnConfigurator(ConnectionConfigurator prev) {
    this.prev = prev;
  }

  @Override
  public HttpURLConnection configure(HttpURLConnection conn) throws IOException {
    prev.configure(conn);
    conn.setRequestMethod("PUT");
    return conn;
  }
}

ConnectionConfigurator putConf = new PutConnConfigurator(factory.getConnConfigurator());
URLConnectionFactory newFactory = new URLConnectionFactory(putConf);
URLConnection conn = newFactory.openConnection(...);
{code}
Checkpointing should use HTTP POST or PUT instead of GET-GET to send merged fsimages Key: HDFS-3405 URL: https://issues.apache.org/jira/browse/HDFS-3405 Project: Hadoop HDFS Issue Type: Improvement Affects Versions: 1.0.0, 3.0.0, 2.0.5-alpha Reporter: Aaron T. Myers Assignee: Vinay Attachments: HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch, HDFS-3405.patch As Todd points out in [this comment|https://issues.apache.org/jira/browse/HDFS-3404?focusedCommentId=13272986page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13272986], the current scheme for a checkpointing daemon to upload a merged fsimage file to an NN is to issue an HTTP GET request to tell the target NN to issue another GET request back to the checkpointing daemon to retrieve the merged fsimage file.
There's no fundamental reason the checkpointing daemon can't just use an HTTP POST or PUT to send back the merged fsimage file, rather than the double-GET scheme. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-4983) Numeric usernames do not work with WebHDFS FS
[ https://issues.apache.org/jira/browse/HDFS-4983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838019#comment-13838019 ] Andrew Wang commented on HDFS-4983: --- Hey Yongjun, thanks for providing the patch. A few review comments, mostly nitty stuff, otherwise looks good. It's worth checking out http://blog.cloudera.com/blog/2013/05/how-to-configure-eclipse-for-hadoop-contributions/ to get the eclipse auto-formatter if you aren't using it yet, then these things are mostly done for you: {code} //set user pattern based on configuration file {code} Usually we put a space between the slashes and the comment. I see the comment below also lacks a space, you could add one there too if you like.
{code}
public static final String USER_PATTERN_KEY = "webhdfs.user.provider.user.pattern";
/** Default user name pattern value */
public static final String USER_PATTERN_DEFAULT = "^[A-Za-z_][A-Za-z0-9._-]*[$]?$";
{code}
We normally double indent wrapped lines. In the new proxy user test, I think we can chop out the not-superuser and permission stuff copied from the other test. Basically, doing any WebHDFS operation with a numeric proxy user should suffice (but please do verify!). Numeric usernames do not work with WebHDFS FS - Key: HDFS-4983 URL: https://issues.apache.org/jira/browse/HDFS-4983 Project: Hadoop HDFS Issue Type: Improvement Components: webhdfs Affects Versions: 2.0.0-alpha Reporter: Harsh J Assignee: Yongjun Zhang Labels: patch Attachments: HDFS-4983.001.patch Per the file hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/web/resources/UserParam.java, the DOMAIN pattern is set to: {{^[A-Za-z_][A-Za-z0-9._-]*[$]?$}}.
Given this, using a username such as 123 seems to fail for some reason (tried on insecure setup): {code} [123@host-1 ~]$ whoami 123 [123@host-1 ~]$ hadoop fs -fs webhdfs://host-2.domain.com -ls / -ls: Invalid value: 123 does not belong to the domain ^[A-Za-z_][A-Za-z0-9._-]*[$]?$ Usage: hadoop fs [generic options] -ls [-d] [-h] [-R] [path ...] {code} -- This message was sent by Atlassian JIRA (v6.1#6144)
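The rejection of the username 123 follows directly from the default pattern quoted in the issue description; a standalone demonstration with {{java.util.regex}} (the class and field names here are illustrative, not the WebHDFS source):

```java
import java.util.regex.Pattern;

public class UserPatternDemo {
    // The DOMAIN pattern quoted in the issue description.
    static final Pattern DOMAIN =
        Pattern.compile("^[A-Za-z_][A-Za-z0-9._-]*[$]?$");

    public static void main(String[] args) {
        System.out.println(DOMAIN.matcher("hdfs").matches()); // true
        System.out.println(DOMAIN.matcher("123").matches());  // false: first char must be a letter or '_'
    }
}
```

Making the pattern configurable (as the patch proposes via a `webhdfs.user.provider.user.pattern` key) lets deployments with numeric usernames relax the first-character restriction.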
[jira] [Updated] (HDFS-5582) hdfs getconf -excludeFile or -includeFile always failed
[ https://issues.apache.org/jira/browse/HDFS-5582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wang updated HDFS-5582: -- Assignee: sathish hdfs getconf -excludeFile or -includeFile always failed --- Key: HDFS-5582 URL: https://issues.apache.org/jira/browse/HDFS-5582 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 2.1.0-beta, 2.2.0 Reporter: Henry Hung Assignee: sathish Priority: Minor Attachments: 1-HDFS-5582.patch In hadoop-2.2.0, if you execute getconf for exclude and include file, it will return this error message: {code} [hadoop@fphd1 hadoop-2.2.0]$ bin/hdfs getconf -excludeFile Configuration DFSConfigKeys.DFS_HOSTS_EXCLUDE is missing. [hadoop@fphd1 hadoop-2.2.0]$ bin/hdfs getconf -includeFile Configuration DFSConfigKeys.DFS_HOSTS is missing. {code} I found out the root cause is very simple, it’s because the source code of {{org/apache/hadoop/hdfs/tools/GetConf.java}} hard-coded it to the string literals {{"DFSConfigKeys.DFS_HOSTS"}} and {{"DFSConfigKeys.DFS_HOSTS_EXCLUDE"}}:
{code}
map.put(INCLUDE_FILE.getName().toLowerCase(),
    new CommandHandler("DFSConfigKeys.DFS_HOSTS"));
map.put(EXCLUDE_FILE.getName().toLowerCase(),
    new CommandHandler("DFSConfigKeys.DFS_HOSTS_EXCLUDE"));
{code}
A simple fix would be to remove the quotes:
{code}
map.put(INCLUDE_FILE.getName().toLowerCase(),
    new CommandHandler(DFSConfigKeys.DFS_HOSTS));
map.put(EXCLUDE_FILE.getName().toLowerCase(),
    new CommandHandler(DFSConfigKeys.DFS_HOSTS_EXCLUDE));
{code}
-- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-4997) libhdfs doesn't return correct error codes in most cases
[ https://issues.apache.org/jira/browse/HDFS-4997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-4997: -- Target Version/s: (was: ) Release Note: libhdfs now returns correct codes in errno. Previously, due to a bug, many functions set errno to 255 instead of the more specific error code. Hadoop Flags: Incompatible change,Reviewed libhdfs doesn't return correct error codes in most cases Key: HDFS-4997 URL: https://issues.apache.org/jira/browse/HDFS-4997 Project: Hadoop HDFS Issue Type: Bug Components: libhdfs Reporter: Colin Patrick McCabe Assignee: Colin Patrick McCabe Attachments: HDFS-4997.001.patch libhdfs has some code to translate Java exceptions into C error codes. Unfortunately, the exceptions are returned to us in dotted format, but the code is expecting them to be in slash-separated format. This results in most exceptions just leading to a generic error code. We should fix this and add a unit test to ensure this continues to work. -- This message was sent by Atlassian JIRA (v6.1#6144)
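The dotted-vs-slash mismatch described above is the usual gap between `Throwable.getClass().getName()` (dotted) and JNI-style internal class names (slash-separated). A minimal sketch of the normalization the fix implies; this is illustrative, not the actual libhdfs code:

```java
public class ExceptionNameFormat {
    // Throwable.getClass().getName() returns "java.io.FileNotFoundException",
    // while JNI internal names use "java/io/FileNotFoundException". A lookup
    // table keyed on one format silently misses names in the other, which is
    // how most exceptions fell through to the generic error code.
    public static String toSlashed(String dottedName) {
        return dottedName.replace('.', '/');
    }

    public static void main(String[] args) {
        System.out.println(toSlashed("java.io.FileNotFoundException"));
        // prints "java/io/FileNotFoundException"
    }
}
```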
[jira] [Created] (HDFS-5594) FileSystem API for ACLs.
Chris Nauroth created HDFS-5594: --- Summary: FileSystem API for ACLs. Key: HDFS-5594 URL: https://issues.apache.org/jira/browse/HDFS-5594 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client Reporter: Chris Nauroth Add new methods to {{FileSystem}} and {{FileContext}} for manipulating ACLs. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5599) DistributedFileSystem: add support for recursive flag in ACL methods.
Chris Nauroth created HDFS-5599: --- Summary: DistributedFileSystem: add support for recursive flag in ACL methods. Key: HDFS-5599 URL: https://issues.apache.org/jira/browse/HDFS-5599 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client, namenode Reporter: Chris Nauroth Implement and test handling of recursive flag for all ACL methods in {{DistributedFileSystem}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5598) DistributedFileSystem: implement removeDefaultAcl.
Chris Nauroth created HDFS-5598: --- Summary: DistributedFileSystem: implement removeDefaultAcl. Key: HDFS-5598 URL: https://issues.apache.org/jira/browse/HDFS-5598 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client, namenode Reporter: Chris Nauroth Implement and test {{removeDefaultAcl}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5595) NameNode: implement AclManager as abstraction over INode ACL Map.
Chris Nauroth created HDFS-5595: --- Summary: NameNode: implement AclManager as abstraction over INode ACL Map. Key: HDFS-5595 URL: https://issues.apache.org/jira/browse/HDFS-5595 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Complete an initial implementation of {{AclManager}} to enable further development tasks. This will be a basic implementation using the INode ACL Map to track associations between inodes and ACLs. This will not fully implement all of the optimizations discussed in the design doc. Further optimization work will be tracked in separate tasks. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5597) DistributedFileSystem: implement modifyAclEntries, removeAclEntries and removeAcl.
Chris Nauroth created HDFS-5597: --- Summary: DistributedFileSystem: implement modifyAclEntries, removeAclEntries and removeAcl. Key: HDFS-5597 URL: https://issues.apache.org/jira/browse/HDFS-5597 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client, namenode Reporter: Chris Nauroth Implement and test {{modifyAclEntries}}, {{removeAclEntries}} and {{removeAcl}} in {{DistributedFileSystem}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5596) DistributedFileSystem: implement getAcls and setAcl.
Chris Nauroth created HDFS-5596: --- Summary: DistributedFileSystem: implement getAcls and setAcl. Key: HDFS-5596 URL: https://issues.apache.org/jira/browse/HDFS-5596 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client, namenode Reporter: Chris Nauroth Implement and test {{getAcls}} and {{setAcl}} in {{DistributedFileSystem}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5602) FsShell CLI: add setfacl flag for removal of default ACL entries.
Chris Nauroth created HDFS-5602: --- Summary: FsShell CLI: add setfacl flag for removal of default ACL entries. Key: HDFS-5602 URL: https://issues.apache.org/jira/browse/HDFS-5602 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Reporter: Chris Nauroth Implement and test setfacl support for removal of just the default entries in an ACL. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5606) libHDFS: implement hdfsRemoveDefaultAcl.
Chris Nauroth created HDFS-5606: --- Summary: libHDFS: implement hdfsRemoveDefaultAcl. Key: HDFS-5606 URL: https://issues.apache.org/jira/browse/HDFS-5606 Project: Hadoop HDFS Issue Type: Sub-task Components: libhdfs Reporter: Chris Nauroth Implement and test {{hdfsRemoveDefaultAcl}} in libHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5601) FsShell CLI: add setfacl flags for ACL entry modification and removal.
Chris Nauroth created HDFS-5601: --- Summary: FsShell CLI: add setfacl flags for ACL entry modification and removal. Key: HDFS-5601 URL: https://issues.apache.org/jira/browse/HDFS-5601 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Reporter: Chris Nauroth Implement and test setfacl support for flags that allow partial modification of an ACL and modification of specific ACL entries. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5605) libHDFS: implement hdfsModifyAclEntries, hdfsRemoveAclEntries and hdfsRemoveAcl.
Chris Nauroth created HDFS-5605: --- Summary: libHDFS: implement hdfsModifyAclEntries, hdfsRemoveAclEntries and hdfsRemoveAcl. Key: HDFS-5605 URL: https://issues.apache.org/jira/browse/HDFS-5605 Project: Hadoop HDFS Issue Type: Sub-task Components: libhdfs Reporter: Chris Nauroth Implement and test {{hdfsModifyAclEntries}} and {{hdfsRemoveAclEntries}} in libHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5604) libHDFS: implement hdfsGetAcls and hdfsSetAcl.
Chris Nauroth created HDFS-5604: --- Summary: libHDFS: implement hdfsGetAcls and hdfsSetAcl. Key: HDFS-5604 URL: https://issues.apache.org/jira/browse/HDFS-5604 Project: Hadoop HDFS Issue Type: Sub-task Components: libhdfs Reporter: Chris Nauroth Implement and test {{hdfsGetAcls}} and {{hdfsSetAcl}} in libHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5603) FsShell CLI: add support for recursive flag in ACL commands.
Chris Nauroth created HDFS-5603: --- Summary: FsShell CLI: add support for recursive flag in ACL commands. Key: HDFS-5603 URL: https://issues.apache.org/jira/browse/HDFS-5603 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Reporter: Chris Nauroth Implement and test handling of recursive flag for getfacl and setfacl. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5600) FsShell CLI: add getfacl and setfacl with minimal support for getting and setting ACLs.
Chris Nauroth created HDFS-5600: --- Summary: FsShell CLI: add getfacl and setfacl with minimal support for getting and setting ACLs. Key: HDFS-5600 URL: https://issues.apache.org/jira/browse/HDFS-5600 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Reporter: Chris Nauroth Implement and test FsShell CLI commands for getfacl and setfacl. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5611) WebHDFS: add support for recursive flag in ACL operations.
Chris Nauroth created HDFS-5611: --- Summary: WebHDFS: add support for recursive flag in ACL operations. Key: HDFS-5611 URL: https://issues.apache.org/jira/browse/HDFS-5611 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Reporter: Chris Nauroth Implement and test handling of recursive flag for all ACL operations in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5607) libHDFS: add support for recursive flag in ACL functions.
Chris Nauroth created HDFS-5607: --- Summary: libHDFS: add support for recursive flag in ACL functions. Key: HDFS-5607 URL: https://issues.apache.org/jira/browse/HDFS-5607 Project: Hadoop HDFS Issue Type: Sub-task Components: libhdfs Reporter: Chris Nauroth Implement and test handling of recursive flag for all ACL functions in libHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5610) WebHDFS: implement REMOVEDEFAULTACL.
Chris Nauroth created HDFS-5610: --- Summary: WebHDFS: implement REMOVEDEFAULTACL. Key: HDFS-5610 URL: https://issues.apache.org/jira/browse/HDFS-5610 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Reporter: Chris Nauroth Implement and test {{REMOVEDEFAULTACL}} in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5612) NameNode: change all permission checks to enforce ACLs in addition to permissions.
Chris Nauroth created HDFS-5612: --- Summary: NameNode: change all permission checks to enforce ACLs in addition to permissions. Key: HDFS-5612 URL: https://issues.apache.org/jira/browse/HDFS-5612 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth All {{NameNode}} code paths that enforce permissions must be updated so that they also enforce ACLs. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5614) NameNode: implement handling of ACLs in combination with snapshots.
Chris Nauroth created HDFS-5614: --- Summary: NameNode: implement handling of ACLs in combination with snapshots. Key: HDFS-5614 URL: https://issues.apache.org/jira/browse/HDFS-5614 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Within a snapshot, all ACLs are frozen at the moment that the snapshot was created. ACL changes in the parent of the snapshot are not applied to the snapshot. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5613) NameNode: implement handling of ACLs in combination with symlinks.
Chris Nauroth created HDFS-5613: --- Summary: NameNode: implement handling of ACLs in combination with symlinks. Key: HDFS-5613 URL: https://issues.apache.org/jira/browse/HDFS-5613 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth A symlink in HDFS does not have an ACL of its own. Operations that modify the ACL of a symlink instead modify the target of the symlink. For operations that enforce ACLs, enforcement is on the target of the symlink. This is similar to existing handling of permissions for symlinks. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5609) WebHDFS: implement MODIFYACLENTRIES, REMOVEACLENTRIES and REMOVEACL.
Chris Nauroth created HDFS-5609: --- Summary: WebHDFS: implement MODIFYACLENTRIES, REMOVEACLENTRIES and REMOVEACL. Key: HDFS-5609 URL: https://issues.apache.org/jira/browse/HDFS-5609 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Reporter: Chris Nauroth Implement and test {{MODIFYACLENTRIES}}, {{REMOVEACLENTRIES}} and {{REMOVEACL}} in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5608) WebHDFS: implement GETACLS and SETACL.
Chris Nauroth created HDFS-5608: --- Summary: WebHDFS: implement GETACLS and SETACL. Key: HDFS-5608 URL: https://issues.apache.org/jira/browse/HDFS-5608 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Reporter: Chris Nauroth Implement and test {{GETACLS}} and {{SETACL}} in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5621) NameNode: add indicator in web UI file system browser if a file has an ACL.
Chris Nauroth created HDFS-5621: --- Summary: NameNode: add indicator in web UI file system browser if a file has an ACL. Key: HDFS-5621 URL: https://issues.apache.org/jira/browse/HDFS-5621 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Change the file system browser to append the '+' character to permissions of any file or directory that has an ACL. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5615) NameNode: implement handling of ACLs in combination with sticky bit.
Chris Nauroth created HDFS-5615: --- Summary: NameNode: implement handling of ACLs in combination with sticky bit. Key: HDFS-5615 URL: https://issues.apache.org/jira/browse/HDFS-5615 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth The sticky bit must work in combination with ACLs, similar to how the sticky bit already works with permissions. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5622) NameNode: change startup progress to track loading INode ACL Map.
Chris Nauroth created HDFS-5622: --- Summary: NameNode: change startup progress to track loading INode ACL Map. Key: HDFS-5622 URL: https://issues.apache.org/jira/browse/HDFS-5622 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Define a new startup progress {{StepType}} for loading INode ACL Map entries and use it to track progress during {{Phase#LOADING_FSIMAGE}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5619) NameNode: record ACL modifications to edit log.
Chris Nauroth created HDFS-5619: --- Summary: NameNode: record ACL modifications to edit log. Key: HDFS-5619 URL: https://issues.apache.org/jira/browse/HDFS-5619 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Implement a new edit log opcode, {{OP_SET_ACL}}, which fully replaces the ACL of a specific inode. For ACL operations that perform partial modification of the ACL, the NameNode must merge the modifications with the existing ACL to produce the final resulting ACL and encode it into an {{OP_SET_ACL}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
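The merge step described above (partial modifications folded into the existing ACL before encoding a single full-replacement {{OP_SET_ACL}}) can be sketched as follows. The class, method, and entry representation are hypothetical simplifications, not HDFS code; real ACL entries also carry scope and type:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative sketch: entries keyed by name (e.g. "user:alice") mapping to a
// permission string. Merging replaces matching entries and appends new ones,
// yielding the complete ACL that OP_SET_ACL would record.
public class AclMerge {
    public static Map<String, String> merge(Map<String, String> existing,
                                            Map<String, String> modifications) {
        Map<String, String> result = new LinkedHashMap<>(existing);
        result.putAll(modifications);  // modified entries win; others carry over
        return result;
    }
}
```

Recording the fully merged ACL (rather than the delta) keeps edit log replay simple: each {{OP_SET_ACL}} is self-contained and idempotent.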
[jira] [Created] (HDFS-5618) NameNode: persist ACLs in fsimage.
Chris Nauroth created HDFS-5618: --- Summary: NameNode: persist ACLs in fsimage. Key: HDFS-5618 URL: https://issues.apache.org/jira/browse/HDFS-5618 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Store ACLs in fsimage so that ACLs are retained across NameNode restarts. This requires encoding and saving the {{AclManager}} state as a new section of the fsimage, located after all existing sections (snapshot manager state, inodes, secret manager state, and cache manager state). -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5617) NameNode: enforce maximum number of ACL entries.
Chris Nauroth created HDFS-5617: --- Summary: NameNode: enforce maximum number of ACL entries. Key: HDFS-5617 URL: https://issues.apache.org/jira/browse/HDFS-5617 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth The number of entries in a single ACL must be capped at 32. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5620) NameNode: enhance AclManager to use Global ACL Set as a space optimization.
Chris Nauroth created HDFS-5620: --- Summary: NameNode: enhance AclManager to use Global ACL Set as a space optimization. Key: HDFS-5620 URL: https://issues.apache.org/jira/browse/HDFS-5620 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth The {{AclManager}} can maintain a Global ACL Set to store all distinct ACLs in use by the file system. All inodes that have the same ACL entries can share the same ACL instance. -- This message was sent by Atlassian JIRA (v6.1#6144)
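The Global ACL Set described above is an interning pattern: structurally equal ACLs collapse to one shared instance so that many inodes reference the same object. A hypothetical sketch of the idea (class and entry representation are illustrative, not HDFS code):

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Intern pool for ACLs: two lists with equal entries map to the same shared,
// immutable instance, so duplicate ACLs cost one object instead of thousands.
public class AclInterner {
    private final Map<List<String>, List<String>> pool = new HashMap<>();

    public List<String> intern(List<String> entries) {
        List<String> key = Collections.unmodifiableList(new ArrayList<>(entries));
        List<String> shared = pool.putIfAbsent(key, key);
        return shared != null ? shared : key;
    }
}
```

A production version would also need reference counting or weak references so unused ACLs can be reclaimed; that bookkeeping is omitted here.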
[jira] [Created] (HDFS-5616) NameNode: implement default ACL handling.
Chris Nauroth created HDFS-5616: --- Summary: NameNode: implement default ACL handling. Key: HDFS-5616 URL: https://issues.apache.org/jira/browse/HDFS-5616 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Implement and test handling of default ACLs within NameNode. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5623) NameNode: add tests for skipping ACL enforcement when permission checks are disabled, user is superuser or user is member of supergroup.
Chris Nauroth created HDFS-5623: --- Summary: NameNode: add tests for skipping ACL enforcement when permission checks are disabled, user is superuser or user is member of supergroup. Key: HDFS-5623 URL: https://issues.apache.org/jira/browse/HDFS-5623 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth The existing permission checks are skipped under the following conditions: * {{dfs.permissions.enabled}} is set to false. (There are several exceptions stated in the documentation.) * The user is the super-user. * The user is a member of the super-user group. Add tests verifying that ACL enforcement is also skipped for all of these cases. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5624) Add tests for ACLs in combination with viewfs.
Chris Nauroth created HDFS-5624: --- Summary: Add tests for ACLs in combination with viewfs. Key: HDFS-5624 URL: https://issues.apache.org/jira/browse/HDFS-5624 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client Reporter: Chris Nauroth Add tests verifying that in a federated deployment, a viewfs wrapped over multiple federated NameNodes will dispatch the ACL operations to the correct NameNode. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Created] (HDFS-5625) Write end user documentation for HDFS ACLs.
Chris Nauroth created HDFS-5625: --- Summary: Write end user documentation for HDFS ACLs. Key: HDFS-5625 URL: https://issues.apache.org/jira/browse/HDFS-5625 Project: Hadoop HDFS Issue Type: Sub-task Components: documentation Reporter: Chris Nauroth * Update File System Shell documentation to describe getfacl and setfacl. * Update HDFS Permissions Guide to cover ACLs. Hyperlink to shell documentation for getfacl and setfacl. * If there is a large amount of new content, considering splitting it to a separate HDFS ACLs Guide and hyperlink as appropriate. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-4685) Implementation of ACLs in HDFS
[ https://issues.apache.org/jira/browse/HDFS-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838092#comment-13838092 ] Chris Nauroth commented on HDFS-4685: - I've entered the sub-task break-down. My own initial focus is going to be HDFS-5594 and HDFS-5595. These two are pre-requisites for a lot of the other sub-tasks. After that, it will be easier for multiple people to work in parallel across the various interfaces and NameNode implementation details. Implementation of ACLs in HDFS -- Key: HDFS-4685 URL: https://issues.apache.org/jira/browse/HDFS-4685 Project: Hadoop HDFS Issue Type: New Feature Components: hdfs-client, namenode, security Affects Versions: 1.1.2 Reporter: Sachin Jose Assignee: Chris Nauroth Attachments: HDFS-ACLs-Design-1.pdf Currenly hdfs doesn't support Extended file ACL. In unix extended ACL can be achieved using getfacl and setfacl utilities. Is there anybody working on this feature ? -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Assigned] (HDFS-5594) FileSystem API for ACLs.
[ https://issues.apache.org/jira/browse/HDFS-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth reassigned HDFS-5594: --- Assignee: Chris Nauroth FileSystem API for ACLs. Key: HDFS-5594 URL: https://issues.apache.org/jira/browse/HDFS-5594 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client Reporter: Chris Nauroth Assignee: Chris Nauroth Add new methods to {{FileSystem}} and {{FileContext}} for manipulating ACLs. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Assigned] (HDFS-5595) NameNode: implement AclManager as abstraction over INode ACL Map.
[ https://issues.apache.org/jira/browse/HDFS-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth reassigned HDFS-5595: --- Assignee: Chris Nauroth NameNode: implement AclManager as abstraction over INode ACL Map. - Key: HDFS-5595 URL: https://issues.apache.org/jira/browse/HDFS-5595 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Reporter: Chris Nauroth Assignee: Chris Nauroth Complete an initial implementation of {{AclManager}} to enable further development tasks. This will be a basic implementation using the INode ACL Map to track associations between inodes and ACLs. This will not fully implement all of the optimizations discussed in the design doc. Further optimization work will be tracked in separate tasks. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-2832) Enable support for heterogeneous storages in HDFS
[ https://issues.apache.org/jira/browse/HDFS-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838097#comment-13838097 ] Hadoop QA commented on HDFS-2832: - {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12616817/h2832_20131203.patch against trunk revision . {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 48 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The patch failed these unit tests in hadoop-hdfs-project/hadoop-hdfs: org.apache.hadoop.hdfs.tools.offlineEditsViewer.TestOfflineEditsViewer {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5624//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5624//console This message is automatically generated. 
Enable support for heterogeneous storages in HDFS - Key: HDFS-2832 URL: https://issues.apache.org/jira/browse/HDFS-2832 Project: Hadoop HDFS Issue Type: New Feature Affects Versions: 0.24.0 Reporter: Suresh Srinivas Assignee: Suresh Srinivas Attachments: 20130813-HeterogeneousStorage.pdf, 20131125-HeterogeneousStorage-TestPlan.pdf, 20131125-HeterogeneousStorage.pdf, 20131202-HeterogeneousStorage-TestPlan.pdf, 20131203-HeterogeneousStorage-TestPlan.pdf, H2832_20131107.patch, editsStored, h2832_20131023.patch, h2832_20131023b.patch, h2832_20131025.patch, h2832_20131028.patch, h2832_20131028b.patch, h2832_20131029.patch, h2832_20131103.patch, h2832_20131104.patch, h2832_20131105.patch, h2832_20131107b.patch, h2832_20131108.patch, h2832_20131110.patch, h2832_20131110b.patch, h2832_2013.patch, h2832_20131112.patch, h2832_20131112b.patch, h2832_20131114.patch, h2832_20131118.patch, h2832_20131119.patch, h2832_20131119b.patch, h2832_20131121.patch, h2832_20131122.patch, h2832_20131122b.patch, h2832_20131123.patch, h2832_20131124.patch, h2832_20131202.patch, h2832_20131203.patch HDFS currently supports a configuration where storages are a list of directories. Typically each of these directories corresponds to a volume with its own file system. All these directories are homogeneous and are therefore identified as a single storage at the namenode. I propose changing the current model, where a Datanode *is a* storage, to one where a Datanode *is a collection of* storages. -- This message was sent by Atlassian JIRA (v6.1#6144)
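The proposed model change can be sketched as a toy data model: one DataNode reporting a collection of storages, each with its own ID and storage type. All names here (DataNodeModel, Storage, StorageType) are illustrative stand-ins, not the classes in the HDFS-2832 patches, and the type values are assumptions.

```java
import java.util.List;

// Toy sketch of the proposed model: a DataNode *is a collection of*
// storages rather than *being* a single storage. Names are hypothetical.
public class DataNodeModel {
    enum StorageType { DISK, SSD }   // assumed example types

    record Storage(String storageId, StorageType type) {}

    record DataNode(String datanodeUuid, List<Storage> storages) {}

    public static void main(String[] args) {
        // One DataNode now reports two distinct storages to the namenode.
        DataNode dn = new DataNode("dn-1", List.of(
                new Storage("DS-1", StorageType.DISK),
                new Storage("DS-2", StorageType.SSD)));
        if (dn.storages().size() != 2) throw new AssertionError();
        System.out.println("ok");
    }
}
```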
[jira] [Commented] (HDFS-5569) WebHDFS should support a deny/allow list for data access
[ https://issues.apache.org/jira/browse/HDFS-5569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838105#comment-13838105 ] Colin Patrick McCabe commented on HDFS-5569: bq. Colin Patrick McCabe I'm not sure how adding a new IP address would by-pass this? As the client I can add whatever IP address I want but if it's not routable it won't work... You can set your IP address to be the same as someone else on the network. It may cause conflicts, but if your intent is to break system security, you probably don't care. WebHDFS should support a deny/allow list for data access Key: HDFS-5569 URL: https://issues.apache.org/jira/browse/HDFS-5569 Project: Hadoop HDFS Issue Type: Improvement Components: webhdfs Reporter: Adam Faris Labels: features Currently we can't restrict what networks are allowed to transfer data using WebHDFS. Obviously we can use firewalls to block ports, but this can be complicated and problematic to maintain. Additionally, because all the jetty servlets run inside the same container, blocking access to jetty to prevent WebHDFS transfers also blocks the other servlets running inside that same jetty container. I am requesting a deny/allow feature be added to WebHDFS. This is already done with the Apache HTTPD server, and is what I'd like to see the deny/allow list modeled after. Thanks. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (HDFS-5182) BlockReaderLocal must allow zero-copy reads only when the DN believes it's valid
[ https://issues.apache.org/jira/browse/HDFS-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838159#comment-13838159 ] Colin Patrick McCabe commented on HDFS-5182: So, previously we discussed a few different ways for the {{DataNode}} to notify the {{DFSClient}} about a change in the block's mlock status. One way (let's call this choice #1) was using a shared memory segment. This would take the form of a third file descriptor passed from the {{DataNode}} to the {{DFSClient}}. On Linux, this would simply be a 4kb file from the {{/dev/shm}} filesystem, which is a {{tmpfs}} filesystem. That filesystem is the best choice because it will not cause the file to be written back to disk every {{dirty_writeback_centisecs}}. However, on looking into this further, I found some issues with this method. There is no way for the {{DataNode}} to know when the {{DFSClient}} has closed the file descriptor for the shared memory area. We could add some kind of protocol for keeping the area alive by writing to an agreed-upon location, but that would add a fair amount of complexity, and might be triggered accidentally in the case of a garbage collection event on the {{DFSClient}} or {{DataNode}}. Another issue is that there is no way for the {{DataNode}} to revoke access to this shared memory segment. If the {{DFSClient}} wants to hold on to it forever, leaking memory, it can do that. This opens a hole. The client might not have UNIX permissions to grab space in {{/dev/shm}}, but through this mechanism it can consume an arbitrary amount of space there. The other way (let's call this choice #2) is for the client to keep open the domain socket it used to request the two file descriptors. If we can listen for messages sent on this socket, we can have a truly edge-triggered notification method. The messages can be as short as a single byte, since we have very simple message needs. 
This requires adding an epoll loop to handle these notifications without consuming a whole thread per socket. Regardless of whether we go with choice #1 or #2, there are some other things that need to be done. * Right now, we don't allow {{BlockReaderLocal}} instances to share file descriptors with each other. However, this would be advisable, to avoid creating 100 pipes/shm areas when someone re-opens the same file 100 times. Doing this is actually an easy change (I wrote and tested the patch already). * We need to revise {{FileInputStreamCache}} to store the communication method (pipe or shared memory area) which will be giving us notifications. This cache also needs to get support for dealing with mmap regions, and for BRL instances sharing FDs / mmaps. I have a patch which reworks this cache, but it's not quite done yet. * {{BlockReaderLocal}} needs to get support for switching back and forth between honoring checksums and not. I have a patch which substantially reworks BRL to add this capability, which I'm considering posting as a separate JIRA. BlockReaderLocal must allow zero-copy reads only when the DN believes it's valid - Key: HDFS-5182 URL: https://issues.apache.org/jira/browse/HDFS-5182 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client Affects Versions: 3.0.0 Reporter: Colin Patrick McCabe Assignee: Colin Patrick McCabe BlockReaderLocal must allow zero-copy reads only when the DN believes it's valid. This implies adding a new field to the response to REQUEST_SHORT_CIRCUIT_FDS. We also need some kind of heartbeat from the client to the DN, so that the DN can inform the client when the mapped region is no longer locked into memory. -- This message was sent by Atlassian JIRA (v6.1#6144)
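The single-byte, edge-triggered notification of choice #2 can be sketched in plain Java NIO. This is a minimal illustration, not the HDFS-5182 implementation: a {{Pipe}} stands in for the Unix domain socket, a {{Selector}} stands in for the proposed epoll loop (one thread watching many channels, rather than a thread per socket), and the {{RevocationWatcher}} and {{REVOKE_MLOCK}} names are invented for the example.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.Pipe;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;

// Hypothetical sketch of choice #2: the "DataNode" side revokes access by
// writing one byte; a single Selector (standing in for the epoll loop)
// watches many channels without dedicating a thread to each.
public class RevocationWatcher {
    public static final byte REVOKE_MLOCK = 0x01;  // invented opcode

    // Drain one pending notification byte, or return -1 if none is ready.
    public static int poll(Selector selector) throws IOException {
        if (selector.selectNow() == 0) {
            return -1;
        }
        for (SelectionKey key : selector.selectedKeys()) {
            ByteBuffer buf = ByteBuffer.allocate(1);
            ((Pipe.SourceChannel) key.channel()).read(buf);
            buf.flip();
            selector.selectedKeys().clear();
            return buf.get();
        }
        return -1;
    }

    public static void main(String[] args) throws IOException {
        Pipe pipe = Pipe.open();                       // stand-in for the domain socket
        pipe.source().configureBlocking(false);
        Selector selector = Selector.open();
        pipe.source().register(selector, SelectionKey.OP_READ);

        // Nothing sent yet: the watcher sees no event.
        if (poll(selector) != -1) throw new AssertionError();

        // "DataNode" revokes: a single byte is the whole message.
        pipe.sink().write(ByteBuffer.wrap(new byte[]{REVOKE_MLOCK}));
        if (poll(selector) != REVOKE_MLOCK) throw new AssertionError();
        System.out.println("ok");
    }
}
```

Because the client must hold this channel open to keep its file descriptors, the DataNode also learns about client death for free: the channel closes.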
[jira] [Commented] (HDFS-5514) FSNamesystem's fsLock should allow custom implementation
[ https://issues.apache.org/jira/browse/HDFS-5514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13838184#comment-13838184 ] Hadoop QA commented on HDFS-5514: - {color:green}+1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12616827/HDFS-5514.patch against trunk revision . {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 eclipse:eclipse{color}. The patch built with eclipse:eclipse. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in hadoop-hdfs-project/hadoop-hdfs. {color:green}+1 contrib tests{color}. The patch passed contrib unit tests. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/5625//testReport/ Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/5625//console This message is automatically generated. FSNamesystem's fsLock should allow custom implementation Key: HDFS-5514 URL: https://issues.apache.org/jira/browse/HDFS-5514 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: 2.0.0-alpha, 3.0.0 Reporter: Daryn Sharp Assignee: Daryn Sharp Attachments: HDFS-5514.patch, HDFS-5514.patch Changing {{fsLock}} from a {{ReentrantReadWriteLock}} to an API compatible class that encapsulates the rwLock will allow for more sophisticated locking implementations such as fine grain locking. 
-- This message was sent by Atlassian JIRA (v6.1#6144)
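The encapsulation proposed in HDFS-5514 can be sketched as a thin wrapper that exposes the same acquire/release surface as the raw {{ReentrantReadWriteLock}}, leaving room for a subclass to substitute fine-grained locking later. This is a minimal sketch under that assumption; the class and method names are illustrative, not the committed patch.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical sketch of HDFS-5514: wrap the namesystem's rwLock in an
// API-compatible class so more sophisticated locking can be swapped in.
public class FSNamesystemLock {
    private final ReentrantReadWriteLock coarseLock = new ReentrantReadWriteLock(true);

    public void readLock()    { coarseLock.readLock().lock(); }
    public void readUnlock()  { coarseLock.readLock().unlock(); }
    public void writeLock()   { coarseLock.writeLock().lock(); }
    public void writeUnlock() { coarseLock.writeLock().unlock(); }

    // Introspection used by assertion-style checks in the namesystem.
    public boolean hasWriteLock() {
        return coarseLock.isWriteLockedByCurrentThread();
    }
    public boolean hasReadLock() {
        return coarseLock.getReadHoldCount() > 0 || hasWriteLock();
    }

    public static void main(String[] args) {
        FSNamesystemLock lock = new FSNamesystemLock();
        lock.writeLock();
        if (!lock.hasWriteLock()) throw new AssertionError();
        lock.writeUnlock();
        lock.readLock();
        if (!lock.hasReadLock() || lock.hasWriteLock()) throw new AssertionError();
        lock.readUnlock();
        System.out.println("ok");
    }
}
```

Because callers see only this interface, a fine-grained implementation can override the four lock methods without touching FSNamesystem call sites.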
[jira] [Updated] (HDFS-5597) DistributedFileSystem: implement modifyAclEntries, removeAclEntries and removeAcl.
[ https://issues.apache.org/jira/browse/HDFS-5597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5597: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) DistributedFileSystem: implement modifyAclEntries, removeAclEntries and removeAcl. -- Key: HDFS-5597 URL: https://issues.apache.org/jira/browse/HDFS-5597 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client, namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test {{modifyAclEntries}}, {{removeAclEntries}} and {{removeAcl}} in {{DistributedFileSystem}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5616) NameNode: implement default ACL handling.
[ https://issues.apache.org/jira/browse/HDFS-5616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5616: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: implement default ACL handling. - Key: HDFS-5616 URL: https://issues.apache.org/jira/browse/HDFS-5616 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test handling of default ACLs within NameNode. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5602) FsShell CLI: add setfacl flag for removal of default ACL entries.
[ https://issues.apache.org/jira/browse/HDFS-5602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5602: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) FsShell CLI: add setfacl flag for removal of default ACL entries. - Key: HDFS-5602 URL: https://issues.apache.org/jira/browse/HDFS-5602 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test setfacl support for removal of just the default entries in an ACL. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5601) FsShell CLI: add setfacl flags for ACL entry modification and removal.
[ https://issues.apache.org/jira/browse/HDFS-5601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5601: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) FsShell CLI: add setfacl flags for ACL entry modification and removal. -- Key: HDFS-5601 URL: https://issues.apache.org/jira/browse/HDFS-5601 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test setfacl support for flags that allow partial modification of an ACL and modification of specific ACL entries. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5607) libHDFS: add support for recursive flag in ACL functions.
[ https://issues.apache.org/jira/browse/HDFS-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5607: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) libHDFS: add support for recursive flag in ACL functions. - Key: HDFS-5607 URL: https://issues.apache.org/jira/browse/HDFS-5607 Project: Hadoop HDFS Issue Type: Sub-task Components: libhdfs Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test handling of recursive flag for all ACL functions in libHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5598) DistributedFileSystem: implement removeDefaultAcl.
[ https://issues.apache.org/jira/browse/HDFS-5598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5598: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) DistributedFileSystem: implement removeDefaultAcl. -- Key: HDFS-5598 URL: https://issues.apache.org/jira/browse/HDFS-5598 Project: Hadoop HDFS Issue Type: Sub-task Components: hdfs-client, namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test {{removeDefaultAcl}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5603) FsShell CLI: add support for recursive flag in ACL commands.
[ https://issues.apache.org/jira/browse/HDFS-5603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5603: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) FsShell CLI: add support for recursive flag in ACL commands. Key: HDFS-5603 URL: https://issues.apache.org/jira/browse/HDFS-5603 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test handling of recursive flag for getfacl and setfacl. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5623) NameNode: add tests for skipping ACL enforcement when permission checks are disabled, user is superuser or user is member of supergroup.
[ https://issues.apache.org/jira/browse/HDFS-5623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5623: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: add tests for skipping ACL enforcement when permission checks are disabled, user is superuser or user is member of supergroup. Key: HDFS-5623 URL: https://issues.apache.org/jira/browse/HDFS-5623 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth The existing permission checks are skipped under the following conditions: * {{dfs.permissions.enabled}} is set to false. (There are several exceptions stated in the documentation.) * The user is the super-user. * The user is a member of the super-user group. Add tests verifying that ACL enforcement is also skipped for all of these cases. -- This message was sent by Atlassian JIRA (v6.1#6144)
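The three skip conditions listed above reduce to one predicate, sketched here for illustration. The class and parameter names are invented for the example and do not match the NameNode's internal checker.

```java
import java.util.Set;

// Hypothetical predicate mirroring the conditions under which HDFS skips
// permission checks, and hence should also skip ACL enforcement.
public class EnforcementCheck {
    public static boolean skipEnforcement(boolean permissionsEnabled,
                                          String user,
                                          Set<String> userGroups,
                                          String superUser,
                                          String superGroup) {
        return !permissionsEnabled              // dfs.permissions.enabled=false
            || user.equals(superUser)           // the super-user
            || userGroups.contains(superGroup); // member of the super-user group
    }

    public static void main(String[] args) {
        Set<String> groups = Set.of("hadoop");
        if (!skipEnforcement(false, "alice", groups, "hdfs", "supergroup"))
            throw new AssertionError();   // checks disabled entirely
        if (!skipEnforcement(true, "hdfs", groups, "hdfs", "supergroup"))
            throw new AssertionError();   // super-user
        if (skipEnforcement(true, "alice", groups, "hdfs", "supergroup"))
            throw new AssertionError();   // ordinary user: must be enforced
        System.out.println("ok");
    }
}
```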
[jira] [Updated] (HDFS-5608) WebHDFS: implement GETACLS and SETACL.
[ https://issues.apache.org/jira/browse/HDFS-5608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5608: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) WebHDFS: implement GETACLS and SETACL. -- Key: HDFS-5608 URL: https://issues.apache.org/jira/browse/HDFS-5608 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test {{GETACLS}} and {{SETACL}} in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5619) NameNode: record ACL modifications to edit log.
[ https://issues.apache.org/jira/browse/HDFS-5619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5619: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: record ACL modifications to edit log. --- Key: HDFS-5619 URL: https://issues.apache.org/jira/browse/HDFS-5619 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement a new edit log opcode, {{OP_SET_ACL}}, which fully replaces the ACL of a specific inode. For ACL operations that perform partial modification of the ACL, the NameNode must merge the modifications with the existing ACL to produce the final resulting ACL and encode it into an {{OP_SET_ACL}}. -- This message was sent by Atlassian JIRA (v6.1#6144)
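The merge step described above (folding a partial modification into the existing ACL before logging one full {{OP_SET_ACL}}) can be sketched with toy string entries. The real AclEntry type and merge rules are richer; this sketch assumes entries of the form "type:name:perm" keyed by "type:name", with modified keys replaced in place and new keys appended.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the OP_SET_ACL merge: the NameNode combines a
// partial modification with the existing ACL, then logs only the result.
public class AclMerge {
    // Entries are toy strings "type:name:perm"; the key is "type:name".
    public static List<String> merge(List<String> existing, List<String> mods) {
        Map<String, String> merged = new LinkedHashMap<>();
        for (String e : existing) merged.put(key(e), e);
        for (String m : mods)     merged.put(key(m), m);  // replace or append
        return new ArrayList<>(merged.values());
    }

    private static String key(String entry) {
        return entry.substring(0, entry.lastIndexOf(':'));
    }

    public static void main(String[] args) {
        List<String> acl = List.of("user::rw-", "user:bruce:rw-", "group::r--");
        List<String> out = merge(acl, List.of("user:bruce:r--", "mask::r--"));
        // bruce's entry is replaced in place; the mask entry is appended.
        if (!out.equals(List.of("user::rw-", "user:bruce:r--", "group::r--", "mask::r--")))
            throw new AssertionError(out.toString());
        System.out.println("ok");
    }
}
```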
[jira] [Updated] (HDFS-5595) NameNode: implement AclManager as abstraction over INode ACL Map.
[ https://issues.apache.org/jira/browse/HDFS-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5595: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: implement AclManager as abstraction over INode ACL Map. - Key: HDFS-5595 URL: https://issues.apache.org/jira/browse/HDFS-5595 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Assignee: Chris Nauroth Complete an initial implementation of {{AclManager}} to enable further development tasks. This will be a basic implementation using the INode ACL Map to track associations between inodes and ACLs. This will not fully implement all of the optimizations discussed in the design doc. Further optimization work will be tracked in separate tasks. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5609) WebHDFS: implement MODIFYACLENTRIES, REMOVEACLENTRIES and REMOVEACL.
[ https://issues.apache.org/jira/browse/HDFS-5609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5609: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) WebHDFS: implement MODIFYACLENTRIES, REMOVEACLENTRIES and REMOVEACL. Key: HDFS-5609 URL: https://issues.apache.org/jira/browse/HDFS-5609 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test {{MODIFYACLENTRIES}}, {{REMOVEACLENTRIES}} and {{REMOVEACL}} in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5613) NameNode: implement handling of ACLs in combination with symlinks.
[ https://issues.apache.org/jira/browse/HDFS-5613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5613: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: implement handling of ACLs in combination with symlinks. -- Key: HDFS-5613 URL: https://issues.apache.org/jira/browse/HDFS-5613 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth A symlink in HDFS does not have an ACL of its own. Operations that modify the ACL of a symlink instead modify the target of the symlink. For operations that enforce ACLs, enforcement is on the target of the symlink. This is similar to existing handling of permissions for symlinks. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5618) NameNode: persist ACLs in fsimage.
[ https://issues.apache.org/jira/browse/HDFS-5618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5618: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: persist ACLs in fsimage. -- Key: HDFS-5618 URL: https://issues.apache.org/jira/browse/HDFS-5618 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Store ACLs in fsimage so that ACLs are retained across NameNode restarts. This requires encoding and saving the {{AclManager}} state as a new section of the fsimage, located after all existing sections (snapshot manager state, inodes, secret manager state, and cache manager state). -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5615) NameNode: implement handling of ACLs in combination with sticky bit.
[ https://issues.apache.org/jira/browse/HDFS-5615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5615: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: implement handling of ACLs in combination with sticky bit. Key: HDFS-5615 URL: https://issues.apache.org/jira/browse/HDFS-5615 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth The sticky bit must work in combination with ACLs, similar to how the sticky bit already works with permissions. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5600) FsShell CLI: add getfacl and setfacl with minimal support for getting and setting ACLs.
[ https://issues.apache.org/jira/browse/HDFS-5600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5600: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) FsShell CLI: add getfacl and setfacl with minimal support for getting and setting ACLs. --- Key: HDFS-5600 URL: https://issues.apache.org/jira/browse/HDFS-5600 Project: Hadoop HDFS Issue Type: Sub-task Components: tools Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test FsShell CLI commands for getfacl and setfacl. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5610) WebHDFS: implement REMOVEDEFAULTACL.
[ https://issues.apache.org/jira/browse/HDFS-5610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5610: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) WebHDFS: implement REMOVEDEFAULTACL. Key: HDFS-5610 URL: https://issues.apache.org/jira/browse/HDFS-5610 Project: Hadoop HDFS Issue Type: Sub-task Components: webhdfs Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Implement and test {{REMOVEDEFAULTACL}} in WebHDFS. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5620) NameNode: enhance AclManager to use Global ACL Set as a space optimization.
[ https://issues.apache.org/jira/browse/HDFS-5620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5620: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: enhance AclManager to use Global ACL Set as a space optimization. --- Key: HDFS-5620 URL: https://issues.apache.org/jira/browse/HDFS-5620 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth The {{AclManager}} can maintain a Global ACL Set to store all distinct ACLs in use by the file system. All inodes that have the same ACL entries can share the same ACL instance. -- This message was sent by Atlassian JIRA (v6.1#6144)
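The Global ACL Set is essentially interning: equal ACLs collapse to one shared canonical instance. A minimal sketch under that assumption, again with toy string entries and invented names rather than the real AclManager types:

```java
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of the Global ACL Set space optimization: inodes
// whose ACL entries are equal share a single stored instance.
public class GlobalAclSet {
    private final Map<List<String>, List<String>> interned = new ConcurrentHashMap<>();

    // Return the canonical instance for this list of entries.
    public List<String> intern(List<String> entries) {
        return interned.computeIfAbsent(List.copyOf(entries), k -> k);
    }

    public int size() { return interned.size(); }

    public static void main(String[] args) {
        GlobalAclSet set = new GlobalAclSet();
        List<String> a = set.intern(List.of("user:bruce:rw-", "group::r--"));
        List<String> b = set.intern(List.of("user:bruce:rw-", "group::r--"));
        if (a != b) throw new AssertionError();       // same shared instance
        if (set.size() != 1) throw new AssertionError();
        System.out.println("ok");
    }
}
```

A production version would also need reference counting so unreferenced ACLs can be evicted, but that bookkeeping is omitted here.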
[jira] [Updated] (HDFS-5612) NameNode: change all permission checks to enforce ACLs in addition to permissions.
[ https://issues.apache.org/jira/browse/HDFS-5612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5612: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: change all permission checks to enforce ACLs in addition to permissions. -- Key: HDFS-5612 URL: https://issues.apache.org/jira/browse/HDFS-5612 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth All {{NameNode}} code paths that enforce permissions must be updated so that they also enforce ACLs. -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Updated] (HDFS-5614) NameNode: implement handling of ACLs in combination with snapshots.
[ https://issues.apache.org/jira/browse/HDFS-5614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-5614: Target Version/s: HDFS ACLs (HDFS-4685) Affects Version/s: HDFS ACLs (HDFS-4685) NameNode: implement handling of ACLs in combination with snapshots. --- Key: HDFS-5614 URL: https://issues.apache.org/jira/browse/HDFS-5614 Project: Hadoop HDFS Issue Type: Sub-task Components: namenode Affects Versions: HDFS ACLs (HDFS-4685) Reporter: Chris Nauroth Within a snapshot, all ACLs are frozen at the moment that the snapshot was created. ACL changes in the parent of the snapshot are not applied to the snapshot. -- This message was sent by Atlassian JIRA (v6.1#6144)