[jira] [Commented] (HDFS-2126) Improve Namenode startup time [umbrella task]
[ https://issues.apache.org/jira/browse/HDFS-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060310#comment-13060310 ] Matt Foley commented on HDFS-2126: -- The intent is to close this umbrella Jira after HDFS-1391 and HDFS-1732 are committed. Other ideas for speeding up Namenode startup have been proposed, and in some cases Jiras opened. We record some of them here for historical interest, but they are more speculative and may be pursued under other Jiras or other projects:
* Fully background the FSImage writes when in Safe Mode (HDFS-1798)
* Further optimization of FSImage reads (e.g. HDFS-1366)
* Concurrent FSImage read processing, by splitting the FSImage file into independently-processable partitions (speculative)
* Improvements for Edits log read processing, similar to the efficiency improvements obtained for FSImage reads (speculative)
* Concurrent Block Report processing (e.g. HDFS-1667)
* Fully background Termination Scan (taking the improvements of HDFS-1391 to their maximum)
Improve Namenode startup time [umbrella task] - Key: HDFS-2126 URL: https://issues.apache.org/jira/browse/HDFS-2126 Project: Hadoop HDFS Issue Type: Improvement Components: name-node Affects Versions: 0.20.2 Reporter: Matt Foley This is an umbrella task to group the improvements in Namenode startup latency made over the last few months, and track remaining ideas. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2010) Clean up and test behavior under failed edit streams
[ https://issues.apache.org/jira/browse/HDFS-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060326#comment-13060326 ] Matt Foley commented on HDFS-2010: -- Nope, lgtm! Clean up and test behavior under failed edit streams Key: HDFS-2010 URL: https://issues.apache.org/jira/browse/HDFS-2010 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Aaron T. Myers Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-2010.0.patch, hdfs-2010.1.patch, hdfs-2010.2.patch Right now there is very little test coverage of situations where one or more of the edits directories fails. In trunk, the behavior when all of the edits directories are dead is that the NN prints a fatal-level log message and calls Runtime.exit(-1). I don't think this is really the behavior we want. It needs a bit of thought, but I think something like the following would make more sense:
- any calls currently waiting on logSync should end up throwing an exception
- the NN should probably enter safe mode
- ops can restore edits directories and then ask the NN to restore storage, at which point it could exit safe mode
- alternatively, ops could ask the NN to do saveNamespace and then shut it down
[jira] [Commented] (HDFS-503) Implement erasure coding as a layer on HDFS
[ https://issues.apache.org/jira/browse/HDFS-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060343#comment-13060343 ] sri commented on HDFS-503: -- I have a couple of questions: 1) With Raid set up, I am not able to generate a DFSAdmin report (hadoop dfsadmin -report). Why is that? 2) I am not able to reduce the targetReplicationFactor to 0 (I want to run mapreduce where the BlockFixer retrieves the data from the raided disks). Is there any way to do this? Thanks in advance. Implement erasure coding as a layer on HDFS --- Key: HDFS-503 URL: https://issues.apache.org/jira/browse/HDFS-503 Project: Hadoop HDFS Issue Type: New Feature Components: contrib/raid Reporter: dhruba borthakur Assignee: dhruba borthakur Fix For: 0.21.0 Attachments: raid1.txt, raid2.txt The goal of this JIRA is to discuss how the cost of raw storage for a HDFS file system can be reduced. Keeping three copies of the same data is very costly, especially when the size of storage is huge. One idea is to reduce the replication factor and do erasure coding of a set of blocks so that the overall probability of failure of a block remains the same as before. Many forms of error-correcting codes are available; see http://en.wikipedia.org/wiki/Erasure_code. Also, recent research from CMU has described DiskReduce: https://opencirrus.org/system/files/Gibson-OpenCirrus-June9-09.ppt. My opinion is to discuss implementation strategies that are not part of base HDFS, but are a layer on top of HDFS.
[jira] [Commented] (HDFS-503) Implement erasure coding as a layer on HDFS
[ https://issues.apache.org/jira/browse/HDFS-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060450#comment-13060450 ] sri commented on HDFS-503: -- I would like to know whether the stripes just act as a recovery option (when other datanodes have failed), or whether they can act as input to mapreduce jobs (to satisfy locality). Implement erasure coding as a layer on HDFS --- Key: HDFS-503 URL: https://issues.apache.org/jira/browse/HDFS-503 Project: Hadoop HDFS Issue Type: New Feature Components: contrib/raid Reporter: dhruba borthakur Assignee: dhruba borthakur Fix For: 0.21.0 Attachments: raid1.txt, raid2.txt The goal of this JIRA is to discuss how the cost of raw storage for a HDFS file system can be reduced. Keeping three copies of the same data is very costly, especially when the size of storage is huge. One idea is to reduce the replication factor and do erasure coding of a set of blocks so that the overall probability of failure of a block remains the same as before. Many forms of error-correcting codes are available; see http://en.wikipedia.org/wiki/Erasure_code. Also, recent research from CMU has described DiskReduce: https://opencirrus.org/system/files/Gibson-OpenCirrus-June9-09.ppt. My opinion is to discuss implementation strategies that are not part of base HDFS, but are a layer on top of HDFS.
[jira] [Commented] (HDFS-1990) Resource leaks in HDFS
[ https://issues.apache.org/jira/browse/HDFS-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060540#comment-13060540 ] Hudson commented on HDFS-1990: -- Integrated in Hadoop-Hdfs-trunk #717 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/717/]) HDFS-1990. Fix resource leaks in BlockReceiver.close(). Contributed by Uma Maheswara Rao G szetszwo : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1143147 Files : * /hadoop/common/trunk/hdfs/CHANGES.txt * /hadoop/common/trunk/hdfs/src/java/org/apache/hadoop/hdfs/server/datanode/BlockReceiver.java Resource leaks in HDFS -- Key: HDFS-1990 URL: https://issues.apache.org/jira/browse/HDFS-1990 Project: Hadoop HDFS Issue Type: Bug Components: data-node Affects Versions: 0.23.0 Reporter: ramkrishna.s.vasudevan Assignee: Uma Maheswara Rao G Priority: Minor Fix For: 0.23.0 Attachments: HDFS-1990.patch, HDFS-1990.patch Possible resource leakage in HDFS.
[jira] [Commented] (HDFS-1753) Resource Leak in org.apache.hadoop.hdfs.server.namenode.StreamFile
[ https://issues.apache.org/jira/browse/HDFS-1753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060541#comment-13060541 ] Hudson commented on HDFS-1753: -- Integrated in Hadoop-Hdfs-trunk #717 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/717/]) HDFS-1753. Resource Leak in StreamFile. Contributed by Uma Maheswara Rao G eli : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1143106 Files : * /hadoop/common/trunk/hdfs/src/test/hdfs/org/apache/hadoop/hdfs/server/namenode/TestStreamFile.java * /hadoop/common/trunk/hdfs/CHANGES.txt * /hadoop/common/trunk/hdfs/src/java/org/apache/hadoop/hdfs/server/namenode/StreamFile.java Resource Leak in org.apache.hadoop.hdfs.server.namenode.StreamFile -- Key: HDFS-1753 URL: https://issues.apache.org/jira/browse/HDFS-1753 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.20.1, 0.23.0 Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Priority: Minor Attachments: HDFS-1753.1.patch, HDFS-1753.2.patch, HDFS-1753.3.patch, HDFS-1753.4.patch, HDFS-1753.patch In the doGet method: final DFSInputStream in = dfs.open(filename); final long fileLen = in.getFileLength(); OutputStream os = response.getOutputStream(); These lines appear outside the try block. If response.getOutputStream() throws any exception, the DFSInputStream will not be closed. So it is better to move response.getOutputStream() into the try block.
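The restructuring described above can be sketched as follows. This is a minimal illustration only: OutputStreamSupplier and the stream types here are simplified stand-ins, not the real DFSInputStream/HttpServletResponse API.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

// Sketch of the HDFS-1753 fix: acquire the output stream inside the try
// block, so that if getOutputStream() throws, the already-opened input
// stream is still closed by the finally clause.
public class StreamFileSketch {

    interface OutputStreamSupplier {
        OutputStream getOutputStream() throws IOException;
    }

    static void doGet(InputStream in, OutputStreamSupplier response) throws IOException {
        try {
            // Previously this call sat outside the try block; if it threw,
            // 'in' leaked. Inside the try, the finally below always runs.
            OutputStream os = response.getOutputStream();
            int b;
            while ((b = in.read()) != -1) {
                os.write(b);
            }
        } finally {
            in.close();
        }
    }

    public static void main(String[] args) throws IOException {
        InputStream in = new ByteArrayInputStream("hello".getBytes());
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        doGet(in, () -> sink);
        System.out.println(sink.toString()); // prints "hello"
    }
}
```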
[jira] [Updated] (HDFS-2131) Tests for HADOOP-7361
[ https://issues.apache.org/jira/browse/HDFS-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uma Maheswara Rao G updated HDFS-2131: -- Attachment: HADOOP-7361-test.patch Tests for HADOOP-7361 - Key: HDFS-2131 URL: https://issues.apache.org/jira/browse/HDFS-2131 Project: Hadoop HDFS Issue Type: Test Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Attachments: HADOOP-7361-test.patch
[jira] [Created] (HDFS-2131) Tests for HADOOP-7361
Tests for HADOOP-7361 - Key: HDFS-2131 URL: https://issues.apache.org/jira/browse/HDFS-2131 Project: Hadoop HDFS Issue Type: Test Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Attachments: HADOOP-7361-test.patch
[jira] [Updated] (HDFS-2131) Tests for HADOOP-7361
[ https://issues.apache.org/jira/browse/HDFS-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uma Maheswara Rao G updated HDFS-2131: -- Status: Patch Available (was: Open) Tests for HADOOP-7361 - Key: HDFS-2131 URL: https://issues.apache.org/jira/browse/HDFS-2131 Project: Hadoop HDFS Issue Type: Test Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Attachments: HADOOP-7361-test.patch
[jira] [Commented] (HDFS-2126) Improve Namenode startup time [umbrella task]
[ https://issues.apache.org/jira/browse/HDFS-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060681#comment-13060681 ] Koji Noguchi commented on HDFS-2126: Is it only our (Yahoo) namenodes which hit a full GC right after the handlers first start up? That wastes 1 to 3 minutes. (We are giving a large heap size with -Xmx and -Xms.) Improve Namenode startup time [umbrella task] - Key: HDFS-2126 URL: https://issues.apache.org/jira/browse/HDFS-2126 Project: Hadoop HDFS Issue Type: Improvement Components: name-node Affects Versions: 0.20.2 Reporter: Matt Foley This is an umbrella task to group the improvements in Namenode startup latency made over the last few months, and track remaining ideas.
[jira] [Commented] (HDFS-2126) Improve Namenode startup time [umbrella task]
[ https://issues.apache.org/jira/browse/HDFS-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060690#comment-13060690 ] Todd Lipcon commented on HDFS-2126: --- Do you set CMSInitiatingOccupancyFraction in your GC options? I haven't noticed this behavior but haven't personally worked on startup time. Improve Namenode startup time [umbrella task] - Key: HDFS-2126 URL: https://issues.apache.org/jira/browse/HDFS-2126 Project: Hadoop HDFS Issue Type: Improvement Components: name-node Affects Versions: 0.20.2 Reporter: Matt Foley This is an umbrella task to group the improvements in Namenode startup latency made over the last few months, and track remaining ideas.
[jira] [Commented] (HDFS-2131) Tests for HADOOP-7361
[ https://issues.apache.org/jira/browse/HDFS-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060691#comment-13060691 ] Hadoop QA commented on HDFS-2131: - -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12485438/HADOOP-7361-test.patch against trunk revision 1143147.
+1 @author. The patch does not contain any @author tags.
+1 tests included. The patch appears to include 3 new or modified tests.
+1 javadoc. The javadoc tool did not generate any warning messages.
+1 javac. The applied patch does not increase the total number of javac compiler warnings.
+1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
+1 release audit. The applied patch does not increase the total number of release audit warnings.
-1 core tests. The patch failed these core unit tests: org.apache.hadoop.hdfs.server.namenode.TestBackupNode org.apache.hadoop.hdfs.TestDFSShell org.apache.hadoop.hdfs.TestSeekBug
+1 contrib tests. The patch passed contrib unit tests.
+1 system test framework. The patch passed system test framework compile.
Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/879//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/879//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/879//console This message is automatically generated. Tests for HADOOP-7361 - Key: HDFS-2131 URL: https://issues.apache.org/jira/browse/HDFS-2131 Project: Hadoop HDFS Issue Type: Test Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Attachments: HADOOP-7361-test.patch
[jira] [Commented] (HDFS-2131) Tests for HADOOP-7361
[ https://issues.apache.org/jira/browse/HDFS-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060695#comment-13060695 ] Uma Maheswara Rao G commented on HDFS-2131: --- TestDFSShell will pass after committing the HADOOP-7361 patch. The remaining failures are not related to this patch. Tests for HADOOP-7361 - Key: HDFS-2131 URL: https://issues.apache.org/jira/browse/HDFS-2131 Project: Hadoop HDFS Issue Type: Test Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Attachments: HADOOP-7361-test.patch
[jira] [Created] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
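The failure mode described in this report can be sketched as follows; the two fields here are illustrative stand-ins, not the real EditLogFileOutputStream internals.

```java
import java.io.Closeable;
import java.io.IOException;

// Sketch of the HDFS-2132 leak: close() walks its resources in order, so
// an exception from the first close() means the second is never reached.
public class SequentialCloseSketch implements Closeable {
    private final Closeable buffer;     // stand-in for the first resource
    private final Closeable fileHandle; // stand-in for the later resource

    SequentialCloseSketch(Closeable buffer, Closeable fileHandle) {
        this.buffer = buffer;
        this.fileHandle = fileHandle;
    }

    @Override
    public void close() throws IOException {
        buffer.close();     // if this throws...
        fileHandle.close(); // ...this line never runs, leaking fileHandle
    }
}
```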
[jira] [Assigned] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers reassigned HDFS-2132: Assignee: Aaron T. Myers Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
[jira] [Commented] (HDFS-2131) Tests for HADOOP-7361
[ https://issues.apache.org/jira/browse/HDFS-2131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060738#comment-13060738 ] Daryn Sharp commented on HDFS-2131: --- These tests are probably better suited to {{TestHDFSCLI}}. Although I'm not fond of the custom framework and its weak integration with junit, testing for a -1 return is a feeble check since it can occur for any number of reasons. The {{TestHDFSCLI}} tests will let you verify that the exception output is correct. Come to think of it, I'm surprised the commands aren't failing with exit 1... -1 is usually a usage error. Tests for HADOOP-7361 - Key: HDFS-2131 URL: https://issues.apache.org/jira/browse/HDFS-2131 Project: Hadoop HDFS Issue Type: Test Reporter: Uma Maheswara Rao G Assignee: Uma Maheswara Rao G Attachments: HADOOP-7361-test.patch
[jira] [Commented] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060745#comment-13060745 ] Todd Lipcon commented on HDFS-2011: --- I'm working on merging this with HDFS-1073, and had one question: when do we expect that an editlog stream would be closed twice? In 1073 there are some extra asserts, so instead of ignoring the second close, it now throws java.io.IOException: Trying to use aborted output stream. I'm debating whether to remove this exception like you've done in this patch, vs remove the patch, since it seems like it might be indicative of a bug to close a stream twice. Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException and sometimes it doesn't take off a failed storage directory
[jira] [Updated] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers updated HDFS-2132: - Attachment: hdfs-2132.0.patch Patch which makes sure that {{EditLogFileOutputStream.close(...)}} cleans up after itself. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Attachments: hdfs-2132.0.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
[jira] [Updated] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers updated HDFS-2132: - Fix Version/s: 0.23.0 Status: Patch Available (was: Open) Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
[jira] [Commented] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060775#comment-13060775 ] Ravi Prakash commented on HDFS-2011: I had noticed close being called twice while testing this functionality. This was causing a NullPointerException the second time. The stack trace is given in the comment https://issues.apache.org/jira/browse/HDFS-2011?focusedCommentId=13041858&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13041858
{quote}
2011-04-05 17:36:56,187 INFO org.apache.hadoop.ipc.Server: IPC Server handler 87 on 8020, call getEditLogSize() from 98.137.97.99:35862: error: java.io.IOException: java.lang.NullPointerException
java.io.IOException: java.lang.NullPointerException
at org.apache.hadoop.hdfs.server.namenode.EditLogFileOutputStream.close(EditLogFileOutputStream.java:109)
at org.apache.hadoop.hdfs.server.namenode.FSEditLog.processIOError(FSEditLog.java:299)
at org.apache.hadoop.hdfs.server.namenode.FSEditLog.getEditLogSize(FSEditLog.java:849)
at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getEditLogSize(FSNamesystem.java:4270)
at org.apache.hadoop.hdfs.server.namenode.NameNode.getEditLogSize(NameNode.java:1095)
at sun.reflect.GeneratedMethodAccessor6.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:346)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1399)
at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1395)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1094)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1393)
{quote}
The bug itself is quite hard to reproduce.
I had to run my tests in an infinite loop, and the NullPointerException happened after 3-4 hours (each run of the test would take maybe 2 minutes). After the NullPointerException, the namenode would essentially be useless. Even hdfs dfs -ls would throw a NullPointerException. I am not sure myself which philosophy would be better. FileOutputStream itself ignores a second close. I checked this with the following program:
{noformat}
import java.io.*;

public class TestJAVA {
    public static void main(String[] args) {
        System.out.println("Hello World");
        try {
            FileOutputStream fos = new FileOutputStream("/tmp/ravi.txt");
            fos.write(50);
            fos.write(50);
            fos.write(50);
            fos.write(50);
            fos.write(50);
            fos.write(50);
            fos.close();
            fos.close();
        } catch (IOException ioe) {
            System.out.println("Hello California");
            System.out.println(ioe);
        }
        System.out.println("Hello Champaign");
    }
}
{noformat}
Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException and sometimes it doesn't take off a failed storage directory
[jira] [Commented] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060777#comment-13060777 ] Ravi Prakash commented on HDFS-2011: The program above output:
{noformat}
Hello World
Hello Champaign
{noformat}
Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException and sometimes it doesn't take off a failed storage directory
[jira] [Commented] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060780#comment-13060780 ] John George commented on HDFS-2011: --- If I remember right, it was a case of an incomplete create as opposed to close being called twice. So, the close() was being called on a stream that was not really created... Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException and sometimes it doesn't take off a failed storage directory
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060793#comment-13060793 ] Ravi Prakash commented on HDFS-2132: I am new to Hadoop so please forgive me if I do not understand the philosophies behind this patch. If any of the close methods fail, they will throw an IOException which will be propagated up the stack. Isn't this the way all JAVA works? Comments on your patch:
1. In normal operation all close methods within the try will be called once, and then once again in the IOUtils.cleanup method. What purpose does this serve? I would rather the methods be called only once.
2. In the finally block, all IOExceptions which might have been thrown are logged, and then programmatically swallowed. The upstream functions are never made aware of these IOExceptions and I am not sure this is the right behavior.
Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13060794#comment-13060794 ] Hadoop QA commented on HDFS-2132: - -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12485477/hdfs-2132.0.patch against trunk revision 1143147.
+1 @author. The patch does not contain any @author tags.
-1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
+1 javadoc. The javadoc tool did not generate any warning messages.
+1 javac. The applied patch does not increase the total number of javac compiler warnings.
+1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings.
+1 release audit. The applied patch does not increase the total number of release audit warnings.
+1 core tests. The patch passed core unit tests.
+1 contrib tests. The patch passed contrib unit tests.
+1 system test framework. The patch passed system test framework compile.
Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/880//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/880//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/880//console This message is automatically generated. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
[jira] [Updated] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers updated HDFS-2132: - Attachment: hdfs-2132.1.patch Whoops, uploaded the wrong patch. Here's one with tests. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed.
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060806#comment-13060806 ] Aaron T. Myers commented on HDFS-2132: -- bq. I am new to Hadoop so please forgive me if I do not understand the philosophies behind this patch. If any of the close methods fail, they will throw an IOException which will be propagated up the stack. Isn't this the way all JAVA works? This is indeed the way it works and is the desired behavior. The point of this patch is that when a close fails for any one of the {{Closeables}}, we should still make a last-ditch effort to close the others. If we can't close them then, there's nothing we can do. bq. 1. In normal operation all close methods within the try will be called once, and then once again in the IOUtils.cleanup method. What purpose does this serve? I would rather the methods be called only once. In the normal case all of the {{Closeables}} will be set to {{null}}. Note that {{IOUtils.cleanup(...)}} expressly handles {{nulls}}, and will not attempt to call {{close()}} again. bq. 2. In the finally block, all IOExceptions which might have been thrown are logged, and then programmatically swallowed. The upstream functions are never made aware of these IOExceptions and I am not sure this is the right behavior. It's true that in the exceptional case any failures to call {{close()}} in {{IOUtils.cleanup(...)}} will be logged and not propagated. This is exactly the intended behavior. Note that the original exception caused by the call to {{close()}} outside of {{IOUtils.cleanup(...)}} will still be propagated up. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. 
Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
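The pattern Aaron describes can be sketched as a small standalone model (this is not the actual EditLogFileOutputStream code; the Resource class and cleanup() helper below are illustrative stand-ins, with cleanup() mirroring the documented behavior of IOUtils.cleanup(...): skip nulls, swallow secondary exceptions):

```java
import java.io.Closeable;
import java.io.IOException;

// Simplified model of the close() pattern under discussion: try to close
// each resource in order, null out each field on success, and make a
// last-ditch attempt on whatever is left in the finally block.
class CloseSketch {
    static class Resource implements Closeable {
        boolean closed = false;
        final boolean failOnClose;
        Resource(boolean failOnClose) { this.failOnClose = failOnClose; }
        public void close() throws IOException {
            if (failOnClose) throw new IOException("injected failure");
            closed = true;
        }
    }

    Resource first = new Resource(true);   // this close() will throw
    Resource second = new Resource(false); // must still get closed

    void close() throws IOException {
        try {
            first.close();
            first = null;      // success: cleanup() below will skip it
            second.close();
            second = null;
        } finally {
            // Mirrors IOUtils.cleanup(log, closeables): nulls are skipped
            // and exceptions here are only logged, so the original
            // exception from the try block still propagates to the caller.
            cleanup(first, second);
        }
    }

    static void cleanup(Closeable... closeables) {
        for (Closeable c : closeables) {
            if (c == null) continue; // already closed successfully
            try {
                c.close();
            } catch (IOException e) {
                // would be logged in the real IOUtils.cleanup()
            }
        }
    }
}
```

With this shape, a failure in the first close() no longer leaks the second resource, and the caller still sees the original IOException.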
[jira] [Commented] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060812#comment-13060812 ] Todd Lipcon commented on HDFS-2011: --- In the HDFS-1073 branch, EditLogOutputStream now has separate close() and abort() methods. abort() is used when there has been some error on the stream and we expect to do an unclean close (ie without flushing). close() is used for clean closes. If close() itself fails, it will then proceed to abort() when the IO error is handled. So, I think the correct test case on the branch is to call abort() twice and make sure that's ignored, or call close() and then abort() to make sure that's ignored. Does that sound reasonable? Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException and sometimes it doesn't take off a failed storage directory -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2018) Move all journal stream management code into one place
[ https://issues.apache.org/jira/browse/HDFS-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060822#comment-13060822 ] Jitendra Nath Pandey commented on HDFS-2018: Some comments: 1. FileJournalManager.java getMaxLoadableTransaction can be made private. 2. JournalManager interface Instead of adding archiveLogsOlderThan to the interface, we could add purgeTransactions method (as in HDFS-1580 design). FileJournalManager could implement purgeTransactions using archives, i.e. instead of really deleting, it archives them. Since Checkpoints are being archived, we don't need to force any JournalManager to archive edit logs as well. Apart from the above, the patch looks good to me. +1 Move all journal stream management code into one place -- Key: HDFS-2018 URL: https://issues.apache.org/jira/browse/HDFS-2018 Project: Hadoop HDFS Issue Type: Improvement Reporter: Ivan Kelly Assignee: Ivan Kelly Fix For: Edit log branch (HDFS-1073) Attachments: HDFS-2018.diff, HDFS-2018.diff, HDFS-2018.diff, HDFS-2018.diff, HDFS-2018.diff, HDFS-2018.diff, HDFS-2018.diff, HDFS-2018.diff Currently in the HDFS-1073 branch, the code for creating output streams is in FileJournalManager and the code for input streams is in the inspectors. This change does a number of things. - Input and Output streams are now created by the JournalManager. - FSImageStorageInspectors now deals with URIs when referring to edit logs - Recovery of inprogress logs is performed by counting the number of transactions instead of looking at the length of the file. The patch for this applies on top of the HDFS-1073 branch + HDFS-2003 patch. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
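The alternative Jitendra suggests could look roughly like this (a hypothetical sketch of the interface shape only, not the HDFS-1073 branch code; every name other than purgeTransactions is invented for illustration):

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical interface shape: callers only ask for transactions older
// than a cutoff to be purged; how that happens is up to the manager.
interface JournalManagerSketch {
    void purgeTransactions(long minTxIdToKeep);
}

// A file-based manager can implement purge-by-archiving: segments whose
// last transaction precedes the cutoff are moved aside, not deleted.
class FileJournalManagerSketch implements JournalManagerSketch {
    final List<Long> liveSegmentEndTxIds = new ArrayList<>();
    final List<Long> archivedSegmentEndTxIds = new ArrayList<>();

    FileJournalManagerSketch(List<Long> segmentEndTxIds) {
        liveSegmentEndTxIds.addAll(segmentEndTxIds);
    }

    @Override
    public void purgeTransactions(long minTxIdToKeep) {
        Iterator<Long> it = liveSegmentEndTxIds.iterator();
        while (it.hasNext()) {
            long endTxId = it.next();
            if (endTxId < minTxIdToKeep) {
                archivedSegmentEndTxIds.add(endTxId); // archive, don't delete
                it.remove();
            }
        }
    }
}
```

The point of the shape is that other JournalManager implementations are free to really delete, since the interface only promises the transactions become unavailable.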
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060823#comment-13060823 ] Todd Lipcon commented on HDFS-2132: --- hrm, it doesn't look like bufReady should ever throw an IOE on close, right? it's just a memory buffer. But, fc.truncate() might throw an IOE - that seems like the more realistic case to worry about. Maybe that would be a better fault to inject for the test? Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers updated HDFS-2132: - Attachment: hdfs-2132.2.patch Thanks a lot for the review, Todd. Here's a patch which addresses your comment. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch, hdfs-2132.2.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060847#comment-13060847 ] Hadoop QA commented on HDFS-2132: - +1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12485480/hdfs-2132.1.patch against trunk revision 1143147. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. +1 system test framework. The patch passed system test framework compile. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/881//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/881//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/881//console This message is automatically generated. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch, hdfs-2132.2.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. 
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060853#comment-13060853 ] John George commented on HDFS-2011: --- I think calling 1. abort() twice 2. close() twice 3. close() followed by an abort() would test most cases. Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException and sometimes it doesn't take off a failed storage directory -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
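Assuming the desired semantics are that close() and abort() each become no-ops once the stream has already shut down, the three cases John lists can be checked against a toy state machine like this (EditLogOutputStream itself is not modeled; the states and no-op rule here are an assumption based on Todd's comment above):

```java
// Toy state machine for a stream with separate clean (close) and unclean
// (abort) shutdown paths. The real close() can throw IOException while
// flushing; that is omitted here for brevity.
class StreamSketch {
    enum State { OPEN, CLOSED, ABORTED }
    State state = State.OPEN;

    void close() {
        if (state != State.OPEN) return; // repeated close/abort is ignored
        // ... flush buffered edits here in the real stream ...
        state = State.CLOSED;
    }

    void abort() {
        if (state != State.OPEN) return; // already shut down: ignore
        state = State.ABORTED;
    }
}
```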
[jira] [Updated] (HDFS-1977) Stop using StringUtils.stringifyException()
[ https://issues.apache.org/jira/browse/HDFS-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bharath Mundlapudi updated HDFS-1977: - Attachment: HDFS-1977-2.patch Things changed since the last post; reattaching with new changes. Stop using StringUtils.stringifyException() --- Key: HDFS-1977 URL: https://issues.apache.org/jira/browse/HDFS-1977 Project: Hadoop HDFS Issue Type: Improvement Reporter: Joey Echeverria Assignee: Bharath Mundlapudi Priority: Minor Attachments: HDFS-1977-1.patch, HDFS-1977-2.patch The old version of the logging APIs didn't support logging stack traces by passing exceptions to the logging methods (e.g. Log.error()). A number of log statements make use of StringUtils.stringifyException() to get around the old behavior. It would be nice if this could get cleaned up to make use of the logger's stack trace printing. This also gives users more control since you can configure how the stack traces are written to the logs. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-1979) HDFS-1073: Fix backupnode for new edits/image layout
[ https://issues.apache.org/jira/browse/HDFS-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1979: -- Attachment: hdfs-1979.txt Cleaned up the patch. I think this should be ready to go. Here's a summary of some of the changes to make the rather large patch easier to follow: BackupNode itself: - no longer uses the spool file. Instead, the state tracks whether the BN is in sync or journaling only. In essence, the next log segment is used as the spool file. - lots of refactoring so that checkpoint code is primarily shared with the SecondaryNameNode. We could pull this into a new CheckpointUtils class or something, but didn't want to make that change here since it would make the patch even larger. - moved the BN-specific RPCs into a new BackupNodeProtocol instead of sharing NameNodeProtocol. This makes sense since the NN as is was just throwing exceptions on those calls. - split the BN RPCs into several pieces, rather than using journal() for everything. This makes the API easier to follow - fixed bugs where the NN would send uncheckpointed txns to the BackupNode (BN is currently non-functional in trunk) EditLog: - added new BackupJournalManager to coordinate talking to BN - added new parameter to start/end log segment about whether to include the special START/END transactions. This was necessary since the BN will receive these replicated from the NN, and thus shouldn't add its own in addition to what the NN wrote. BackupImage/FSImage: - new concept of lastAppliedTxId which tracks the latest txnid reflected by the namesystem. Some refactoring done so that this is properly tracked during image loading, etc. We used to simply use the edit log's last written txid for this, but in the case of the BN the edit log may be writing ahead of where the NS actually reflects. Storage inspector: - refactored out the planning of loading logs from which image. 
This will probably get changed again by the work in HDFS-1579, but this was the minimal change to get this working. Used when the BN is synchronizing with the NN. Tests: - added new test for the BN that makes sure it can stay in sync with the NN, replicates edits identically, etc. - split CN test and BN tests into separate methods to be easier to run just one - removed testBackupRegistration since we no longer have to enforce only-one-backupnode HDFS-1073: Fix backupnode for new edits/image layout Key: HDFS-1979 URL: https://issues.apache.org/jira/browse/HDFS-1979 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Todd Lipcon Assignee: Todd Lipcon Attachments: hdfs-1979-prelim.txt, hdfs-1979.txt -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060872#comment-13060872 ] Ravi Prakash commented on HDFS-2132: Thanks Aaron for the explanation! :) I agree. I might be missing a trick (again ;-) ), but are you sure the Closeables will be null after .close()? Won't they be references pointing to a closed stream, and so close() will be called twice on them? I don't see an easy way to avoid that though. So cool. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch, hdfs-2132.2.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060877#comment-13060877 ] Aaron T. Myers commented on HDFS-2132: -- bq. Thanks Aaron for the explanation! No problem. bq. I might be missing a trick (again ) , but are you sure the Closeables will be null after .close()? Well, now that I look at it, you've effectively caught a bug. :) The previous code was expressly setting {{bufReady}} and {{bufCurrent}} to {{null}}, but not {{fp}} or {{fc}}. My patch didn't touch that code, but it might as well fix it. I'll upload another patch in a moment. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch, hdfs-2132.2.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aaron T. Myers updated HDFS-2132: - Attachment: hdfs-2132.3.patch Patch which addresses the issue Ravi pointed out. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch, hdfs-2132.2.patch, hdfs-2132.3.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2065) Fix NPE in DFSClient.getFileChecksum
[ https://issues.apache.org/jira/browse/HDFS-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo (Nicholas), SZE updated HDFS-2065: - Hadoop Flags: [Reviewed] Status: Patch Available (was: Open) +1 patch looks good. Fix NPE in DFSClient.getFileChecksum Key: HDFS-2065 URL: https://issues.apache.org/jira/browse/HDFS-2065 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Bharath Mundlapudi Assignee: Bharath Mundlapudi Fix For: 0.23.0 Attachments: HDFS-2065-1.patch The following code can throw an NPE if callGetBlockLocations returns null, i.e. if the server returns null: {code} List<LocatedBlock> locatedblocks = callGetBlockLocations(namenode, src, 0, Long.MAX_VALUE).getLocatedBlocks(); {code} The right fix is for the server to throw the right exception. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
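A defensive client-side guard would avoid the NPE until the server throws a proper exception. The standalone model below is a sketch of that idea, not the DFSClient code; the lookup stub and the choice of FileNotFoundException are assumptions:

```java
import java.io.FileNotFoundException;
import java.io.IOException;
import java.util.Collections;
import java.util.List;

// Standalone model of the null-check: a lookup that may return null,
// converted into a meaningful exception instead of a later NPE.
class ChecksumSketch {
    // Stand-in for callGetBlockLocations(...); returns null for unknown paths.
    static List<String> getBlockLocations(String src) {
        return "/known".equals(src) ? Collections.singletonList("blk_1") : null;
    }

    static List<String> locatedBlocksOrThrow(String src) throws IOException {
        List<String> blocks = getBlockLocations(src);
        if (blocks == null) {
            // Surface the real problem rather than NPE'ing downstream.
            throw new FileNotFoundException("File does not exist: " + src);
        }
        return blocks;
    }
}
```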
[jira] [Updated] (HDFS-1979) HDFS-1073: Fix backupnode for new edits/image layout
[ https://issues.apache.org/jira/browse/HDFS-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1979: -- Attachment: hdfs-1979.txt Slight update to patch: I had forgotten to implement releaseBackupStreams properly. HDFS-1073: Fix backupnode for new edits/image layout Key: HDFS-1979 URL: https://issues.apache.org/jira/browse/HDFS-1979 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Todd Lipcon Assignee: Todd Lipcon Attachments: hdfs-1979-prelim.txt, hdfs-1979.txt, hdfs-1979.txt -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-1977) Stop using StringUtils.stringifyException()
[ https://issues.apache.org/jira/browse/HDFS-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060904#comment-13060904 ] Hadoop QA commented on HDFS-1977: - -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12485490/HDFS-1977-2.patch against trunk revision 1143147. +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. +1 core tests. The patch passed core unit tests. +1 contrib tests. The patch passed contrib unit tests. +1 system test framework. The patch passed system test framework compile. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/884//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/884//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/884//console This message is automatically generated. Stop using StringUtils.stringifyException() --- Key: HDFS-1977 URL: https://issues.apache.org/jira/browse/HDFS-1977 Project: Hadoop HDFS Issue Type: Improvement Reporter: Joey Echeverria Assignee: Bharath Mundlapudi Priority: Minor Attachments: HDFS-1977-1.patch, HDFS-1977-2.patch The old version of the logging APIs didn't support logging stack traces by passing exceptions to the logging methods (e.g. Log.error()). 
A number of log statements make use of StringUtils.stringifyException() to get around the old behavior. It would be nice if this could get cleaned up to make use of the logger's stack trace printing. This also gives users more control since you can configure how the stack traces are written to the logs. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2128) Support for pluggable Trash policies
[ https://issues.apache.org/jira/browse/HDFS-2128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060909#comment-13060909 ] Usman Masood commented on HDFS-2128: The issue is to choose the right interface for pluggable Trash modules. Currently the public methods are: - moveToTrash(..) - getEmptier() - checkpoint() - expunge() The first two methods should be part of the Trash interface, but I'm not sure about the last two. Not every Trash policy should be required to implement a checkpoint mechanism. Currently expunge() and checkpoint() are used by FsShell for the -expunge arg. Support for pluggable Trash policies Key: HDFS-2128 URL: https://issues.apache.org/jira/browse/HDFS-2128 Project: Hadoop HDFS Issue Type: Improvement Components: name-node Reporter: dhruba borthakur Assignee: dhruba borthakur It would be beneficial to make the Trash policy pluggable. One primary use-case for this is to archive files (in some remote store) when they get removed by Trash emptier. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
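One way to split the interface along the lines Usman suggests is to keep the first two methods mandatory and make the checkpoint mechanism optional. This is a hypothetical shape, not the committed TrashPolicy API, and the names are illustrative (the real methods would also throw IOException):

```java
// Core contract every Trash policy must provide; the checkpoint mechanism
// is optional, so policies without checkpoints keep the defaults and
// FsShell -expunge can detect and report the lack of support.
interface TrashPolicySketch {
    boolean moveToTrash(String path);
    Runnable getEmptier();

    default void checkpoint() {
        throw new UnsupportedOperationException("no checkpoint support");
    }
    default void expunge() {
        throw new UnsupportedOperationException("no checkpoint support");
    }
}
```

A policy that archives to a remote store would then only implement moveToTrash() and getEmptier(), exactly the use-case in the issue description.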
[jira] [Commented] (HDFS-1977) Stop using StringUtils.stringifyException()
[ https://issues.apache.org/jira/browse/HDFS-1977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060910#comment-13060910 ] Bharath Mundlapudi commented on HDFS-1977: -- This patch doesn't include unit tests, since it's just adapting to the new logging API. No new tests are required. Stop using StringUtils.stringifyException() --- Key: HDFS-1977 URL: https://issues.apache.org/jira/browse/HDFS-1977 Project: Hadoop HDFS Issue Type: Improvement Reporter: Joey Echeverria Assignee: Bharath Mundlapudi Priority: Minor Attachments: HDFS-1977-1.patch, HDFS-1977-2.patch The old version of the logging APIs didn't support logging stack traces by passing exceptions to the logging methods (e.g. Log.error()). A number of log statements make use of StringUtils.stringifyException() to get around the old behavior. It would be nice if this could get cleaned up to make use of the logger's stack trace printing. This also gives users more control since you can configure how the stack traces are written to the logs. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
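The change itself is mechanical. Here java.util.logging stands in for the commons-logging Log used in HDFS (an assumption made so the sketch is self-contained; the shape is the same: pass the Throwable to the logger instead of stringifying it into the message):

```java
import java.util.logging.Level;
import java.util.logging.Logger;

class LoggingSketch {
    private static final Logger LOG = Logger.getLogger("sketch");

    static void demo() {
        try {
            throw new IllegalStateException("boom");
        } catch (IllegalStateException e) {
            // Old style: stack trace flattened into the message string, e.g.
            // LOG.severe("Operation failed: " + StringUtils.stringifyException(e));

            // New style: hand the exception to the logger and let the
            // configured handler/formatter decide how to render the trace.
            LOG.log(Level.SEVERE, "Operation failed", e);
        }
    }
}
```

Because the Throwable travels with the log record, operators can configure how (or whether) traces are rendered, which is the user-control benefit the issue description mentions.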
[jira] [Updated] (HDFS-1780) reduce need to rewrite fsimage on startup
[ https://issues.apache.org/jira/browse/HDFS-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1780: -- Attachment: hdfs-1780.txt Here's a patch on the 1073 branch which implements this. reduce need to rewrite fsimage on startup -- Key: HDFS-1780 URL: https://issues.apache.org/jira/browse/HDFS-1780 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Daryn Sharp Attachments: hdfs-1780.txt On startup, the namenode will read the fs image, apply edits, then rewrite the fs image. This requires a non-trivial amount of time for very large directory structures. Perhaps the namenode should employ some logic to decide that the edits are simple enough that it doesn't warrant rewriting the image back out to disk. A few ideas: Use the size of the edit logs: if the size is below a threshold, assume it's cheaper to reprocess the edit log instead of writing the image back out. Time the processing of the edits, and if the time is below a defined threshold, the image isn't rewritten. Time the reading of the image and the processing of the edits, and base the decision on the time it would take to write the image (a multiplier is applied to the read time?) versus the time it would take to reprocess the edits. If a certain threshold (perhaps percentage or expected time to rewrite) is exceeded, rewrite the image. Something along the lines of the last suggestion may allow for defaults that adapt for any size cluster, thus eliminating the need to keep tweaking a cluster's settings based on its size. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
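The last idea in the description can be sketched as a simple predicate. The names, the multiplier, and the comparison below are illustrative assumptions, not the criteria actually used by the patch on the 1073 branch:

```java
// Decide whether replaying the edits again at the next startup is expected
// to be cheaper than rewriting the whole image back out now.
class SaveDecisionSketch {
    static boolean needToSave(long imageReadMillis, long editsReplayMillis,
                              double writeCostMultiplier) {
        // Estimated cost of writing the image, derived from how long it
        // took to read it (the multiplier is the tunable guess).
        double estimatedWriteMillis = imageReadMillis * writeCostMultiplier;
        // Rewrite only if replaying the edits costs more than writing a
        // fresh image would.
        return editsReplayMillis > estimatedWriteMillis;
    }
}
```

Basing the decision on measured times rather than fixed byte thresholds is what lets the defaults adapt to any cluster size, as the description suggests.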
[jira] [Updated] (HDFS-1780) reduce need to rewrite fsimage on startup
[ https://issues.apache.org/jira/browse/HDFS-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1780: -- Affects Version/s: Edit log branch (HDFS-1073) Fix Version/s: Edit log branch (HDFS-1073) Assignee: Todd Lipcon reduce need to rewrite fsimage on startup -- Key: HDFS-1780 URL: https://issues.apache.org/jira/browse/HDFS-1780 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Daryn Sharp Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-1780.txt On startup, the namenode will read the fs image, apply edits, then rewrite the fs image. This requires a non-trivial amount of time for very large directory structures. Perhaps the namenode should employ some logic to decide that the edits are simple enough that it doesn't warrant rewriting the image back out to disk. A few ideas: Use the size of the edit logs: if the size is below a threshold, assume it's cheaper to reprocess the edit log instead of writing the image back out. Time the processing of the edits, and if the time is below a defined threshold, the image isn't rewritten. Time the reading of the image and the processing of the edits, and base the decision on the time it would take to write the image (a multiplier is applied to the read time?) versus the time it would take to reprocess the edits. If a certain threshold (perhaps percentage or expected time to rewrite) is exceeded, rewrite the image. Something along the lines of the last suggestion may allow for defaults that adapt for any size cluster, thus eliminating the need to keep tweaking a cluster's settings based on its size. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2132) Potential resource leak in EditLogFileOutputStream.close
[ https://issues.apache.org/jira/browse/HDFS-2132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060938#comment-13060938 ] Hadoop QA commented on HDFS-2132: - -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12485485/hdfs-2132.2.patch against trunk revision 1143147. +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these core unit tests: -1 contrib tests. The patch failed contrib unit tests. -1 system test framework. The patch failed system test framework compile. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/882//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/882//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/882//console This message is automatically generated. Potential resource leak in EditLogFileOutputStream.close Key: HDFS-2132 URL: https://issues.apache.org/jira/browse/HDFS-2132 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Aaron T. Myers Assignee: Aaron T. Myers Fix For: 0.23.0 Attachments: hdfs-2132.0.patch, hdfs-2132.1.patch, hdfs-2132.2.patch, hdfs-2132.3.patch {{EditLogFileOutputStream.close(...)}} sequentially closes a series of underlying resources. If any of the calls to {{close()}} throw before the last one, the later resources will never be closed. -- This message is automatically generated by JIRA. 
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-2065) Fix NPE in DFSClient.getFileChecksum
[ https://issues.apache.org/jira/browse/HDFS-2065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060939#comment-13060939 ] Hadoop QA commented on HDFS-2065: - -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12483562/HDFS-2065-1.patch against trunk revision 1143147. +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. +1 findbugs. The patch does not introduce any new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these core unit tests: -1 contrib tests. The patch failed contrib unit tests. +1 system test framework. The patch passed system test framework compile. Test results: https://builds.apache.org/job/PreCommit-HDFS-Build/886//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HDFS-Build/886//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html Console output: https://builds.apache.org/job/PreCommit-HDFS-Build/886//console This message is automatically generated. Fix NPE in DFSClient.getFileChecksum Key: HDFS-2065 URL: https://issues.apache.org/jira/browse/HDFS-2065 Project: Hadoop HDFS Issue Type: Bug Affects Versions: 0.23.0 Reporter: Bharath Mundlapudi Assignee: Bharath Mundlapudi Fix For: 0.23.0 Attachments: HDFS-2065-1.patch The following code can throw NPE if callGetBlockLocations returns null. 
If the server returns null: {code} List<LocatedBlock> locatedblocks = callGetBlockLocations(namenode, src, 0, Long.MAX_VALUE).getLocatedBlocks(); {code} The right fix for this is that the server should throw the right exception. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
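A null guard of the kind being discussed could look like the following self-contained sketch. The stand-in types and the choice of FileNotFoundException are illustrative assumptions, not the committed fix (in DFSClient the RPC result type is LocatedBlocks):

```java
import java.io.FileNotFoundException;
import java.util.Collections;
import java.util.List;

class ChecksumGuard {
    // Stand-in for the namenode RPC; in DFSClient this returns LocatedBlocks,
    // which may be null when the file does not exist.
    static List<String> callGetBlockLocations(String src) {
        return src.startsWith("/exists") ? Collections.singletonList("blk_1") : null;
    }

    // Check the server response before dereferencing it, instead of
    // chaining .getLocatedBlocks() directly onto the RPC result --
    // that chained call is what triggered the NPE.
    static List<String> getBlocks(String src) throws FileNotFoundException {
        List<String> blocks = callGetBlockLocations(src);
        if (blocks == null) {
            throw new FileNotFoundException("File does not exist: " + src);
        }
        return blocks;
    }
}
```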
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060949#comment-13060949 ] Aaron T. Myers commented on HDFS-1896: -- I just took a manual pass through the commits to trunk which have occurred since the branch was created for HDFS-1073. Here's the list of JIRAs which I think may be relevant: * HDFS-2011 * HDFS-1955 * HDFS-988 * HDFS-2041 * HDFS-2030 * HDFS-2003 * HDFS-1948 * HDFS-1149 * HDFS-1969 * HDFS-1636 * HDFS-1936 I'm going to try to manually verify that these fixes did not regress in the HDFS-1073 branch. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-1780) reduce need to rewrite fsimage on startup
[ https://issues.apache.org/jira/browse/HDFS-1780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1780: -- Attachment: hdfs-1780.txt One small fix -- needed to fix FSImageTransactionalStorageInspector.needToSave() so that it forces a save if any of the image directories are missing a VERSION file. This bug was causing TestNameEditsConfigs to fail. reduce need to rewrite fsimage on startup -- Key: HDFS-1780 URL: https://issues.apache.org/jira/browse/HDFS-1780 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Daryn Sharp Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-1780.txt, hdfs-1780.txt On startup, the namenode will read the fs image, apply edits, then rewrite the fs image. This requires a non-trivial amount of time for very large directory structures. Perhaps the namenode should employ some logic to decide that the edits are simple enough that it doesn't warrant rewriting the image back out to disk. A few ideas:
* Use the size of the edit logs: if the size is below a threshold, assume it's cheaper to reprocess the edit log instead of writing the image back out.
* Time the processing of the edits: if the time is below a defined threshold, the image isn't rewritten.
* Time the reading of the image and the processing of the edits, and base the decision on the time it would take to write the image (a multiplier applied to the read time?) versus the time it would take to reprocess the edits. If a certain threshold (perhaps a percentage, or the expected time to rewrite) is exceeded, rewrite the image.
Something along the lines of the last suggestion may allow for defaults that adapt to any size cluster, thus eliminating the need to keep tweaking a cluster's settings based on its size. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
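The last, adaptive heuristic could be sketched as follows. The method name, the multiplier parameter, and the millisecond units are illustrative assumptions, not anything from the attached patch:

```java
class SaveImageHeuristic {
    // Rewrite the fsimage only when replaying the edits would cost more
    // than the estimated cost of writing the image, where the write cost
    // is modeled as a multiple of the measured image read time.
    static boolean shouldSaveImage(long imageReadMillis,
                                   long editsReplayMillis,
                                   double writeCostMultiplier) {
        long estimatedWriteMillis = (long) (imageReadMillis * writeCostMultiplier);
        return editsReplayMillis > estimatedWriteMillis;
    }
}
```

Because both inputs are measured on the cluster itself, the same default multiplier could work for small and large namespaces alike, which is the adaptive property the suggestion is after.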
[jira] [Updated] (HDFS-2011) Removal and restoration of storage directories on checkpointing failure doesn't work properly
[ https://issues.apache.org/jira/browse/HDFS-2011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-2011: -- Attachment: elfos-close-patch-on-1073.txt Here's the patch I'm planning to commit to 1073 branch. Look good? I will also do some stress testing similar to what Ravi described on the branch to see if I can reproduce the issue he saw. Removal and restoration of storage directories on checkpointing failure doesn't work properly - Key: HDFS-2011 URL: https://issues.apache.org/jira/browse/HDFS-2011 Project: Hadoop HDFS Issue Type: Bug Components: name-node Affects Versions: 0.23.0 Reporter: Ravi Prakash Assignee: Ravi Prakash Fix For: 0.23.0 Attachments: HDFS-2011.3.patch, HDFS-2011.4.patch, HDFS-2011.5.patch, HDFS-2011.6.patch, HDFS-2011.7.patch, HDFS-2011.8.patch, HDFS-2011.patch, HDFS-2011.patch, HDFS-2011.patch, elfos-close-patch-on-1073.txt Removal and restoration of storage directories on checkpointing failure doesn't work properly. Sometimes it throws a NullPointerException, and sometimes it fails to remove a failed storage directory -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HDFS-2133) 1073: address remaining TODOs and pre-merge cleanup
1073: address remaining TODOs and pre-merge cleanup --- Key: HDFS-2133 URL: https://issues.apache.org/jira/browse/HDFS-2133 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Todd Lipcon Assignee: Todd Lipcon -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2133) 1073: address remaining TODOs and pre-merge cleanup
[ https://issues.apache.org/jira/browse/HDFS-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-2133: -- Component/s: name-node Description: There are a few TODOs still in the code and a bit of cleanup to be done before merging HDFS-1073. This JIRA is for this misc cleanup. Affects Version/s: Edit log branch (HDFS-1073) Fix Version/s: Edit log branch (HDFS-1073) 1073: address remaining TODOs and pre-merge cleanup --- Key: HDFS-2133 URL: https://issues.apache.org/jira/browse/HDFS-2133 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) There are a few TODOs still in the code and a bit of cleanup to be done before merging HDFS-1073. This JIRA is for this misc cleanup. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2133) 1073: address remaining TODOs and pre-merge cleanup
[ https://issues.apache.org/jira/browse/HDFS-2133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-2133: -- Attachment: hdfs-2133.txt Patch addresses the following:
- removes an extra setReadyToFlush/flush call in EditLogFileOutputStream.close. This snuck in when we did some refactoring, but seems unnecessary, since we always flush before closing a stream anyway. Tests seem to be passing even when I remove this (and the next bit of that same function already verifies that there isn't any unflushed data in the buffer)
{code:title=FSImage.java}
-// TODO need to discuss what the correct logic is for determing which
-// storage directory to read properties from
 sdForProperties.read();
{code}
This TODO is invalid -- when inspecting the dirs at startup, we already call {{read()}} for each directory. That means that we've verified that they all contain the same data. Since VERSION files are now just namespace info, and nothing related to checkpoint times or versions, it doesn't matter which one we read() from here.
{code:title=FSImage.java}
-    storage.writeAll(); // TODO is this a good spot for this?
-
+    storage.writeAll();
{code}
Yes, I think it's a good spot :) Eli had commented that he agreed in an earlier code review, but I hadn't removed it at that point. This {{writeAll}} call is necessary when adding new directories to a NN, for example.
- various changes to remove checkpointTxId from NameNodeRegistration and CheckpointCommand. A checkpoint txid is no longer relevant when deciding whether to allow a checkpoint to take place, since we can distinguish between different checkpoints at different txids.
- various javadoc additions where things were incorrect or incomplete 1073: address remaining TODOs and pre-merge cleanup --- Key: HDFS-2133 URL: https://issues.apache.org/jira/browse/HDFS-2133 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-2133.txt There are a few TODOs still in the code and a bit of cleanup to be done before merging HDFS-1073. This JIRA is for this misc cleanup. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060964#comment-13060964 ] Aaron T. Myers commented on HDFS-1896: -- Oh, also, though it's not committed yet, HDFS-2132 likely will be soon. This will also need to be ported to the HDFS-1073 branch. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-503) Implement erasure coding as a layer on HDFS
[ https://issues.apache.org/jira/browse/HDFS-503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060975#comment-13060975 ] dhruba borthakur commented on HDFS-503: --- 1. Raid has no impact on the dfsadmin -report command. 2. You won't be able to set a replication factor of 0. You would have to manually pull the plug on (kill) a datanode to see how raid works. 3. stripe locations do not contribute to the split locations of a block, thus they are not used for map-reduce locality. Implement erasure coding as a layer on HDFS --- Key: HDFS-503 URL: https://issues.apache.org/jira/browse/HDFS-503 Project: Hadoop HDFS Issue Type: New Feature Components: contrib/raid Reporter: dhruba borthakur Assignee: dhruba borthakur Fix For: 0.21.0 Attachments: raid1.txt, raid2.txt The goal of this JIRA is to discuss how the cost of raw storage for an HDFS file system can be reduced. Keeping three copies of the same data is very costly, especially when the size of storage is huge. One idea is to reduce the replication factor and do erasure coding of a set of blocks so that the overall probability of failure of a block remains the same as before. Many forms of error-correcting codes are available, see http://en.wikipedia.org/wiki/Erasure_code. Also, recent research from CMU has described DiskReduce https://opencirrus.org/system/files/Gibson-OpenCirrus-June9-09.ppt. My opinion is to discuss implementation strategies that are not part of base HDFS, but rather a layer on top of HDFS. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
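For a rough sense of the storage savings being discussed: 3-way replication costs 3x the raw data size, while an erasure code over a stripe of data blocks plus parity blocks costs (data + parity) / data. A minimal sketch of that arithmetic; the stripe widths below are illustrative, not the actual contrib/raid defaults:

```java
class RaidOverhead {
    // Effective storage multiplier for a Reed-Solomon style stripe:
    // total blocks stored divided by useful data blocks.
    static double erasureOverhead(int dataBlocks, int parityBlocks) {
        return (double) (dataBlocks + parityBlocks) / dataBlocks;
    }
}
```

For example, a hypothetical stripe of 10 data blocks with 4 parity blocks stores 14 blocks for 10 blocks of data, a 1.4x multiplier versus 3.0x for triple replication, while still tolerating the loss of any 4 blocks in the stripe.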
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13060981#comment-13060981 ] Aaron T. Myers commented on HDFS-1896: -- I've gone through the list. Most things appear to be fine. The only issues I found were the following: # HDFS-1955 - This appears to have entirely regressed in 1073. # HDFS-1149 - I bet the change to {{NNStorage.setFields(...)}} will cause the upgrade tests to break. At the very least, there are now some unused imports in {{NNStorage}} on the 1073 branch. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2134) Move DecommissionManager to block management
[ https://issues.apache.org/jira/browse/HDFS-2134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo (Nicholas), SZE updated HDFS-2134: - Affects Version/s: 0.23.0 Status: Patch Available (was: Open) Move DecommissionManager to block management Key: HDFS-2134 URL: https://issues.apache.org/jira/browse/HDFS-2134 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Affects Versions: 0.23.0 Reporter: Tsz Wo (Nicholas), SZE Assignee: Tsz Wo (Nicholas), SZE Attachments: h2134_20110706.patch Datanode management including {{DecommissionManager}} should belong to block management. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2134) Move DecommissionManager to block management
[ https://issues.apache.org/jira/browse/HDFS-2134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tsz Wo (Nicholas), SZE updated HDFS-2134: - Attachment: h2134_20110706.patch h2134_20110706.patch: moving the code. Move DecommissionManager to block management Key: HDFS-2134 URL: https://issues.apache.org/jira/browse/HDFS-2134 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Affects Versions: 0.23.0 Reporter: Tsz Wo (Nicholas), SZE Assignee: Tsz Wo (Nicholas), SZE Attachments: h2134_20110706.patch Datanode management including {{DecommissionManager}} should belong to block management. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HDFS-2134) Move DecommissionManager to block management
Move DecommissionManager to block management Key: HDFS-2134 URL: https://issues.apache.org/jira/browse/HDFS-2134 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Reporter: Tsz Wo (Nicholas), SZE Assignee: Tsz Wo (Nicholas), SZE Datanode management including {{DecommissionManager}} should belong to block management. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2104) 1073: Add a flag to 2NN to format its checkpoint dirs on startup
[ https://issues.apache.org/jira/browse/HDFS-2104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-2104: -- Attachment: hdfs-2104.txt Added a -format flag. While I was at it, I replaced the ad-hoc parsing code with Commons-CLI. I elected _not_ to have it ask for confirmation, since this is only formatting the secondary and not a primary. Our old behavior was basically to overwrite the local image anyway, so this isn't a regression in safety. 1073: Add a flag to 2NN to format its checkpoint dirs on startup Key: HDFS-2104 URL: https://issues.apache.org/jira/browse/HDFS-2104 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Todd Lipcon Attachments: hdfs-2104.txt -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061008#comment-13061008 ] Todd Lipcon commented on HDFS-1896: --- bq. I bet the change to NNStorage.setFields(...) will cause the upgrade tests to break. hmm, you sure you mean 1149 (the lease reassignment fix?) I'm not sure I see how that relates to NNStorage. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061009#comment-13061009 ] Todd Lipcon commented on HDFS-1896: --- btw, thanks for looking into these details. I'll file a JIRA about fixing 1955. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HDFS-2135) 1073: fix regression of HDFS-1955 in branch
1073: fix regression of HDFS-1955 in branch --- Key: HDFS-2135 URL: https://issues.apache.org/jira/browse/HDFS-2135 Project: Hadoop HDFS Issue Type: Sub-task Reporter: Todd Lipcon Assignee: Todd Lipcon -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2135) 1073: fix regression of HDFS-1955 in branch
[ https://issues.apache.org/jira/browse/HDFS-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-2135: -- Description: atm went through the NN storage-related JIRAs committed in trunk since HDFS-1073 was branched, and noted that it looked like HDFS-1955 is regressed on the branch. This JIRA is to investigate and fix as necessary. Affects Version/s: Edit log branch (HDFS-1073) Fix Version/s: Edit log branch (HDFS-1073) 1073: fix regression of HDFS-1955 in branch --- Key: HDFS-2135 URL: https://issues.apache.org/jira/browse/HDFS-2135 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) atm went through the NN storage-related JIRAs committed in trunk since HDFS-1073 was branched, and noted that it looked like HDFS-1955 is regressed on the branch. This JIRA is to investigate and fix as necessary. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061011#comment-13061011 ] Aaron T. Myers commented on HDFS-1896: -- Sorry, I meant HDFS-1969 not HDFS-1149. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-2135) 1073: fix regression of HDFS-1955 in branch
[ https://issues.apache.org/jira/browse/HDFS-2135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-2135: -- Attachment: hdfs-2135.txt Was a fairly trivial error - it was using the errorSDs list, which was getting cleared halfway through the function. I switched it to look at how many storage dirs got removed by asking the storage instead. Unfortunately it's difficult to write a unit test, as was observed in the original JIRA. So, I tested by hand by adding a fault in the saving code for one of the dirs. 1073: fix regression of HDFS-1955 in branch --- Key: HDFS-2135 URL: https://issues.apache.org/jira/browse/HDFS-2135 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-2135.txt atm went through the NN storage-related JIRAs committed in trunk since HDFS-1073 was branched, and noted that it looked like HDFS-1955 is regressed on the branch. This JIRA is to investigate and fix as necessary. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HDFS-1896) Additional QA tasks for Edit Log branch
[ https://issues.apache.org/jira/browse/HDFS-1896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13061014#comment-13061014 ] Todd Lipcon commented on HDFS-1896: --- I just manually tested HDFS-1969's new functionality and it seems to be working: - formatted NN using 0.20 - ran NN from 1073 branch - it complained about wrong layout - ran 1073 NN with -upgrade flag, it started with upgrade - ran 1073 NN with -rollback flag, it correctly complained - ran 0.20 NN with -rollback flag, it rolled back to old namespace Once all of the currently outstanding patches are applied, the upgrade tests also seem to be passing, so I think we're OK on that one. I agree there are lots of unused imports. I'll do a pass to clean them up right before we merge. Additional QA tasks for Edit Log branch --- Key: HDFS-1896 URL: https://issues.apache.org/jira/browse/HDFS-1896 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) As we close out tasks in the HDFS-1073 branch, there are a few places where I've noticed that we lack some test coverage. Creating this ticket just as a place to jot down some notes on things that we ought to make sure are tested, preferably by automated (unit) tests. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-1794) Add code to list which edit logs are available on a remote NN
[ https://issues.apache.org/jira/browse/HDFS-1794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1794: -- Hadoop Flags: [Reviewed] Add code to list which edit logs are available on a remote NN - Key: HDFS-1794 URL: https://issues.apache.org/jira/browse/HDFS-1794 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-1794.txt, hdfs-1794.txt When the 2NN or BN needs to sync up with the primary NN, it may need to download several different edits files since the NN may roll whenever it likes. This JIRA adds a new type called RemoteEditLogManifest to list the available edit log files since a given transaction ID. This may also be useful for monitoring or backup tools down the road. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-1993) TestCheckpoint needs to clean up between cases
[ https://issues.apache.org/jira/browse/HDFS-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1993: -- Hadoop Flags: [Reviewed] TestCheckpoint needs to clean up between cases -- Key: HDFS-1993 URL: https://issues.apache.org/jira/browse/HDFS-1993 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node, test Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-1993.txt TestCheckpoint currently relies on some test ordering in order to pass correctly. Instead it should clean itself up in a setUp() method. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-1792) Add code to detect valid length of an edits file
[ https://issues.apache.org/jira/browse/HDFS-1792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1792: -- Hadoop Flags: [Reviewed] Add code to detect valid length of an edits file Key: HDFS-1792 URL: https://issues.apache.org/jira/browse/HDFS-1792 Project: Hadoop HDFS Issue Type: Sub-task Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-1792.txt In some edit log corruption situations, it's useful to be able to determine the valid length of an edit log. For this JIRA we define valid as the length of the file excluding any trailing 0x00 bytes, usually left there by the preallocation done while writing. In the future this API can be extended to look at edit checksums, etc. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HDFS-1894) Add constants for LAYOUT_VERSIONs in edits log branch
[ https://issues.apache.org/jira/browse/HDFS-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon updated HDFS-1894: -- Hadoop Flags: [Reviewed] Add constants for LAYOUT_VERSIONs in edits log branch - Key: HDFS-1894 URL: https://issues.apache.org/jira/browse/HDFS-1894 Project: Hadoop HDFS Issue Type: Sub-task Components: name-node Affects Versions: Edit log branch (HDFS-1073) Reporter: Todd Lipcon Assignee: Todd Lipcon Fix For: Edit log branch (HDFS-1073) Attachments: hdfs-1894.txt, hdfs-1894.txt When merging from trunk into branch, it's pretty difficult to resolve conflicts around the layout versions, since trunk keeps swallowing whatever layout version I've picked in the branch. Adding a couple of constants will make the merges much easier. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira