from:"Hari Mankude"

Re: Next releases

2013-11-11 Thread Hari Mankude

Hi Arun,

Another feature that would be relevant and got deferred was the symlink
work (HADOOP-10020) that Colin and Andrew were working on. Can we include
this in hadoop-2.3.0 also?

thanks
hari


On Sun, Nov 10, 2013 at 2:07 PM, Alejandro Abdelnur wrote:

> Arun, thanks for jumping on this.
>
> On hadoop branch-2.2. I've quickly scanned the commit logs starting from
> the 2.2.0 release and I've found around 20 JIRAs that I like seeing in
> 2.2.1. Not all of them are bugs but the don't shake anything and improve
> usability.
>
> I presume others will have their own laundry lists as well and I wonder the
> union of all of them how much adds up to the current 81 commits.
>
> How about splitting the JIRAs among a few contributors to assert there is
> nothing risky in there? And if so get discuss getting rid of those commits
> for 2.2.1. IMO doing that would be cheaper than selectively applying
> commits on a fresh branch.
>
> Said this, I think we should get 2.2.1 out of the door before switching
> main efforts to 2.3.0. I volunteer myself to drive 2.2.1 a  release if ASAP
> if you don't have the bandwidth at the moment for it.
>
> Cheers.
>
> Alejandro
>
>
> 
> Commits in branch-2.2 that I'd like them to be in the 2.2.1 release:
>
> The ones prefixed with '*' technically are not bugs.
>
>  YARN-1284. LCE: Race condition leaves dangling cgroups entries for killed
> containers. (Alejandro Abdelnur via Sandy Ryza)
>  YARN-1265. Fair Scheduler chokes on unhealthy node reconnect (Sandy Ryza)
>  YARN-1044. used/min/max resources do not display info in the scheduler
> page (Sangjin Lee via Sandy Ryza)
>  YARN-305. Fair scheduler logs too many "Node offered to app" messages.
> (Lohit Vijayarenu via Sandy Ryza)
> *MAPREDUCE-5463. Deprecate SLOTS_MILLIS counters. (Tzuyoshi Ozawa via Sandy
> Ryza)
>  YARN-1259. In Fair Scheduler web UI, queue num pending and num active apps
> switched. (Robert Kanter via Sandy Ryza)
>  YARN-1295. In UnixLocalWrapperScriptBuilder, using bash -c can cause Text
> file busy errors. (Sandy Ryza)
> *MAPREDUCE-5457. Add a KeyOnlyTextOutputReader to enable streaming to write
> out text files without separators (Sandy Ryza)
> *YARN-1258. Allow configuring the Fair Scheduler root queue (Sandy Ryza)
> *YARN-1288. Make Fair Scheduler ACLs more user friendly (Sandy Ryza)
>  YARN-1330. Fair Scheduler: defaultQueueSchedulingPolicy does not take
> effect (Sandy Ryza)
>  HDFS-5403. WebHdfs client cannot communicate with older WebHdfs servers
> post HDFS-5306. Contributed by Aaron T. Myers.
> *YARN-1335. Move duplicate code from FSSchedulerApp and FiCaSchedulerApp
> into SchedulerApplication (Sandy Ryza)
> *YARN-1333. Support blacklisting in the Fair Scheduler (Tsuyoshi Ozawa via
> Sandy Ryza)
> *MAPREDUCE-4680. Job history cleaner should only check timestamps of files
> in old enough directories (Robert Kanter via Sandy Ryza)
>  YARN-1109. Demote NodeManager "Sending out status for container" logs to
> debug (haosdent via Sandy Ryza)
> *YARN-1321. Changed NMTokenCache to support both singleton and an instance
> usage. Contributed by Alejandro Abdelnur
>  YARN-1343. NodeManagers additions/restarts are not reported as node
> updates in AllocateResponse responses to AMs. (tucu)
>  YARN-1381. Same relaxLocality appears twice in exception message of
> AMRMClientImpl#checkLocalityRelaxationConflict() (Ted Yu via Sandy Ryza)
>  HADOOP-9898. Set SO_KEEPALIVE on all our sockets. Contributed by Todd
> Lipcon.
>  YARN-1388. Fair Scheduler page always displays blank fair share (Liyin
> Liang via Sandy Ryza)
>
>
>
> On Fri, Nov 8, 2013 at 10:35 PM, Chris Nauroth  >wrote:
>
> > Arun, what are your thoughts on test-only patches?  I know I've been
> > merging a lot of Windows test stabilization patches down to branch-2.2.
> >  These can't rightly be called blockers, but they do improve dev
> > experience, and there is no risk to product code.
> >
> > Chris Nauroth
> > Hortonworks
> > http://hortonworks.com/
> >
> >
> >
> > On Fri, Nov 8, 2013 at 1:30 AM, Steve Loughran  > >wrote:
> >
> > > On 8 November 2013 02:42, Arun C Murthy  wrote:
> > >
> > > > Gang,
> > > >
> > > >  Thinking through the next couple of releases here, appreciate f/b.
> > > >
> > > >  # hadoop-2.2.1
> > > >
> > > >  I was looking through commit logs and there is a *lot* of content
> here
> > > > (81 commits as on 11/7). Some are features/improvements and some are
> > > fixes
> > > > - it's really hard to distinguish what is important and what isn't.
> > > >
> > > >  I propose we start with a blank slate (i.e. blow away branch-2.2 and
> > > > start fresh from a copy of branch-2.2.0)  and then be very careful
> and
> > > > meticulous about including only *blocker* fixes in branch-2.2. So,
> most
> > > of
> > > > the content here comes via the next minor release (i.e. hadoop-2.3)
> > > >
> > > >  In future, we continue to be *very* parsimonious about what gets
> into

[jira] [Created] (HDFS-4873) callGetBlockLocations returns incorrect number of blocks for snapshotted files

2013-06-03 Thread Hari Mankude (JIRA)

Hari Mankude created HDFS-4873:
--

 Summary: callGetBlockLocations returns incorrect number of blocks 
for snapshotted files
 Key: HDFS-4873
 URL: https://issues.apache.org/jira/browse/HDFS-4873
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: snapshots
Affects Versions: 3.0.0
Reporter: Hari Mankude
Assignee: Jing Zhao


callGetBlockLocations() returns all the blocks of a file even when they are not 
present in the snap version

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HDFS-3205) testHANameNodesWithFederation is failing in trunk

2012-04-27 Thread Hari Mankude (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-3205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Mankude resolved HDFS-3205.


Resolution: Duplicate

This is a dup of hdfs-2960

> testHANameNodesWithFederation is failing in trunk
> -
>
> Key: HDFS-3205
> URL: https://issues.apache.org/jira/browse/HDFS-3205
> Project: Hadoop HDFS
>  Issue Type: Bug
>  Components: ha, name-node
>    Reporter: Hari Mankude
>Assignee: Hari Mankude
>Priority: Minor
>
> The test is failing with the error
> org.junit.ComparisonFailure: expected: but 
> was:

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3299) handle creation time also in journal service

2012-04-18 Thread Hari Mankude (Created) (JIRA)

handle creation time also in journal service


 Key: HDFS-3299
 URL: https://issues.apache.org/jira/browse/HDFS-3299
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3293) Implement equals for storageinfo and journainfo class.

2012-04-17 Thread Hari Mankude (Created) (JIRA)

Implement equals for storageinfo and journainfo class. 
---

 Key: HDFS-3293
 URL: https://issues.apache.org/jira/browse/HDFS-3293
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: name-node
Reporter: Hari Mankude
Priority: Minor


Implement equals for storageinfo and journalinfo class. Also journalinfo class 
needs a toString() method.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3274) Create a new journal_edits_dir key to support journal nodes

2012-04-13 Thread Hari Mankude (Created) (JIRA)

Create a new journal_edits_dir key to support journal nodes 


 Key: HDFS-3274
 URL: https://issues.apache.org/jira/browse/HDFS-3274
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude
Assignee: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HDFS-3257) Fix synchronization issues with journal service

2012-04-12 Thread Hari Mankude (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-3257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Mankude resolved HDFS-3257.


Resolution: Invalid

closing the jira since there is only one thread as per Suresh's comment.

> Fix synchronization issues with journal service
> ---
>
> Key: HDFS-3257
> URL: https://issues.apache.org/jira/browse/HDFS-3257
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: ha, name-node
>    Reporter: Hari Mankude
>Assignee: Hari Mankude
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3257) Fix synchronization issues with journal service

2012-04-11 Thread Hari Mankude (Created) (JIRA)

Fix synchronization issues with journal service
---

 Key: HDFS-3257
 URL: https://issues.apache.org/jira/browse/HDFS-3257
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude
Assignee: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3217) ZKFC should restart NN when healthmonitor gets a SERVICE_NOT_RESPONDING exception

2012-04-06 Thread Hari Mankude (Created) (JIRA)

ZKFC should restart NN when healthmonitor gets a SERVICE_NOT_RESPONDING 
exception
-

 Key: HDFS-3217
 URL: https://issues.apache.org/jira/browse/HDFS-3217
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3213) JournalDaemon (server) should persist the cluster id and nsid in the storage directory

2012-04-05 Thread Hari Mankude (Created) (JIRA)

JournalDaemon (server) should persist the cluster id and nsid in the storage 
directory
--

 Key: HDFS-3213
 URL: https://issues.apache.org/jira/browse/HDFS-3213
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3205) testHANameNodesWithFederation is failing in trunk

2012-04-05 Thread Hari Mankude (Created) (JIRA)

testHANameNodesWithFederation is failing in trunk
-

 Key: HDFS-3205
 URL: https://issues.apache.org/jira/browse/HDFS-3205
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha, name-node
Reporter: Hari Mankude
Assignee: Hari Mankude
Priority: Minor


The test is failing with the error

org.junit.ComparisonFailure: expected: but 
was:

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3192) Active NN should exit when it has not received a getServiceStatus() rpc from ZKFC for timeout secs

2012-04-03 Thread Hari Mankude (Created) (JIRA)

Active NN should exit when it has not received a getServiceStatus() rpc from 
ZKFC for timeout secs
--

 Key: HDFS-3192
 URL: https://issues.apache.org/jira/browse/HDFS-3192
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3191) Active NN should exit when it cannot write to quorum number of Journal Daemons

2012-04-03 Thread Hari Mankude (Created) (JIRA)

Active NN should exit when it cannot write to quorum number of Journal Daemons
--

 Key: HDFS-3191
 URL: https://issues.apache.org/jira/browse/HDFS-3191
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3186) Sync lagging journal service from the active journal service

2012-04-03 Thread Hari Mankude (Created) (JIRA)

Sync lagging journal service from the active journal service


 Key: HDFS-3186
 URL: https://issues.apache.org/jira/browse/HDFS-3186
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3185) Setup configuration for Journal Manager and Journal Services

2012-04-03 Thread Hari Mankude (Created) (JIRA)

Setup configuration for Journal Manager and Journal Services


 Key: HDFS-3185
 URL: https://issues.apache.org/jira/browse/HDFS-3185
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude
Assignee: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3126) Journal stream from the namenode to backup needs to have a timeout

2012-03-21 Thread Hari Mankude (Created) (JIRA)

Journal stream from the namenode to backup needs to have a timeout
--

 Key: HDFS-3126
 URL: https://issues.apache.org/jira/browse/HDFS-3126
 Project: Hadoop HDFS
  Issue Type: Sub-task
Reporter: Hari Mankude
Assignee: Hari Mankude




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-3009) DFSclient islocaladdress() can use similar routine in netutils

2012-02-24 Thread Hari Mankude (Created) (JIRA)

DFSclient islocaladdress() can use similar routine in netutils
--

 Key: HDFS-3009
 URL: https://issues.apache.org/jira/browse/HDFS-3009
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: hdfs client
Affects Versions: 0.23.0
Reporter: Hari Mankude
Assignee: Hari Mankude
Priority: Trivial


isLocalAddress() in dfsclient can use similar function in netutils

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2955) IllegalStateException during standby startup in getCurSegmentTxId

2012-02-15 Thread Hari Mankude (Created) (JIRA)

IllegalStateException during standby startup in getCurSegmentTxId
-

 Key: HDFS-2955
 URL: https://issues.apache.org/jira/browse/HDFS-2955
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude
Assignee: Hari Mankude


During standby restarts, a new routine getTransactionsSinceLastLogRoll() has 
been introduced for metrics which is calling getCurSegmentTxId(). checkstate() 
in getCurSegmentTxId() assumes that log is opened for writing and this is not 
the case in standby.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HDFS-2865) Standby namenode gets a "cannot lock storage" exception during startup

2012-02-09 Thread Hari Mankude (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-2865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Mankude resolved HDFS-2865.


Resolution: Invalid

> Standby namenode gets a "cannot lock storage" exception during startup
> --
>
> Key: HDFS-2865
> URL: https://issues.apache.org/jira/browse/HDFS-2865
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: ha, name-node
>Affects Versions: HA branch (HDFS-1623)
>Reporter: Hari Mankude
>Assignee: Hari Mankude
>
> Standby NN is restarted. This is a follow-on to hdfs-2863. In this setup, 
> dfs.edits.dir is different from dfs.shared.edits.dir. During startup, standby 
> NN fails to acquire lock on the dfs.edits.dir. If standby NN is restarted 
> again, it seems to work fine.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HDFS-2866) Standby does not start up due to a gap in transaction id

2012-02-09 Thread Hari Mankude (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-2866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Mankude resolved HDFS-2866.


Resolution: Duplicate

hdfs-2874

> Standby does not start up due to a gap in transaction id
> 
>
> Key: HDFS-2866
> URL: https://issues.apache.org/jira/browse/HDFS-2866
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: ha
>Affects Versions: HA branch (HDFS-1623)
>Reporter: Hari Mankude
>    Assignee: Hari Mankude
>Priority: Critical
>
> Standby notices a gap in the transaction id in the shared.edits directory. 
> The transactions in dfs.edits.dir does not seem to have the gap. The gap 
> happens during a failover.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2914) HA Standby stuck in safemode when shared edits directory is bounced

2012-02-07 Thread Hari Mankude (Created) (JIRA)

HA Standby stuck in safemode when shared edits directory is bounced
---

 Key: HDFS-2914
 URL: https://issues.apache.org/jira/browse/HDFS-2914
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha, name-node
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude


When shared edits dir is bounced, standby NN is put into safemode by the 
NameNodeResourceMonitor(). However, there is no path for it to exit out of safe 
mode when shared edits dir reappears.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HDFS-2903) NPE if shared edits directory is not available during failover

2012-02-07 Thread Hari Mankude (Resolved) (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-2903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Mankude resolved HDFS-2903.


Resolution: Duplicate

Dup of HDFS-2905

> NPE if shared edits directory is not available during failover
> --
>
> Key: HDFS-2903
> URL: https://issues.apache.org/jira/browse/HDFS-2903
> Project: Hadoop HDFS
>  Issue Type: Sub-task
>  Components: ha, name-node
>Affects Versions: HA branch (HDFS-1623)
>Reporter: Hari Mankude
>    Assignee: Hari Mankude
>
> Journalset has a null pointer exception if shared edits directory is not 
> available during HA state transition.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2903) NPE if shared edits directory is not available during failover

2012-02-06 Thread Hari Mankude (Created) (JIRA)

NPE if shared edits directory is not available during failover
--

 Key: HDFS-2903
 URL: https://issues.apache.org/jira/browse/HDFS-2903
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude
Assignee: Hari Mankude


Journalset has a null pointer exception if shared edits directory is not 
available during HA state transition.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2866) Standby does not start up due to a gap in transaction id

2012-01-31 Thread Hari Mankude (Created) (JIRA)

Standby does not start up due to a gap in transaction id


 Key: HDFS-2866
 URL: https://issues.apache.org/jira/browse/HDFS-2866
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude
Priority: Critical


Standby notices a gap in the transaction id in the shared.edits directory. The 
transactions in dfs.edits.dir does not seem to have the gap. The gap happens 
during a failover.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2865) Standby namenode gets a "cannot lock storage" exception during startup

2012-01-31 Thread Hari Mankude (Created) (JIRA)

Standby namenode gets a "cannot lock storage" exception during startup
--

 Key: HDFS-2865
 URL: https://issues.apache.org/jira/browse/HDFS-2865
 Project: Hadoop HDFS
  Issue Type: Bug
Reporter: Hari Mankude
Assignee: Hari Mankude


Standby NN is restarted. This is a follow-on to hdfs-2863. In this setup, 
dfs.edits.dir is different from dfs.shared.edits.dir. During startup, standby 
NN fails to acquire lock on the dfs.edits.dir. If standby NN is restarted 
again, it seems to work fine.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Reopened] (HDFS-2802) Support for RW/RO snapshots in HDFS

2012-01-27 Thread Hari Mankude (Reopened) (JIRA)


 [ 
https://issues.apache.org/jira/browse/HDFS-2802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hari Mankude reopened HDFS-2802:



I (We) are well aware of hdfs-233. This jira was opened to provide 
comprehensive snapshot solution (both RW/RO support) for HDFS. 

> Support for RW/RO snapshots in HDFS
> ---
>
> Key: HDFS-2802
> URL: https://issues.apache.org/jira/browse/HDFS-2802
> Project: Hadoop HDFS
>  Issue Type: New Feature
>  Components: name-node
>    Reporter: Hari Mankude
>Assignee: Hari Mankude
> Fix For: 0.24.0
>
>
> Snapshots are point in time images of parts of the filesystem or the entire 
> filesystem. Snapshots can be a read-only or a read-write point in time copy 
> of the filesystem. There are several use cases for snapshots in HDFS. I will 
> post a detailed write-up soon with with more information.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2802) Support for RW/RO snapshots in HDFS

2012-01-17 Thread Hari Mankude (Created) (JIRA)

Support for RW/RO snapshots in HDFS
---

 Key: HDFS-2802
 URL: https://issues.apache.org/jira/browse/HDFS-2802
 Project: Hadoop HDFS
  Issue Type: New Feature
  Components: name-node
Reporter: Hari Mankude
Assignee: Hari Mankude
 Fix For: 0.24.0


Snapshots are point in time images of parts of the filesystem or the entire 
filesystem. Snapshots can be a read-only or a read-write point in time copy of 
the filesystem. There are several use cases for snapshots in HDFS. I will post 
a detailed write-up soon with with more information.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2783) HDFS jira for changes related to HADOOP-7970

2012-01-11 Thread Hari Mankude (Created) (JIRA)

HDFS jira for changes related to HADOOP-7970


 Key: HDFS-2783
 URL: https://issues.apache.org/jira/browse/HDFS-2783
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude
Assignee: Hari Mankude
Priority: Minor


Create a separate hdfs patch for HDFS changes related to Hadoop-7970 to avoid 
test failures.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2780) All HAServiceProtocol methods have to throw IOException since they are invoked over the network

2012-01-11 Thread Hari Mankude (Created) (JIRA)

All HAServiceProtocol methods have to throw IOException since they are invoked 
over the network
---

 Key: HDFS-2780
 URL: https://issues.apache.org/jira/browse/HDFS-2780
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude
Assignee: Hari Mankude


HAServiceProtocol methods have to throw ioexception in addition to other 
exceptions since they are RPCs which are invoked over the network.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2756) Warm standby does not read the in_progress edit log

2012-01-05 Thread Hari Mankude (Created) (JIRA)

Warm standby does not read the in_progress edit log 


 Key: HDFS-2756
 URL: https://issues.apache.org/jira/browse/HDFS-2756
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: ha
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude


Warm standby does not read the in_progress edit log. This could result in 
standby taking a long time to become the primary during a failover scenario.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2753) Standby namenode stuck in safenode during a failover

2012-01-05 Thread Hari Mankude (Created) (JIRA)

Standby namenode stuck in safenode during a failover


 Key: HDFS-2753
 URL: https://issues.apache.org/jira/browse/HDFS-2753
 Project: Hadoop HDFS
  Issue Type: Bug
  Components: name-node
Affects Versions: HA branch (HDFS-1623)
Reporter: Hari Mankude
Assignee: Hari Mankude


teragen is run to generate write traffic. A manual failover is initiated by 
killing namenode process. namenode which was killed is then restarted and it 
comes up as a standby. However, the standby never exits out of the safemode. 
Looking at the standby logs, it looks like the standby namenode gets 
addStoredBlock immediately after data node registration which results in 
numBlocks getting incremented. There is an optimization in processReport() 
which causes the follow on complete block report to be ignored when NN is in 
safemode. So, NN never exits out of the safemode.

2012-01-05 18:57:46,030 INFO  hdfs.StateChange 
(DatanodeManager.java:registerDatanode(573)) - BLOCK* 
NameSystem.registerDatanode: node registration from 98.137.233.235:50010 
storage DS-526656430-98.137.233.235-50010-1325723536492
2012-01-05 18:57:46,033 INFO  net.NetworkTopology 
(NetworkTopology.java:add(344)) - Adding a new node: 
/default-rack/98.137.233.235:50010
2012-01-05 18:57:46,033 INFO  namenode.FSNamesystem 
(FSNamesystem.java:checkMode(3411)) - DID NOT call initialize at 1
2012-01-05 18:57:46,034 INFO  hdfs.StateChange 
(DatanodeManager.java:registerDatanode(573)) - BLOCK* 
NameSystem.registerDatanode: node registration from 98.137.233.237:50010 
storage DS-1961520590-98.137.233.237-50010-1325725253057
2012-01-05 18:57:46,034 INFO  net.NetworkTopology 
(NetworkTopology.java:add(344)) - Adding a new node: 
/default-rack/98.137.233.237:50010
2012-01-05 18:57:46,042 INFO  namenode.FSNamesystem 
(FSNamesystem.java:checkMode(3411)) - DID NOT call initialize at 1
2012-01-05 18:57:46,045 INFO  hdfs.StateChange 
(BlockManager.java:addStoredBlock(1775)) - BLOCK* addStoredBlock: blockMap 
updated: 98.137.233.235:50010 is added to 
blk_-3183325095022454724_1172{blockUCState=UNDER_CONSTRUCTION, 
primaryNodeIndex=-1, 
replicas=[ReplicaUnderConstruction[98.137.233.235:50010|FINALIZED]]} size 0
2012-01-05 18:57:46,046 INFO  hdfs.StateChange 
(BlockManager.java:addStoredBlock(1775)) - BLOCK* addStoredBlock: blockMap 
updated: 98.137.233.235:50010 is added to 
blk_5617057825952660916_1173{blockUCState=UNDER_CONSTRUCTION, 
primaryNodeIndex=-1, 
replicas=[ReplicaUnderConstruction[98.137.233.235:50010|FINALIZED]]} size 0
2012-01-05 18:57:46,046 INFO  hdfs.StateChange 
(BlockManager.java:addStoredBlock(1775)) - BLOCK* addStoredBlock: blockMap 
updated: 98.137.233.237:50010 is added to 
blk_-3183325095022454724_1172{blockUCState=UNDER_CONSTRUCTION, 
primaryNodeIndex=-1, 
replicas=[ReplicaUnderConstruction[98.137.233.235:50010|FINALIZED], 
ReplicaUnderConstruction[98.137.233.237:50010|FINALIZED]]} size 0
2012-01-05 18:57:46,046 INFO  hdfs.StateChange 
(BlockManager.java:addStoredBlock(1775)) - BLOCK* addStoredBlock: blockMap 
updated: 98.137.233.237:50010 is added to 
blk_5617057825952660916_1173{blockUCState=UNDER_CONSTRUCTION, 
primaryNodeIndex=-1, 
replicas=[ReplicaUnderConstruction[98.137.233.235:50010|FINALIZED], 
ReplicaUnderConstruction[98.137.233.237:50010|FINALIZED]]} size 0
2012-01-05 18:57:46,049 INFO  hdfs.StateChange 
(BlockManager.java:processReport(1365)) - BLOCK* processReport: discarded 
non-initial block report from 98.137.233.235:50010 because namenode still in 
startup phase
2012-01-05 18:57:46,049 INFO  hdfs.StateChange 
(BlockManager.java:processReport(1365)) - BLOCK* processReport: discarded 
non-initial block report from 98.137.233.237:50010 because namenode still in 
startup phase
2012-01-05 18:58:05,167 INFO  namenode.NameNode 
(NameNodeRpcServer.java:blockReceivedAndDeleted(894)) - Required GS=1175, 
Queuing blockReceivedAndDeleted message
2012-01-05 18:58:05,168 INFO  namenode.NameNode 
(NameNodeRpcServer.java:blockReceivedAndDeleted(894)) - Required GS=1175, 
Queuing blockReceivedAndDeleted message
2012-01-05 18:58:06,634 INFO  namenode.NameNode 
(NameNodeRpcServer.java:blockReceivedAndDeleted(894)) - Required GS=1176, 
Queuing blockReceivedAndDeleted message
2012-01-05 18:58:06,636 INFO  namenode.NameNode 
(NameNodeRpcServer.java:blockReceivedAndDeleted(894)) - Required GS=1176, 
Queuing blockReceivedAndDeleted message
2012-01-05 18:58:08,097 INFO  namenode.NameNode 
(NameNodeRpcServer.java:blockReceivedAndDeleted(894)) - Required GS=1177, 
Queuing blockReceivedAndDeleted message
2012-01-05 18:58:08,097 INFO  namenode.NameNode 
(NameNodeRpcServer.java:blockReceivedAndDeleted(894)) - Required GS=1177, 
Queuing blockReceivedAndDeleted message

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira

[jira] [Created] (HDFS-2650) Replace @inheritDoc with @Override

2011-12-09 Thread Hari Mankude (Created) (JIRA)

Replace @inheritDoc with @Override 
---

 Key: HDFS-2650
 URL: https://issues.apache.org/jira/browse/HDFS-2650
 Project: Hadoop HDFS
  Issue Type: Improvement
Reporter: Hari Mankude
Assignee: Hari Mankude
Priority: Minor
 Fix For: 0.24.0


@Override provides both javadoc from superclass and compile time detection of 
overridden method deletion from superclass.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HDFS-2564) Cleanup unnecessary exceptions thrown and unnecessary casts

2011-11-18 Thread Hari Mankude (Created) (JIRA)

Cleanup unnecessary exceptions thrown and unnecessary casts
---

 Key: HDFS-2564
 URL: https://issues.apache.org/jira/browse/HDFS-2564
 Project: Hadoop HDFS
  Issue Type: Improvement
  Components: data-node, hdfs client, name-node
Affects Versions: 0.24.0
Reporter: Hari Mankude
Priority: Minor
 Fix For: 0.24.0
 Attachments: HDFS-2564.txt

Cleaning up some of the java files with unnecessary exceptions and casts.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

Re: Next releases

[jira] [Created] (HDFS-4873) callGetBlockLocations returns incorrect number of blocks for snapshotted files

[jira] [Resolved] (HDFS-3205) testHANameNodesWithFederation is failing in trunk

[jira] [Created] (HDFS-3299) handle creation time also in journal service

[jira] [Created] (HDFS-3293) Implement equals for storageinfo and journainfo class.

[jira] [Created] (HDFS-3274) Create a new journal_edits_dir key to support journal nodes

[jira] [Resolved] (HDFS-3257) Fix synchronization issues with journal service

[jira] [Created] (HDFS-3257) Fix synchronization issues with journal service

[jira] [Created] (HDFS-3217) ZKFC should restart NN when healthmonitor gets a SERVICE_NOT_RESPONDING exception

[jira] [Created] (HDFS-3213) JournalDaemon (server) should persist the cluster id and nsid in the storage directory

[jira] [Created] (HDFS-3205) testHANameNodesWithFederation is failing in trunk

[jira] [Created] (HDFS-3192) Active NN should exit when it has not received a getServiceStatus() rpc from ZKFC for timeout secs

[jira] [Created] (HDFS-3191) Active NN should exit when it cannot write to quorum number of Journal Daemons

[jira] [Created] (HDFS-3186) Sync lagging journal service from the active journal service

[jira] [Created] (HDFS-3185) Setup configuration for Journal Manager and Journal Services

[jira] [Created] (HDFS-3126) Journal stream from the namenode to backup needs to have a timeout

[jira] [Created] (HDFS-3009) DFSclient islocaladdress() can use similar routine in netutils

[jira] [Created] (HDFS-2955) IllegalStateException during standby startup in getCurSegmentTxId

[jira] [Resolved] (HDFS-2865) Standby namenode gets a "cannot lock storage" exception during startup

[jira] [Resolved] (HDFS-2866) Standby does not start up due to a gap in transaction id

[jira] [Created] (HDFS-2914) HA Standby stuck in safemode when shared edits directory is bounced

[jira] [Resolved] (HDFS-2903) NPE if shared edits directory is not available during failover

[jira] [Created] (HDFS-2903) NPE if shared edits directory is not available during failover

[jira] [Created] (HDFS-2866) Standby does not start up due to a gap in transaction id

[jira] [Created] (HDFS-2865) Standby namenode gets a "cannot lock storage" exception during startup

[jira] [Reopened] (HDFS-2802) Support for RW/RO snapshots in HDFS

[jira] [Created] (HDFS-2802) Support for RW/RO snapshots in HDFS

[jira] [Created] (HDFS-2783) HDFS jira for changes related to HADOOP-7970

[jira] [Created] (HDFS-2780) All HAServiceProtocol methods have to throw IOException since they are invoked over the network

[jira] [Created] (HDFS-2756) Warm standby does not read the in_progress edit log

[jira] [Created] (HDFS-2753) Standby namenode stuck in safenode during a failover

[jira] [Created] (HDFS-2650) Replace @inheritDoc with @Override

[jira] [Created] (HDFS-2564) Cleanup unnecessary exceptions thrown and unnecessary casts

33 matches

Site Navigation

Mail list logo

Footer information