[jira] [Updated] (HBASE-11096) stop method of Master and RegionServer coprocessor is not invoked

2014-05-12 Thread Qiang Tian (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Qiang Tian updated HBASE-11096:
---

Status: Open  (was: Patch Available)

> stop method of Master and RegionServer coprocessor  is not invoked
> --
>
> Key: HBASE-11096
> URL: https://issues.apache.org/jira/browse/HBASE-11096
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.94.19, 0.98.1, 0.96.2
>Reporter: Qiang Tian
>Assignee: Qiang Tian
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-11096-trunk-v0.patch, HBASE-11096-trunk-v0.patch, 
> HBASE-11096-trunk-v1.patch, HBASE-11096-trunk-v2.patch
>
>
> the stop method of coprocessor specified by 
> "hbase.coprocessor.master.classes" and  
> "hbase.coprocessor.regionserver.classes"  is not invoked.
> If coprocessor allocates OS resources, it could cause master/regionserver 
> resource leak or hang during exit.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-9740) A corrupt HFile could cause endless attempts to assign the region without a chance of success

2014-05-12 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-9740:
-

   Resolution: Won't Fix
Fix Version/s: (was: 0.94.20)
   Status: Resolved  (was: Patch Available)

After pushing this around for a few releases... Do we need this in 0.94? It's 
not clear how anybody would find out about these regions. Currently it is clear 
that something is wrong.
We can resurrect if it is important to have this in 0.94.

> A corrupt HFile could cause endless attempts to assign the region without a 
> chance of success
> -
>
> Key: HBASE-9740
> URL: https://issues.apache.org/jira/browse/HBASE-9740
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.94.16
>Reporter: Aditya Kishore
>Assignee: Ping
> Attachments: HBase-9740_0.94_v4.patch, HBase-9749_0.94_v2.patch, 
> HBase-9749_0.94_v3.patch, patch-9740_0.94.txt
>
>
> As described in HBASE-9737, a corrupt HFile in a region could lead to an 
> assignment storm in the cluster since the Master will keep trying to assign 
> the region to each region server one after another and obviously none will 
> succeed.
> The region server, upon detecting such a scenario should mark the region as 
> "RS_ZK_REGION_FAILED_ERROR" (or something to the effect) in the Zookeeper 
> which should indicate the Master to stop assigning the region until the error 
> has been resolved (via an HBase shell command, probably "assign"?)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10936) Add zeroByte encoding test

2014-05-12 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-10936:
--

Summary: Add zeroByte encoding test  (was: Port zeroByte encoding test from 
parent.)

> Add zeroByte encoding test
> --
>
> Key: HBASE-10936
> URL: https://issues.apache.org/jira/browse/HBASE-10936
> Project: HBase
>  Issue Type: Sub-task
>  Components: test
>Reporter: Lars Hofhansl
>Priority: Minor
> Fix For: 0.96.3, 0.94.20
>
> Attachments: 10936-0.94.txt, 10936-0.96.txt, 10936-0.98.txt
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-11143:
--

Fix Version/s: (was: 0.96.3)

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-10915) Decouple region closing (HM and HRS) from ZK

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13993467#comment-13993467
 ] 

Hadoop QA commented on HBASE-10915:
---

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644074/HBASE-10915.patch
  against trunk revision .
  ATTACHMENT ID: 12644074

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9490//console

This message is automatically generated.

> Decouple region closing (HM and HRS) from ZK
> 
>
> Key: HBASE-10915
> URL: https://issues.apache.org/jira/browse/HBASE-10915
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Affects Versions: 0.99.0
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Attachments: HBASE-10915.patch, HBASE-10915.patch, HBASE-10915.patch, 
> HBASE-10915.patch, HBASE-10915.patch, HBASE-10915.patch, HBASE-10915.patch, 
> HBASE-10915.patch, HBASE-10915.patch, HBASE-10915.patch, HBASE-10915.patch, 
> HBASE-10915.patch, HBASE-10915.patch
>
>
> Decouple region closing from ZK. 
> Includes RS side (CloseRegionHandler), HM side (ClosedRegionHandler) and the 
> code using (HRegionServer, RSRpcServices etc).
> May need small changes in AssignmentManager.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-7912) HBase Backup/Restore Based on HBase Snapshot

2014-05-12 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-7912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996086#comment-13996086
 ] 

stack commented on HBASE-7912:
--

Very nice.

This doc. with perhaps a little more commentary like it could go into the hbase 
refguide when this feature is committed?

https://issues.apache.org/jira/secure/attachment/12638283/HBase_BackupRestore-Jira-7912-CLI-v1.pdf

We should add a 'backups' chapter to hbase refguide when this comes in?

>From design doc:

bq. We execute roll log across region servers to track the WALs that need to be 
in the backup. 

Late question.  How we ensure the WAL is not moved aside or even deleted 
between time of snapshot and copy aside?

bq. We’ll convert/replay the backed-up Hlogs into HFiles for fast incremental 
restore.

This is interesting.  It is done against a cluster or it is just a MR job/tool?

Looks very nice.

Have you lads had much chance testing it out?

What needs to go in first?  What should we review first?

Thanks lads.




> HBase Backup/Restore Based on HBase Snapshot
> 
>
> Key: HBASE-7912
> URL: https://issues.apache.org/jira/browse/HBASE-7912
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Richard Ding
>Assignee: Richard Ding
> Attachments: HBaseBackupRestore-Jira-7912-DesignDoc-v1.pdf, 
> HBase_BackupRestore-Jira-7912-CLI-v1.pdf
>
>
> Finally, we completed the implementation of our backup/restore solution, and 
> would like to share with community through this jira. 
> We are leveraging existing hbase snapshot feature, and provide a general 
> solution to common users. Our full backup is using snapshot to capture 
> metadata locally and using exportsnapshot to move data to another cluster; 
> the incremental backup is using offline-WALplayer to backup HLogs; we also 
> leverage global distribution rolllog and flush to improve performance; other 
> added-on values such as convert, merge, progress report, and CLI commands. So 
> that a common user can backup hbase data without in-depth knowledge of hbase. 
>  Our solution also contains some usability features for enterprise users. 
> The detail design document and CLI command will be attached in this jira. We 
> plan to use 10~12 subtasks to share each of the following features, and 
> document the detail implement in the subtasks: 
> * *Full Backup* : provide local and remote back/restore for a list of tables
> * *offline-WALPlayer* to convert HLog to HFiles offline (for incremental 
> backup)
> * *distributed* Logroll and distributed flush 
> * Backup *Manifest* and history
> * *Incremental* backup: to build on top of full backup as daily/weekly backup 
> * *Convert*  incremental backup WAL files into hfiles
> * *Merge* several backup images into one(like merge weekly into monthly)
> * *add and remove* table to and from Backup image
> * *Cancel* a backup process
> * backup progress *status*
> * full backup based on *existing snapshot*
> *-*
> *Below is the original description, to keep here as the history for the 
> design and discussion back in 2013*
> There have been attempts in the past to come up with a viable HBase 
> backup/restore solution (e.g., HBASE-4618).  Recently, there are many 
> advancements and new features in HBase, for example, FileLink, Snapshot, and 
> Distributed Barrier Procedure. This is a proposal for a backup/restore 
> solution that utilizes these new features to achieve better performance and 
> consistency. 
>  
> A common practice of backup and restore in database is to first take full 
> baseline backup, and then periodically take incremental backup that capture 
> the changes since the full baseline backup. HBase cluster can store massive 
> amount data.  Combination of full backups with incremental backups has 
> tremendous benefit for HBase as well.  The following is a typical scenario 
> for full and incremental backup.
> # The user takes a full backup of a table or a set of tables in HBase. 
> # The user schedules periodical incremental backups to capture the changes 
> from the full backup, or from last incremental backup.
> # The user needs to restore table data to a past point of time.
> # The full backup is restored to the table(s) or to different table name(s).  
> Then the incremental backups that are up to the desired point in time are 
> applied on top of the full backup. 
> We would support the following key features and capabilities.
> * Full backup uses HBase snapshot to capture HFiles.
> * Use HBase WALs to capture incremental changes, but we use bulk load of 
> HFiles for fast incremental restore.
> * Support single table or a set of tables, and column family level backup and 
> restore.
> * Restore to different table 

[jira] [Commented] (HBASE-11140) LocalHBaseCluster should create ConsensusProvider per each server

2014-05-12 Thread Mikhail Antonov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995748#comment-13995748
 ] 

Mikhail Antonov commented on HBASE-11140:
-

Thanks [~stack]

> LocalHBaseCluster should create ConsensusProvider per each server
> -
>
> Key: HBASE-11140
> URL: https://issues.apache.org/jira/browse/HBASE-11140
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus
>Affects Versions: 0.99.0
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Fix For: 0.99.0
>
> Attachments: HBASE-11140.patch
>
>
> Right now there's a bug there when single ConsensusProvider instance is 
> shared across all threads running region servers and masters within 
> processes, which breaks certain tests in patches which used to pass 
> successfully before.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11069) Decouple region merging from ZooKeeper

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996045#comment-13996045
 ] 

Hadoop QA commented on HBASE-11069:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644525/HBASE_11069-v2.patch
  against trunk revision .
  ATTACHMENT ID: 12644525

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 lineLengths{color}.  The patch introduces the following lines 
longer than 100:
+  rmd = server != null && server.getConsensusProvider() != null ? 
((BaseConsensusProvider) server
+  public void prepareForMerge(Server server, RegionServerServices services, 
HRegionInfo mergedRegionInfo,
+  public void prepareForMerge(Server server, RegionServerServices services, 
HRegionInfo mergedRegionInfo,
+   * 
org.apache.hadoop.hbase.regionserver.consensus.RegionMergeConsensus#finishRegionMergeTransaction

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9510//console

This message is automatically generated.

> Decouple region merging from ZooKeeper
> --
>
> Key: HBASE-11069
> URL: https://issues.apache.org/jira/browse/HBASE-11069
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-11069.patch, HBASE_11069-v2.patch
>
>
> Region Merge should be decoupled from ZK. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11151) move tracing modules from hbase-server to hbase-common

2014-05-12 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996034#comment-13996034
 ] 

Nick Dimiduk commented on HBASE-11151:
--

Yep, sounds about right. Thanks.

> move tracing modules from hbase-server to hbase-common
> --
>
> Key: HBASE-11151
> URL: https://issues.apache.org/jira/browse/HBASE-11151
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation, util
>Reporter: Masatake Iwasaki
>Assignee: Masatake Iwasaki
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11151-0.patch
>
>
> Not only servers but also clients using tracing needs SpanReceiverHost.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (HBASE-1990) Add methods accepting strings for family/qualifier in client

2014-05-12 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-1990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell resolved HBASE-1990.
---

Resolution: Won't Fix

Not wanted badly enough I'd say

> Add methods accepting strings for family/qualifier in client 
> -
>
> Key: HBASE-1990
> URL: https://issues.apache.org/jira/browse/HBASE-1990
> Project: HBase
>  Issue Type: Improvement
>  Components: Client
>Affects Versions: 0.20.0
>Reporter: Doug Meil
>Priority: Minor
> Attachments: TestHTableGenerics.java
>
>
> Consider the following client code...
>   byte b[] = result.getValue( Bytes.toBytes("family"), 
> Bytes.toBytes("qualifier") );
> put.add( Bytes.toBytes("family"), Bytes.toBytes("qualifer"), 
> Bytes.toBytes( "value")  );
> ... the requirement to supply family and qualifiers as bytes causes code to 
> get cluttered and verbose.  At worst, it scares peoples un-necessarily about 
> HBase development, and at best, developers inevitably will get tired of doing 
> all this casting and then add their own wrapper classes around the HBase 
> client to make their code more readable.
> I would like to see something like this in the API...
>   byte b[] = result.getValue( "family"), "qualifier" );
> put.add( "family", "qualifer", Bytes.toBytes( "value")  );
> ... where the Hbase client can perform the required Bytes.toBytes() 
> conversion behind the scenes.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-10504) Define Replication Interface

2014-05-12 Thread Jean-Daniel Cryans (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995685#comment-13995685
 ] 

Jean-Daniel Cryans commented on HBASE-10504:


[~enis] So IIUC you are going a completely different way than what was 
discussed before, right? I don't mind it but I want to make sure that we're on 
the same track.

So instead of having people define their sinks, they instead put that code in 
the ReplicationConsumer. Seems like a good idea, you don't have to mess around 
getting a fake cluster up and running, among other issues. Might still be nice 
to offer some tools like removeNonReplicableEdits(), the basic metrics the 
handling of a disabled peer, etc, for the other consumers.

> Define Replication Interface
> 
>
> Key: HBASE-10504
> URL: https://issues.apache.org/jira/browse/HBASE-10504
> Project: HBase
>  Issue Type: Task
>Reporter: stack
>Assignee: stack
>Priority: Blocker
> Fix For: 0.99.0
>
> Attachments: hbase-10504_wip1.patch
>
>
> HBase has replication.  Fellas have been hijacking the replication apis to do 
> all kinds of perverse stuff like indexing hbase content (hbase-indexer 
> https://github.com/NGDATA/hbase-indexer) and our [~toffer] just showed up w/ 
> overrides that replicate via an alternate channel (over a secure thrift 
> channel between dcs over on HBASE-9360).  This issue is about surfacing these 
> APIs as public with guarantees to downstreamers similar to those we have on 
> our public client-facing APIs (and so we don't break them for downstreamers).
> Any input [~phunt] or [~gabriel.reid] or [~toffer]?
> Thanks.
>  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13996046#comment-13996046
 ] 

Jeffrey Zhong commented on HBASE-11135:
---

{quote}
I am not going to migrate your patch on top of mine. I don't know the 
unification project well enough
{quote}
No. Sorry for the confusion. I mean I'll migrate my patch on top of yours and 
run tests.

{quote}
Should I commit it?
{quote}
Sure. +1. Thanks.

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt, 11135v5.txt, 
> 11135v5.txt, 11135v6.txt, 11135v7.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then if 
> async or no sync, we will just let the threads return ... updating mvcc just 
> before we let them go.  All other calls will go up on to the second ring 
> buffer to be serviced as now (batching, distribution out among the sync'ing 
> threads).  The first rb will have no friction and should turn at fast rates 
> compared to the second.  There should not be noticeable slowdown nor do I 
> foresee this refactor intefering w/ our multi-WAL plans.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11123) Upgrade instructions from 0.94 to 0.98

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11123:


Status: Patch Available  (was: Open)

> Upgrade instructions from 0.94 to 0.98
> --
>
> Key: HBASE-11123
> URL: https://issues.apache.org/jira/browse/HBASE-11123
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
>Priority: Minor
> Attachments: HBASE-11123.patch
>
>
> I cloned this from the 0.96 upgrade docs task. It was suggested that we need 
> upgrade instructions from 0.94 to 0.98. I will need source material to even 
> prioritize this. Assuming this is Minor.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11123) Upgrade instructions from 0.94 to 0.98

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11123:


Attachment: HBASE-11123.patch

Added upgrade info from 0.94 to 0.98 and fixed the "TODO" item in the 0.98 
section.

> Upgrade instructions from 0.94 to 0.98
> --
>
> Key: HBASE-11123
> URL: https://issues.apache.org/jira/browse/HBASE-11123
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Priority: Minor
> Attachments: HBASE-11123.patch
>
>
> I cloned this from the 0.96 upgrade docs task. It was suggested that we need 
> upgrade instructions from 0.94 to 0.98. I will need source material to even 
> prioritize this. Assuming this is Minor.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11119) Update ExportSnapShot to optionally not use a tmp file on external file system

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995884#comment-13995884
 ] 

Hudson commented on HBASE-9:


SUCCESS: Integrated in hbase-0.96-hadoop2 #276 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/276/])
HBASE-9 Update ExportSnapShot to optionally not use a tmp file on external 
file system (Ted Malaska) (mbertozzi: rev 1593339)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java
* 
/hbase/branches/0.96/hbase-server/src/test/java/org/apache/hadoop/hbase/snapshot/TestExportSnapshot.java


> Update ExportSnapShot to optionally not use a tmp file on external file system
> --
>
> Key: HBASE-9
> URL: https://issues.apache.org/jira/browse/HBASE-9
> Project: HBase
>  Issue Type: Improvement
>  Components: snapshots
>Reporter: Ted Malaska
>Assignee: Ted Malaska
>Priority: Minor
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-9.patch
>
>
> There are FileSystem like S3 where renaming is extremely expensive.  This 
> patch will add a parameter that says something like
> use.tmp.folder
> It will be defaulted to true.  So default behavior is the same.  If false is 
> set them the files will land in the final destination with no need for a 
> rename. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11128) Add -target option to ExportSnapshot to export with a different name

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995979#comment-13995979
 ] 

Hudson commented on HBASE-11128:


SUCCESS: Integrated in HBase-0.94-JDK7 #134 (See 
[https://builds.apache.org/job/HBase-0.94-JDK7/134/])
HBASE-11128 Add -target option to ExportSnapshot to export with a different 
name (mbertozzi: rev 1593778)
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/snapshot/TestExportSnapshot.java


> Add -target option to ExportSnapshot to export with a different name
> 
>
> Key: HBASE-11128
> URL: https://issues.apache.org/jira/browse/HBASE-11128
> Project: HBase
>  Issue Type: Improvement
>  Components: snapshots
>Affects Versions: 0.99.0
>Reporter: Matteo Bertozzi
>Assignee: Matteo Bertozzi
>Priority: Trivial
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-11128-v0.patch
>
>
> Add a "-target" option to export the snapshot using a different name



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-11135:
--

Attachment: 11135v7.txt

Fix findbugs and javadoc.  The failure is odd (says only one replica).  Can't 
repro down here.  Retry.

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt, 11135v5.txt, 
> 11135v5.txt, 11135v6.txt, 11135v7.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then if 
> async or no sync, we will just let the threads return ... updating mvcc just 
> before we let them go.  All other calls will go up on to the second ring 
> buffer to be serviced as now (batching, distribution out among the sync'ing 
> threads).  The first rb will have no friction and should turn at fast rates 
> compared to the second.  There should not be noticeable slowdown nor do I 
> foresee this refactor intefering w/ our multi-WAL plans.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11069) Decouple region merging from ZooKeeper

2014-05-12 Thread Sergey Soldatov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995872#comment-13995872
 ] 

Sergey Soldatov commented on HBASE-11069:
-

Could someone add me to the project contributors, so I will be assign it to me?

> Decouple region merging from ZooKeeper
> --
>
> Key: HBASE-11069
> URL: https://issues.apache.org/jira/browse/HBASE-11069
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-11069.patch, HBASE_11069-v2.patch
>
>
> Region Merge should be decoupled from ZK. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10985) Decouple Split Transaction from Zookeeper

2014-05-12 Thread Sergey Soldatov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Soldatov updated HBASE-10985:


Status: Open  (was: Patch Available)

> Decouple Split Transaction from Zookeeper
> -
>
> Key: HBASE-10985
> URL: https://issues.apache.org/jira/browse/HBASE-10985
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-10985.patch, HBASE-10985.patch, HBASE-10985.patch, 
> HBASE_10985-v2.patch, HBASE_10985-v3.patch, HBASE_10985-v4.patch
>
>
> As part of  HBASE-10296 SplitTransaction should be decoupled from Zookeeper. 
> This is an initial patch for review. At the moment the consensus provider  
> placed directly to SplitTransaction to minimize affected code. In the ideal 
> world it should be done in HServer.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995922#comment-13995922
 ] 

Misty Stanley-Jones commented on HBASE-11154:
-

I think the primary value-proposition for reverse scan might belong here, as 
"One time when you don't need to use secondary indexes anymore": 
http://hbase.apache.org/book.html#secondary.indexes

6.11.1.3 here is also a good place to have a note about the new functionality: 
http://hbase.apache.org/book.html#schema.casestudies

Maybe something here too: http://hbase.apache.org/book.html#reverse.timestamp

Or maybe I need a whole more abstract session about scanners.

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-10942) support parallel request cancellation for multi-get

2014-05-12 Thread Sergey Shelukhin (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995631#comment-13995631
 ] 

Sergey Shelukhin commented on HBASE-10942:
--

About config - I think it's important to have way to turn off any new feature 
in case there's a bug in it... otherwise making new build and redeploying a 
cluster can be very costly for whoever is maintaining it. 
Wrt confusion, I'll rename variables. What do you mean by having 
CompletionService i.e. what would we use it for

> support parallel request cancellation for multi-get
> ---
>
> Key: HBASE-10942
> URL: https://issues.apache.org/jira/browse/HBASE-10942
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Sergey Shelukhin
>Assignee: Sergey Shelukhin
> Fix For: hbase-10070
>
> Attachments: HBASE-10942.01.patch, HBASE-10942.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10909) Abstract out ZooKeeper usage in HBase

2014-05-12 Thread Mikhail Antonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Antonov updated HBASE-10909:


Attachment: HBaseConsensus.pdf

> Abstract out ZooKeeper usage in HBase
> -
>
> Key: HBASE-10909
> URL: https://issues.apache.org/jira/browse/HBASE-10909
> Project: HBase
>  Issue Type: Umbrella
>  Components: Consensus, Zookeeper
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Attachments: HBaseConsensus.pdf, HBaseConsensus.pdf, 
> HBaseConsensus.pdf
>
>
> As some sort of follow-up or initial step towards HBASE-10296.
> Whatever consensus algorithm/library may be the chosen, perhaps one of first 
> practical steps towards this goal would be to better abstract ZK-related API 
> and details, which are now throughout the codebase (mostly leaked throuth 
> ZkUtil, ZooKeeperWatcher and listeners).
> This jira is umbrella for relevant subtasks.
> As the design doc is in process of peer-review now, please use Google Doc 
> (linked below) instead of pdf.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11154:


Fix Version/s: 0.98.3
   Status: Patch Available  (was: Open)

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
> Fix For: 0.98.3
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11148) Provide a distributed procedure to globally roll logs

2014-05-12 Thread Jerry He (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995877#comment-13995877
 ] 

Jerry He commented on HBASE-11148:
--

[~stack]
I will provide a patch shortly.

> Provide a distributed procedure to globally roll logs
> -
>
> Key: HBASE-11148
> URL: https://issues.apache.org/jira/browse/HBASE-11148
> Project: HBase
>  Issue Type: New Feature
>Reporter: Jerry He
> Fix For: 0.99.0
>
>
> Propose a distributed procedure here to globally roll logs.
> Currently HBaseAdmin and HBase shell provides a way to roll the WAL on a 
> single RS.
> Some use cases may require that all the RS roll the logs at the same time and 
> in a coordinated way. Also there may be requirement that some tasks be done 
> together with the roll log on each region server.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11144) Filter to support scan multiple row key ranges

2014-05-12 Thread Li Jiajia (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Li Jiajia updated HBASE-11144:
--

Attachment: MultiRowRangeFilter2.patch

MultiRowRangeFilter2.patch is generated for trunk.

> Filter to support scan multiple row key ranges
> --
>
> Key: HBASE-11144
> URL: https://issues.apache.org/jira/browse/HBASE-11144
> Project: HBase
>  Issue Type: Improvement
>  Components: Filters
>Reporter: Li Jiajia
> Attachments: MultiRowRangeFilter.patch, MultiRowRangeFilter2.patch
>
>
> Provide a filter feature to support scan multiple row key ranges. It can 
> construct the row key ranges from the passed list which can be accessed by 
> each region server. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11144) Filter to support scan multiple row key ranges

2014-05-12 Thread Li Jiajia (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Li Jiajia updated HBASE-11144:
--

Status: Open  (was: Patch Available)

> Filter to support scan multiple row key ranges
> --
>
> Key: HBASE-11144
> URL: https://issues.apache.org/jira/browse/HBASE-11144
> Project: HBase
>  Issue Type: Improvement
>  Components: Filters
>Reporter: Li Jiajia
> Attachments: MultiRowRangeFilter.patch
>
>
> Provide a filter feature to support scan multiple row key ranges. It can 
> construct the row key ranges from the passed list which can be accessed by 
> each region server. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11069) Decouple region merging from ZooKeeper

2014-05-12 Thread Sergey Soldatov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Soldatov updated HBASE-11069:


Status: Patch Available  (was: Open)

> Decouple region merging from ZooKeeper
> --
>
> Key: HBASE-11069
> URL: https://issues.apache.org/jira/browse/HBASE-11069
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-11069.patch, HBASE_11069-v2.patch
>
>
> Region Merge should be decoupled from ZK. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-6626) Add a chapter on HDFS in the troubleshooting section of the HBase reference guide.

2014-05-12 Thread Misty Stanley-Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995674#comment-13995674
 ] 

Misty Stanley-Jones commented on HBASE-6626:


Stack, yes please. Unless you agree with Nicholas.

> Add a chapter on HDFS in the troubleshooting section of the HBase reference 
> guide.
> --
>
> Key: HBASE-6626
> URL: https://issues.apache.org/jira/browse/HBASE-6626
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation
>Affects Versions: 0.95.2
>Reporter: Nicolas Liochon
>Assignee: Doug Meil
>Priority: Blocker
> Attachments: troubleshooting.txt
>
>
> I looked mainly at the major failure case, but here is what I have:
> New sub chapter in the existing chapter "Troubleshooting and Debugging 
> HBase": "HDFS & HBASE"
> 1) HDFS & HBase
> 2) Connection related settings
> 2.1) Number of retries
> 2.2) Timeouts
> 3) Log samples
> 1) HDFS & HBase
> HBase uses HDFS to store its HFile, i.e. the core HBase files and the 
> Write-Ahead-Logs, i.e. the files that will be used to restore the data after 
> a crash.
> In both cases, the reliability of HBase comes from the fact that HDFS writes 
> the data to multiple locations. To be efficient, HBase needs the data to be 
> available locally, hence it's highly recommended to have the HDFS datanode on 
> the same machines as the HBase Region Servers.
> Detailed information on how HDFS works can be found at [1].
> Important features are:
>  - HBase is a client application of HDFS, i.e. uses the HDFS DFSClient class. 
> This class can appears in HBase logs with other HDFS client related logs.
>  - Some HDFS settings are HDFS-server-side, i.e. must be set on the HDFS 
> side, while some other are HDFS-client-side, i.e. must be set in HBase, while 
> some other must be set in both places.
>  - the HDFS writes are pipelined from one datanode to another. When writing, 
> there are communications between:
> - HBase and HDFS namenode, through the HDFS client classes.
> - HBase and HDFS datanodes, through the HDFS client classes.
> - HDFS datanode between themselves: issues on these communications are in 
> HDFS logs, not HBase. HDFS writes are always local when possible. As a 
> consequence, there should not be much write error in HBase Region Servers: 
> they write to the local datanode. If this datanode can't replicate the 
> blocks, it will appear in its logs, not in the region servers logs.
>  - datanodes can be contacted through the ipc.Client interface (once again 
> this class can shows up in HBase logs) and the data transfer interface 
> (usually shows up as the DataNode class in the HBase logs). There are on 
> different ports (defaults being: 50010 and 50020).
>  - To understand exactly what's going on, you must look that the HDFS log 
> files as well: HBase logs represent the client side.
>  - With the default setting, HDFS needs 630s to mark a datanode as dead. For 
> this reason, this node will still be tried by HBase or by other datanodes 
> when writing and reading until HDFS definitively decides it's dead. This will 
> add some extras lines in the logs. This monitoring is performed by the 
> NameNode.
>  - The HDFS clients (i.e. HBase using HDFS client code) don't fully rely on 
> the NameNode, but can mark temporally a node as dead if they had an error 
> when they tried to use it.
> 2) Settings for retries and timeouts
> 2.1) Retries
> ipc.client.connect.max.retries
> Default 10
> Indicates the number of retries a client will make to establish a server 
> connection. Not taken into account if the error is a SocketTimeout. In this 
> case the number of retries is 45 (fixed on branch, HADOOP-7932 or in 
> HADOOP-7397). For SASL, the number of retries is hard-coded to 15. Can be 
> increased, especially if the socket timeouts have been lowered.
> ipc.client.connect.max.retries.on.timeouts
> Default 45
> If you have HADOOP-7932, max number of retries on timeout. Counter is 
> different than ipc.client.connect.max.retries so if you mix the socket errors 
> you will get 55 retries with the default values. Could be lowered, once it is 
> available. With HADOOP-7397 ipc.client.connect.max.retries is reused so there 
> would be 10 tries.
> dfs.client.block.write.retries
> Default 3
> Number of tries for the client when writing a block. After a failure, will 
> connect to the namenode a get a new location, sending the list of the 
> datanodes already tried without success. Could be increased, especially if 
> the socket timeouts have been lowered. See HBASE-6490.
> dfs.client.block.write.locateFollowingBlock.retries
> Default 5
> Number of retries to the namenode when the client got 
> NotReplicatedYetException, i.e. the existing nodes of the 

[jira] [Updated] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11154:


Attachment: HBASE-11154.patch

Fixed how I generated the patch

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
> Attachments: HBASE-11154.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11137) Add mapred.TableSnapshotInputFormat

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995983#comment-13995983
 ] 

Hadoop QA commented on HBASE-11137:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644518/HBASE-11137.01.patch
  against trunk revision .
  ATTACHMENT ID: 12644518

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 10 new 
or modified tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

 {color:red}-1 core tests{color}.  The patch failed these unit tests:
   org.apache.hadoop.hbase.snapshot.TestExportSnapshot

 {color:red}-1 core zombie tests{color}.  There are 1 zombie test(s):   
at 
org.apache.hadoop.hbase.mapreduce.TestTableMapReduceBase.testMultiRegionTable(TestTableMapReduceBase.java:96)

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9508//console

This message is automatically generated.

> Add mapred.TableSnapshotInputFormat
> ---
>
> Key: HBASE-11137
> URL: https://issues.apache.org/jira/browse/HBASE-11137
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce, Performance
>Affects Versions: 0.98.0, 0.96.2
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
> Attachments: HBASE-11137.00.patch, HBASE-11137.01.patch
>
>
> We should have feature parity between mapreduce and mapred implementations. 
> This is important for Hive.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Jerry He (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995333#comment-13995333
 ] 

Jerry He commented on HBASE-11136:
--

There is no use for postRollWALWriter() now.
Is it customary to have both pre and post in there?
{code}
[INFO] 
[INFO] Reactor Summary:
[INFO]
[INFO] HBase . SUCCESS [  1.712 s]
[INFO] HBase - Common  SUCCESS [ 25.713 s]
[INFO] HBase - Protocol .. SUCCESS [  0.265 s]
[INFO] HBase - Client  SUCCESS [ 36.078 s]
[INFO] HBase - Hadoop Compatibility .. SUCCESS [  6.542 s]
[INFO] HBase - Hadoop Two Compatibility .. SUCCESS [  1.359 s]
[INFO] HBase - Prefix Tree ... SUCCESS [  3.470 s]
[INFO] HBase - Server  SUCCESS [43:07 min]
[INFO] HBase - Testing Util .. SUCCESS [  1.276 s]
[INFO] HBase - Thrift  SUCCESS [01:45 min]
[INFO] HBase - Shell . SUCCESS [  1.054 s]
[INFO] HBase - Integration Tests . SUCCESS [  1.267 s]
[INFO] HBase - Examples .. SUCCESS [  1.064 s]
[INFO] HBase - Assembly .. SUCCESS [  0.916 s]
[INFO] 
[INFO] BUILD SUCCESS
{code}

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.96.2, 0.98.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995937#comment-13995937
 ] 

Misty Stanley-Jones commented on HBASE-11154:
-

I'm not sure if I actually need more than this. Ideally, the Scan API would 
have a working example. I can't figure out where in the Ref Guide would be the 
appropriate place for a working example of reverse scanning, since there 
doesn't even seem to be a working example of scanning.

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11153) http webUI's should redirect to https when enabled

2014-05-12 Thread Nick Dimiduk (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-11153:
-

Labels: noob  (was: )

> http webUI's should redirect to https when enabled
> --
>
> Key: HBASE-11153
> URL: https://issues.apache.org/jira/browse/HBASE-11153
> Project: HBase
>  Issue Type: Bug
>  Components: master, regionserver
>Affects Versions: 0.98.0
>Reporter: Nick Dimiduk
>Priority: Minor
>  Labels: noob
>
> When configured to listen on https, we should redirect non-secure requests to 
> the appropriate port/protocol. Currently we respond with a 200 and no data, 
> which is perplexing.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11128) Add -target option to ExportSnapshot to export with a different name

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995640#comment-13995640
 ] 

Hudson commented on HBASE-11128:


FAILURE: Integrated in HBase-0.98-on-Hadoop-1.1 #289 (See 
[https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/289/])
HBASE-11128 Add -target option to ExportSnapshot to export with a different 
name (mbertozzi: rev 1593776)
* 
/hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java
* 
/hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/snapshot/TestExportSnapshot.java


> Add -target option to ExportSnapshot to export with a different name
> 
>
> Key: HBASE-11128
> URL: https://issues.apache.org/jira/browse/HBASE-11128
> Project: HBase
>  Issue Type: Improvement
>  Components: snapshots
>Affects Versions: 0.99.0
>Reporter: Matteo Bertozzi
>Assignee: Matteo Bertozzi
>Priority: Trivial
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-11128-v0.patch
>
>
> Add a "-target" option to export the snapshot using a different name



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11123) Upgrade instructions from 0.94 to 0.98

2014-05-12 Thread Jean-Marc Spaggiari (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995506#comment-13995506
 ] 

Jean-Marc Spaggiari commented on HBASE-11123:
-

I confirm this statement. I test 0.94 => 0.98 migration as part of my release 
testing and it works well.

> Upgrade instructions from 0.94 to 0.98
> --
>
> Key: HBASE-11123
> URL: https://issues.apache.org/jira/browse/HBASE-11123
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Priority: Minor
>
> I cloned this from the 0.96 upgrade docs task. It was suggested that we need 
> upgrade instructions from 0.94 to 0.98. I will need source material to even 
> prioritize this. Assuming this is Minor.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11137) Add mapred.TableSnapshotInputFormat

2014-05-12 Thread Enis Soztutar (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995883#comment-13995883
 ] 

Enis Soztutar commented on HBASE-11137:
---

bq.  HBASE-7987 looks like the major culprit
Not sure backporting HBASE-7987 to 0.98 is a good idea. It is changing the 
snapshot file formats and layouts. 

> Add mapred.TableSnapshotInputFormat
> ---
>
> Key: HBASE-11137
> URL: https://issues.apache.org/jira/browse/HBASE-11137
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce, Performance
>Affects Versions: 0.98.0, 0.96.2
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
> Attachments: HBASE-11137.00.patch, HBASE-11137.01.patch
>
>
> We should have feature parity between mapreduce and mapred implementations. 
> This is important for Hive.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11151) move tracing modules from hbase-server to hbase-common

2014-05-12 Thread Masatake Iwasaki (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995958#comment-13995958
 ] 

Masatake Iwasaki commented on HBASE-11151:
--

Thanks for your comment [~ndimiduk]. PerfEval seems worked with this patch dut 
to intra-project dependencies. I think TestFromClientSide should be left in 
hbase-server because it refers to modules in hbase-server and hbase-client. 
hbase-server depends on hbase-common (and hbase-client) but not vice versa.


> move tracing modules from hbase-server to hbase-common
> --
>
> Key: HBASE-11151
> URL: https://issues.apache.org/jira/browse/HBASE-11151
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation, util
>Reporter: Masatake Iwasaki
>Assignee: Masatake Iwasaki
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11151-0.patch
>
>
> Not only servers but also clients using tracing needs SpanReceiverHost.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Jerry He (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jerry He updated HBASE-11136:
-

Fix Version/s: 0.99.0
   Status: Patch Available  (was: Open)

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.98.2, 0.96.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10909) Abstract out ZooKeeper usage in HBase

2014-05-12 Thread Mikhail Antonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Antonov updated HBASE-10909:


Attachment: (was: HBaseConsensus.pdf)

> Abstract out ZooKeeper usage in HBase
> -
>
> Key: HBASE-10909
> URL: https://issues.apache.org/jira/browse/HBASE-10909
> Project: HBase
>  Issue Type: Umbrella
>  Components: Consensus, Zookeeper
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Attachments: HBase Consensus.pdf, HBaseConsensus.pdf, 
> HBaseConsensus.pdf
>
>
> As some sort of follow-up or initial step towards HBASE-10296.
> Whatever consensus algorithm/library may be the chosen, perhaps one of first 
> practical steps towards this goal would be to better abstract ZK-related API 
> and details, which are now throughout the codebase (mostly leaked throuth 
> ZkUtil, ZooKeeperWatcher and listeners).
> This jira is umbrella for relevant subtasks.
> As the design doc is in process of peer-review now, please use Google Doc 
> (linked below) instead of pdf.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10909) Abstract out ZooKeeper usage in HBase

2014-05-12 Thread Mikhail Antonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Antonov updated HBASE-10909:


Attachment: HBase Consensus.pdf

Updated PDF version. design doc after numerous questions have been asked and 
discussed in the google doc. leaving ref to google doc in place..

> Abstract out ZooKeeper usage in HBase
> -
>
> Key: HBASE-10909
> URL: https://issues.apache.org/jira/browse/HBASE-10909
> Project: HBase
>  Issue Type: Umbrella
>  Components: Consensus, Zookeeper
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Attachments: HBase Consensus.pdf, HBaseConsensus.pdf, 
> HBaseConsensus.pdf
>
>
> As some sort of follow-up or initial step towards HBASE-10296.
> Whatever consensus algorithm/library may be the chosen, perhaps one of first 
> practical steps towards this goal would be to better abstract ZK-related API 
> and details, which are now throughout the codebase (mostly leaked throuth 
> ZkUtil, ZooKeeperWatcher and listeners).
> This jira is umbrella for relevant subtasks.
> As the design doc is in process of peer-review now, please use Google Doc 
> (linked below) instead of pdf.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11069) Decouple region merging from ZooKeeper

2014-05-12 Thread Sergey Soldatov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Soldatov updated HBASE-11069:


Status: Open  (was: Patch Available)

> Decouple region merging from ZooKeeper
> --
>
> Key: HBASE-11069
> URL: https://issues.apache.org/jira/browse/HBASE-11069
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-11069.patch
>
>
> Region Merge should be decoupled from ZK. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Comment Edited] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994282#comment-13994282
 ] 

stack edited comment on HBASE-11135 at 5/10/14 7:03 PM:


[~jeffreyz] has been reviewing offline.  Comments that if CP is slow doing 
pre/post append, then as is, we will slow throughput.  But as he notes, this is 
as it is in 0.98 and not really a way around it since cp, and wal listeners are 
inline w/ the actual append.

In this patch over most of the appendNoSyncs to use new method (so 
edit/sequence id is availble on return out of append).  Changed the append to 
use a latch.  This gained me back a few percent... maybe 5%... but this patch 
makes us slower appending.  When 200 threads contending, about half slower 
again (180seconds vs 280 seconds to complete test):

{code}
$ for i in 1 3 5 10 50 200; do for j in 1 2 3; do perf stat 
./hbase-0.99.0-SNAPSHOT/bin/hbase --config /home/stack/conf_hbase 
org.apache.hadoop.hbase.regionserver.wal.HLogPerformanceEvaluation -threads $i  
 -iterations 10 -keySize 50 -valueSize 100  &> 
"/tmp/wpatch_latch_barrier.${i}.${j}.txt"; done; done
{code}

I tried then also using a cyclicbarrier in the SyncFuture for the handler 
thread and the the sync'ing thread to coordinate around but no noticeable 
difference (perhaps a little slower).

I'd be good committing this for now since it simplifies the unification of mvcc 
and sequence id.  Even when we are half slower again we are still about 0.98 
throughput in trunk since the claim in HBASE-10156 is that it doubled our 
throughput at 200 threads.

After the unification happens, we can come back here again to do see if we can 
get perf back.


was (Author: stack):
[~jeffreyz] has been reviewing offline.  Comments that if CP is slow doing 
pre/post append, then as is, we will slow throughput.  But as he notes, this is 
as it is in 0.98 and not really a way around it since cp, and wal listeners are 
inline w/ the actual append.

In this patch over most of the appendNoSyncs to use new method (so 
edit/sequence id is availble on return out of append).  Changed the append to 
use a latch.  This gained me back a few percent... maybe 5%... but this patch 
makes us slower appending.  When 200 threads contending, about 1/3rd slower 
again (180seconds vs 280 seconds to complete test):

{code}
$ for i in 1 3 5 10 50 200; do for j in 1 2 3; do perf stat 
./hbase-0.99.0-SNAPSHOT/bin/hbase --config /home/stack/conf_hbase 
org.apache.hadoop.hbase.regionserver.wal.HLogPerformanceEvaluation -threads $i  
 -iterations 10 -keySize 50 -valueSize 100  &> 
"/tmp/wpatch_latch_barrier.${i}.${j}.txt"; done; done
{code}

I tried then also using a cyclicbarrier in the SyncFuture for the handler 
thread and the the sync'ing thread to coordinate around but no noticeable 
difference (perhaps a little slower).

I'd be good committing this for now since it simplifies the unification of mvcc 
and sequence id.  Even when we are 1/3rd slower we are still ahead of 0.98 in 
trunk since the claim in HBASE-10156 is that it doubled our throughput at 200 
threads. 

After the unification happens, we can come back here again to do see if we can 
get perf back.

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then i

[jira] [Updated] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11154:


Status: Patch Available  (was: Open)

Patch attached, which adds some notes to the chapter about schema design 
pointing out the new reverse scan API

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
> Attachments: HBASE-11154.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11154:


Fix Version/s: (was: 0.98.3)
   Status: Open  (was: Patch Available)

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
> Attachments: HBASE-11154.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11154:


Attachment: HBASE-11154.patch

> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
> Fix For: 0.98.3
>
> Attachments: HBASE-11154.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995929#comment-13995929
 ] 

Misty Stanley-Jones commented on HBASE-11154:
-

Added the following note in the first two places above:


  Reverse Scan API
  
https://issues.apache.org/jira/browse/HBASE-4811";>HBASE-4811 
implements an API to scan a table or a range within a table in reverse, 
reducing the need to optimize your schema for forward or reverse scanning. This 
feature is available in HBase 0.98 and later. See https://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/Scan.html#setReversed%28boolean";
 /> for more information.
  


> Document how to use Reverse Scan API
> 
>
> Key: HBASE-11154
> URL: https://issues.apache.org/jira/browse/HBASE-11154
> Project: HBase
>  Issue Type: Task
>  Components: documentation, Scanners
>Affects Versions: 0.98.2
>Reporter: Misty Stanley-Jones
>Assignee: Misty Stanley-Jones
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10909) Abstract out ZooKeeper usage in HBase

2014-05-12 Thread Mikhail Antonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Antonov updated HBASE-10909:


Affects Version/s: 0.99.0

> Abstract out ZooKeeper usage in HBase
> -
>
> Key: HBASE-10909
> URL: https://issues.apache.org/jira/browse/HBASE-10909
> Project: HBase
>  Issue Type: Umbrella
>  Components: Consensus, Zookeeper
>Affects Versions: 0.99.0
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Attachments: HBaseConsensus.pdf, HBaseConsensus.pdf, 
> HBaseConsensus.pdf
>
>
> As some sort of follow-up or initial step towards HBASE-10296.
> Whatever consensus algorithm/library may be the chosen, perhaps one of first 
> practical steps towards this goal would be to better abstract ZK-related API 
> and details, which are now throughout the codebase (mostly leaked throuth 
> ZkUtil, ZooKeeperWatcher and listeners).
> This jira is umbrella for relevant subtasks. Design doc is attached, for 
> comments/questions there's a google doc linked.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Jerry He (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995578#comment-13995578
 ] 

Jerry He commented on HBASE-11136:
--

Hi, Ted

Good question. I am not very clear regarding Permission.Action.CREATE with a 
global permission scope
Some examples of only Permission.Action.ADMIN is balance, snapshot, stop rs.

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.96.2, 0.98.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11128) Add -target option to ExportSnapshot to export with a different name

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995734#comment-13995734
 ] 

Hudson commented on HBASE-11128:


SUCCESS: Integrated in HBase-0.98 #304 (See 
[https://builds.apache.org/job/HBase-0.98/304/])
HBASE-11128 Add -target option to ExportSnapshot to export with a different 
name (mbertozzi: rev 1593776)
* 
/hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java
* 
/hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/snapshot/TestExportSnapshot.java


> Add -target option to ExportSnapshot to export with a different name
> 
>
> Key: HBASE-11128
> URL: https://issues.apache.org/jira/browse/HBASE-11128
> Project: HBase
>  Issue Type: Improvement
>  Components: snapshots
>Affects Versions: 0.99.0
>Reporter: Matteo Bertozzi
>Assignee: Matteo Bertozzi
>Priority: Trivial
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-11128-v0.patch
>
>
> Add a "-target" option to export the snapshot using a different name



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (HBASE-11154) Document how to use Reverse Scan API

2014-05-12 Thread Misty Stanley-Jones (JIRA)
Misty Stanley-Jones created HBASE-11154:
---

 Summary: Document how to use Reverse Scan API
 Key: HBASE-11154
 URL: https://issues.apache.org/jira/browse/HBASE-11154
 Project: HBase
  Issue Type: Task
  Components: documentation, Scanners
Affects Versions: 0.98.2
Reporter: Misty Stanley-Jones
Assignee: Misty Stanley-Jones






--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11119) Update ExportSnapShot to optionally not use a tmp file on external file system

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13993393#comment-13993393
 ] 

Hudson commented on HBASE-9:


SUCCESS: Integrated in HBase-0.94-JDK7 #133 (See 
[https://builds.apache.org/job/HBase-0.94-JDK7/133/])
HBASE-9 Update ExportSnapShot to optionally not use a tmp file on external 
file system (Ted Malaska) (mbertozzi: rev 1593340)
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/snapshot/TestExportSnapshot.java


> Update ExportSnapShot to optionally not use a tmp file on external file system
> --
>
> Key: HBASE-9
> URL: https://issues.apache.org/jira/browse/HBASE-9
> Project: HBase
>  Issue Type: Improvement
>  Components: snapshots
>Reporter: Ted Malaska
>Assignee: Ted Malaska
>Priority: Minor
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-9.patch
>
>
> There are FileSystem like S3 where renaming is extremely expensive.  This 
> patch will add a parameter that says something like
> use.tmp.folder
> It will be defaulted to true.  So default behavior is the same.  If false is 
> set them the files will land in the final destination with no need for a 
> rename. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10357) Failover RPC's for scans

2014-05-12 Thread Devaraj Das (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated HBASE-10357:


Fix Version/s: hbase-10070

> Failover RPC's for scans
> 
>
> Key: HBASE-10357
> URL: https://issues.apache.org/jira/browse/HBASE-10357
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Reporter: Enis Soztutar
> Fix For: 0.99.0, hbase-10070
>
> Attachments: 10357-1.txt, 10357-2.txt, 10357-3.2.txt, 10357-3.txt, 
> 10357-4.2.txt, 10357-4.3.txt, 10357-4.txt
>
>
> This is extension of HBASE-10355 to add failover support for scans. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10357) Failover RPC's for scans

2014-05-12 Thread Devaraj Das (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated HBASE-10357:


Attachment: 10357-4.3.1.txt

This is without the pom.xml change.

> Failover RPC's for scans
> 
>
> Key: HBASE-10357
> URL: https://issues.apache.org/jira/browse/HBASE-10357
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Reporter: Enis Soztutar
> Fix For: 0.99.0, hbase-10070
>
> Attachments: 10357-1.txt, 10357-2.txt, 10357-3.2.txt, 10357-3.txt, 
> 10357-4.2.txt, 10357-4.3.1.txt, 10357-4.3.txt, 10357-4.txt
>
>
> This is extension of HBASE-10355 to add failover support for scans. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11098) Improve documentation around our blockcache options

2014-05-12 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995830#comment-13995830
 ] 

Nick Dimiduk commented on HBASE-11098:
--

Docstrings looks good to me. Maybe we need an example configuration to help get 
folks started, but I guess that's for the book, not docstrings.

What's up with concurrent map changes? These were intentional?

> Improve documentation around our blockcache options
> ---
>
> Key: HBASE-11098
> URL: https://issues.apache.org/jira/browse/HBASE-11098
> Project: HBase
>  Issue Type: Sub-task
>  Components: documentation
>Reporter: stack
>Assignee: stack
> Attachments: 11098.txt, 11098v2.txt
>
>
> Adding as a subtask on [~ndimiduk] necessary cleanup.  Trying to make sense 
> of this stuff I started adding doc (package-info files, javadoc).  I see this 
> as complementary to the parent task work.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995895#comment-13995895
 ] 

Lars Hofhansl commented on HBASE-11143:
---

[~apurtell], you OK with this in 0.98? It's just a new metric.

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995314#comment-13995314
 ] 

Ted Yu commented on HBASE-11136:


What's the purpose of adding postRollWALWriter() hook ?

Can you post the result for unit test suite ? QA bot is down.

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.96.2, 0.98.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11137) Add mapred.TableSnapshotInputFormat

2014-05-12 Thread Nick Dimiduk (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-11137:
-

Attachment: HBASE-11137.01.patch

Updating patch: fix line lengths and add support for both namespaces to 
IntegrationTestTableSnapshotInputFormat.

Looks like trunk has enjoyed some progress around snapshots, so rebasing to 
0.98 and 0.96 looks messy. HBASE-7987 looks like the major culprit. I'd prefer 
to see this backported before rebasing the patch (otherwise, I basically repeat 
the refactor manually for each branch).

> Add mapred.TableSnapshotInputFormat
> ---
>
> Key: HBASE-11137
> URL: https://issues.apache.org/jira/browse/HBASE-11137
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce, Performance
>Affects Versions: 0.98.0, 0.96.2
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
> Attachments: HBASE-11137.00.patch, HBASE-11137.01.patch
>
>
> We should have feature parity between mapreduce and mapred implementations. 
> This is important for Hive.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10985) Decouple Split Transaction from Zookeeper

2014-05-12 Thread Sergey Soldatov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Soldatov updated HBASE-10985:


Status: Patch Available  (was: Open)

> Decouple Split Transaction from Zookeeper
> -
>
> Key: HBASE-10985
> URL: https://issues.apache.org/jira/browse/HBASE-10985
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-10985.patch, HBASE-10985.patch, HBASE-10985.patch, 
> HBASE_10985-v2.patch, HBASE_10985-v3.patch, HBASE_10985-v4.patch
>
>
> As part of  HBASE-10296 SplitTransaction should be decoupled from Zookeeper. 
> This is an initial patch for review. At the moment the consensus provider  
> placed directly to SplitTransaction to minimize affected code. In the ideal 
> world it should be done in HServer.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11069) Decouple region merging from ZooKeeper

2014-05-12 Thread Sergey Soldatov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Soldatov updated HBASE-11069:


Attachment: HBASE_11069-v2.patch

Rebased to the latest trunk and did better decoupling. 



> Decouple region merging from ZooKeeper
> --
>
> Key: HBASE-11069
> URL: https://issues.apache.org/jira/browse/HBASE-11069
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-11069.patch, HBASE_11069-v2.patch
>
>
> Region Merge should be decoupled from ZK. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11118) non environment variable solution for "IllegalAccessError: class com.google.protobuf.ZeroCopyLiteralByteString cannot access its superclass com.google.protobuf.Literal

2014-05-12 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995264#comment-13995264
 ] 

Nick Dimiduk commented on HBASE-8:
--

I took the shade question to 
[user@maven|http://www.mail-archive.com/users@maven.apache.org/msg133791.html]. 
Maybe some helpful insight will surface.

> non environment variable solution for "IllegalAccessError: class 
> com.google.protobuf.ZeroCopyLiteralByteString cannot access its superclass 
> com.google.protobuf.LiteralByteString"
> --
>
> Key: HBASE-8
> URL: https://issues.apache.org/jira/browse/HBASE-8
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.98.2
>Reporter: André Kelpe
>Priority: Blocker
> Fix For: 0.99.0, 0.98.3
>
> Attachments: shade_attempt.patch
>
>
> I am running into the problem described in 
> https://issues.apache.org/jira/browse/HBASE-10304, while trying to use a 
> newer version within cascading.hbase 
> (https://github.com/cascading/cascading.hbase).
> One of the features of cascading.hbase is that you can use it from lingual 
> (http://www.cascading.org/projects/lingual/), our SQL layer for hadoop. 
> lingual has a notion of providers, which are fat jars that we pull down 
> dynamically at runtime. Those jars give users the ability to talk to any 
> system or format from SQL. They are added to the classpath  programmatically 
> before we submit jobs to a hadoop cluster.
> Since lingual does not know upfront , which providers are going to be used in 
> a given run, the HADOOP_CLASSPATH trick proposed in the JIRA above is really 
> clunky and breaks the ease of use we had before. No other provider requires 
> this right now.
> It would be great to have a programmatical way to fix this, when using fat 
> jars.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11133) Add an option to skip snapshot verification after Export

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995641#comment-13995641
 ] 

Hudson commented on HBASE-11133:


FAILURE: Integrated in HBase-0.98-on-Hadoop-1.1 #289 (See 
[https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/289/])
HBASE-11133 Add an option to skip snapshot verification after ExportSnapshot 
(mbertozzi: rev 1593770)
* 
/hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> Add an option to skip snapshot verification after Export
> 
>
> Key: HBASE-11133
> URL: https://issues.apache.org/jira/browse/HBASE-11133
> Project: HBase
>  Issue Type: Bug
>  Components: snapshots
>Affects Versions: 0.99.0
>Reporter: Matteo Bertozzi
>Assignee: Matteo Bertozzi
>Priority: Trivial
> Fix For: 0.99.0, 0.96.3, 0.98.3
>
> Attachments: HBASE-11133-v0.patch, HBASE-11133-v1.patch
>
>
> Add a "-skip-dst-verify" option to skip snapshot verification after Export



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10357) Failover RPC's for scans

2014-05-12 Thread Devaraj Das (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated HBASE-10357:


Attachment: 10357-4.3.txt

Addresses the comments last raised.
The main change is that the "close" of the unneeded scanners are queued up in 
the executor service to make sure all scanners (primary or otherwise) are 
closed when not needed. 
Also a semantic change - when a scanner is obtained, the underlying table 
shouldn't have been closed. Updated some unit tests to reflect this. So for 
example, one pattern is - one would create a table, do a bunch of puts, and 
then close the table. Subsequently, a scanner would be obtained from that 
table. In the approach on this jira, since the scanner uses the pool always, if 
the table is  closed, the underlying pool would be shut down as well. This 
would lead to issues. In earlier versions of the patch, I would create a new 
pool if the passed pool was shut down but it makes handling the closing of the 
pool, etc. an issue. I think for 1.0 we should make the change that a table 
shouldn't be closed if we want to invoke methods on that table.

> Failover RPC's for scans
> 
>
> Key: HBASE-10357
> URL: https://issues.apache.org/jira/browse/HBASE-10357
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Reporter: Enis Soztutar
> Fix For: 0.99.0
>
> Attachments: 10357-1.txt, 10357-2.txt, 10357-3.2.txt, 10357-3.txt, 
> 10357-4.2.txt, 10357-4.3.txt, 10357-4.txt
>
>
> This is extension of HBASE-10355 to add failover support for scans. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-7987) Snapshot Manifest file instead of multiple empty files

2014-05-12 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-7987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995793#comment-13995793
 ] 

Nick Dimiduk commented on HBASE-7987:
-

Hi [~mbertozzi]. Any plan for a backport to 0.98? I'm seeing some conflict in 
TableSnapshotInputFormat while backporting my patch on HBASE-11137. If you 
intend a backport, I'd rather not further complicate.

> Snapshot Manifest file instead of multiple empty files
> --
>
> Key: HBASE-7987
> URL: https://issues.apache.org/jira/browse/HBASE-7987
> Project: HBase
>  Issue Type: Improvement
>  Components: snapshots
>Reporter: Matteo Bertozzi
>Assignee: Matteo Bertozzi
> Fix For: 0.99.0
>
> Attachments: HBASE-7987-v0.patch, HBASE-7987-v1.patch, 
> HBASE-7987-v2.patch, HBASE-7987-v2.sketch, HBASE-7987-v3.patch, 
> HBASE-7987-v4.patch, HBASE-7987-v5.patch, HBASE-7987-v6.patch, 
> HBASE-7987.sketch
>
>
> Currently taking a snapshot means creating one empty file for each file in 
> the source table directory, plus copying the .regioninfo file for each 
> region, the table descriptor file and a snapshotInfo file.
> during the restore or snapshot verification we traverse the filesystem 
> (fs.listStatus()) to find the snapshot files, and we open the .regioninfo 
> files to get the information.
> to avoid hammering the NameNode and having lots of empty files, we can use a 
> manifest file that contains the list of files and information that we need.
> To keep the RS parallelism that we have, each RS can write its own manifest.
> {code}
> message SnapshotDescriptor {
>   required string name;
>   optional string table;
>   optional int64 creationTime;
>   optional Type type;
>   optional int32 version;
> }
> message SnapshotRegionManifest {
>   optional int32 version;
>   required RegionInfo regionInfo;
>   repeated FamilyFiles familyFiles;
>   message StoreFile {
> required string name;
> optional Reference reference;
>   }
>   message FamilyFiles {
> required bytes familyName;
> repeated StoreFile storeFiles;
>   }
> }
> {code}
> {code}
> /hbase/.snapshot/
> /hbase/.snapshot//snapshotInfo
> /hbase/.snapshot//
> /hbase/.snapshot///tableInfo
> /hbase/.snapshot///regionManifest(.n)
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11108) Split ZKTable into interface and implementation

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995792#comment-13995792
 ] 

Hadoop QA commented on HBASE-11108:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644478/HBASE-11108.patch
  against trunk revision .
  ATTACHMENT ID: 12644478

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 37 new 
or modified tests.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9506//console

This message is automatically generated.

> Split ZKTable into interface and implementation
> ---
>
> Key: HBASE-11108
> URL: https://issues.apache.org/jira/browse/HBASE-11108
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Affects Versions: 0.99.0
>Reporter: Konstantin Boudnik
>Assignee: Mikhail Antonov
> Attachments: HBASE-11108.patch, HBASE-11108.patch, HBASE-11108.patch
>
>
> In HBASE-11071 we are trying to split admin handlers away from ZK. However, a 
> ZKTable instance is being used in multiple places, hence it would be 
> beneficial to hide its implementation behind a well defined interface.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11137) Add mapred.TableSnapshotInputFormat

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995790#comment-13995790
 ] 

Hadoop QA commented on HBASE-11137:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644212/HBASE-11137.00.patch
  against trunk revision .
  ATTACHMENT ID: 12644212

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 10 new 
or modified tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 lineLengths{color}.  The patch introduces the following lines 
longer than 100:
+public TableSnapshotRegionSplit(HTableDescriptor htd, HRegionInfo 
regionInfo, List locations) {
+  public static void setInput(JobConf job, String snapshotName, Path 
restoreDir) throws IOException {
+public TableSnapshotRegionSplit(HTableDescriptor htd, HRegionInfo 
regionInfo, List locations) {

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9504//console

This message is automatically generated.

> Add mapred.TableSnapshotInputFormat
> ---
>
> Key: HBASE-11137
> URL: https://issues.apache.org/jira/browse/HBASE-11137
> Project: HBase
>  Issue Type: Improvement
>  Components: mapreduce, Performance
>Affects Versions: 0.98.0, 0.96.2
>Reporter: Nick Dimiduk
>Assignee: Nick Dimiduk
> Attachments: HBASE-11137.00.patch
>
>
> We should have feature parity between mapreduce and mapred implementations. 
> This is important for Hive.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (HBASE-10235) Backport HBASE-10226 (Namespace grants not always checked) to 0.96

2014-05-12 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell resolved HBASE-10235.


Resolution: Won't Fix
  Assignee: (was: Andrew Purtell)

> Backport HBASE-10226 (Namespace grants not always checked) to 0.96
> --
>
> Key: HBASE-10235
> URL: https://issues.apache.org/jira/browse/HBASE-10235
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.96.2
>Reporter: Andrew Purtell
>
> Namespace level grants are not checked by the AccessController in 0.96 also.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995591#comment-13995591
 ] 

Hadoop QA commented on HBASE-11135:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644431/11135v5.txt
  against trunk revision .
  ATTACHMENT ID: 12644431

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 12 new 
or modified tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:red}-1 javadoc{color}.  The javadoc tool appears to have generated 5 
warning messages.

{color:red}-1 findbugs{color}.  The patch appears to introduce 1 new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 lineLengths{color}.  The patch introduces the following lines 
longer than 100:
+i.visitLogEntryBeforeWrite(entry.getHTableDescriptor(), 
entry.getKey(), entry.getEdit());
+   * @deprecated Use {@link #appendNoSync(HRegionInfo, HLogKey, WALEdit, 
HTableDescriptor, AtomicLong, boolean)}
+fsUtils.recoverFileLease(((HFileSystem)fs).getBackingFs(), p, 
TEST_UTIL.getConfiguration(), null);

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

 {color:red}-1 core tests{color}.  The patch failed these unit tests:
   org.apache.hadoop.hbase.client.TestMultiParallel
  org.apache.hadoop.hbase.regionserver.wal.TestLogRolling

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9502//console

This message is automatically generated.

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt, 11135v5.txt, 
> 11135v5.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all han

[jira] [Commented] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995661#comment-13995661
 ] 

Lars Hofhansl commented on HBASE-11143:
---

[~stack], the new metric? Sure. I suppose our message at this point is to 
upgrade from 0.94 to 0.98, right?
OK... Lemme me put it in 0.94, 0.98, and trunk.

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11140) LocalHBaseCluster should create ConsensusProvider per each server

2014-05-12 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-11140:
--

  Resolution: Fixed
Hadoop Flags: Reviewed
  Status: Resolved  (was: Patch Available)

Committed to trunk.  Thanks [~mantonov]

> LocalHBaseCluster should create ConsensusProvider per each server
> -
>
> Key: HBASE-11140
> URL: https://issues.apache.org/jira/browse/HBASE-11140
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus
>Affects Versions: 0.99.0
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Fix For: 0.99.0
>
> Attachments: HBASE-11140.patch
>
>
> Right now there's a bug there when single ConsensusProvider instance is 
> shared across all threads running region servers and masters within 
> processes, which breaks certain tests in patches which used to pass 
> successfully before.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-10985) Decouple Split Transaction from Zookeeper

2014-05-12 Thread Mikhail Antonov (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995720#comment-13995720
 ] 

Mikhail Antonov commented on HBASE-10985:
-

Some small comments:

 - unused import of ZkSplitTransactionConsensus in AssignmentManager, 
znodeVersion in SplitTransaction isn't used (and shouldn't be needed?)
 - Why do we need to pass in Server instance to methods of 
SplitTransactionConsensus? We can retrieve it from the consensusProvider 
reference kept in ZkSplitTransactionConsensus instance (also the split 
consensus could better be  field in SplitTransaction class I guess, shouldn't 
need to retrieve it from ConsensusProvider each time?)
 - fields consensus and watcher in ZkSplitTransactionConsensus are unused
 - can we have the methods on split consensus to not use zk-related terms like 
"transition node", or it's hard because of the zk is too tightly integrated 
here? Ideally guys glancing thru code other than consensus impl shall not need 
to know that. E.g. we could have it renamed like: (transitionSplittingNode -> 
preCommitByRegionServer; transitionZkNode -> commitByRegionServer, 
createNodeSplitting -> startTxnOnRegionServer, getNode -> 
heartbeatRegionSplitting - do you think this naming accurately reflects the 
actual meaning?), also - could we not return values like -1 coming from ZK 
world from methods of consensus?
 - to eliminate KeeperException from SplitTransaction, could we move some 
portion of post splitting steps in openDaughers() like this:

 {code}
ry {
  // add 2nd daughter first (see HBASE-4335)
  services.postOpenDeployTasks(b, server.getCatalogTracker());
  // Should add it to OnlineRegions
  services.addToOnlineRegions(b);
  services.postOpenDeployTasks(a, server.getCatalogTracker());
  services.addToOnlineRegions(a);
{code}
to separate consensus method? This way we can remove KeeperException from 
imports.

> Decouple Split Transaction from Zookeeper
> -
>
> Key: HBASE-10985
> URL: https://issues.apache.org/jira/browse/HBASE-10985
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Reporter: Sergey Soldatov
> Attachments: HBASE-10985.patch, HBASE-10985.patch, HBASE-10985.patch, 
> HBASE_10985-v2.patch, HBASE_10985-v3.patch, HBASE_10985-v4.patch
>
>
> As part of  HBASE-10296 SplitTransaction should be decoupled from Zookeeper. 
> This is an initial patch for review. At the moment the consensus provider  
> placed directly to SplitTransaction to minimize affected code. In the ideal 
> world it should be done in HServer.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11133) Add an option to skip snapshot verification after Export

2014-05-12 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995735#comment-13995735
 ] 

Hudson commented on HBASE-11133:


SUCCESS: Integrated in HBase-0.98 #304 (See 
[https://builds.apache.org/job/HBase-0.98/304/])
HBASE-11133 Add an option to skip snapshot verification after ExportSnapshot 
(mbertozzi: rev 1593770)
* 
/hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> Add an option to skip snapshot verification after Export
> 
>
> Key: HBASE-11133
> URL: https://issues.apache.org/jira/browse/HBASE-11133
> Project: HBase
>  Issue Type: Bug
>  Components: snapshots
>Affects Versions: 0.99.0
>Reporter: Matteo Bertozzi
>Assignee: Matteo Bertozzi
>Priority: Trivial
> Fix For: 0.99.0, 0.96.3, 0.98.3
>
> Attachments: HBASE-11133-v0.patch, HBASE-11133-v1.patch
>
>
> Add a "-skip-dst-verify" option to skip snapshot verification after Export



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995638#comment-13995638
 ] 

stack commented on HBASE-11143:
---

Suggest do not put in 0.96 unless someone asks explicitly for it.

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11151) move tracing modules from hbase-server to hbase-common

2014-05-12 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995671#comment-13995671
 ] 

Hadoop QA commented on HBASE-11151:
---

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12644320/HBASE-11151-0.patch
  against trunk revision .
  ATTACHMENT ID: 12644320

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+0 tests included{color}.  The patch appears to be a 
documentation patch that doesn't require tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9503//console

This message is automatically generated.

> move tracing modules from hbase-server to hbase-common
> --
>
> Key: HBASE-11151
> URL: https://issues.apache.org/jira/browse/HBASE-11151
> Project: HBase
>  Issue Type: Improvement
>  Components: documentation, util
>Reporter: Masatake Iwasaki
>Assignee: Masatake Iwasaki
>Priority: Minor
> Attachments: HBASE-11151-0.patch
>
>
> Not only servers but also clients using tracing needs SpanReceiverHost.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-11135:
--

Attachment: 11135v6.txt

The TestLogRolling was a real issue (I love unit tests).  TestMultiParallel is 
not me.  This has just started failing generally.  Fix line lengths.

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt, 11135v5.txt, 
> 11135v5.txt, 11135v6.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then if 
> async or no sync, we will just let the threads return ... updating mvcc just 
> before we let them go.  All other calls will go up on to the second ring 
> buffer to be serviced as now (batching, distribution out among the sync'ing 
> threads).  The first rb will have no friction and should turn at fast rates 
> compared to the second.  There should not be noticeable slowdown nor do I 
> foresee this refactor intefering w/ our multi-WAL plans.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-10513) Provide user documentation for region replicas

2014-05-12 Thread Enis Soztutar (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-10513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Enis Soztutar updated HBASE-10513:
--

Attachment: timeline_consitency.png
hbase-10513_v1.patch

Updated the doc for addressing the comments, and adding some more clarity. 
Attaching the docbook version. I'll commit this version. 

> Provide user documentation for region replicas
> --
>
> Key: HBASE-10513
> URL: https://issues.apache.org/jira/browse/HBASE-10513
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Enis Soztutar
>Assignee: Enis Soztutar
> Fix For: 0.99.0
>
> Attachments: UserdocumentationforHBASE-10070.pdf, 
> hbase-10513_v1.patch, timeline_consitency.png
>
>
> We need some documentation for the feature introduced in HBASE-10070. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11108) Split ZKTable into interface and implementation

2014-05-12 Thread Mikhail Antonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Antonov updated HBASE-11108:


Status: Open  (was: Patch Available)

> Split ZKTable into interface and implementation
> ---
>
> Key: HBASE-11108
> URL: https://issues.apache.org/jira/browse/HBASE-11108
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Affects Versions: 0.99.0
>Reporter: Konstantin Boudnik
>Assignee: Mikhail Antonov
> Attachments: HBASE-11108.patch, HBASE-11108.patch, HBASE-11108.patch
>
>
> In HBASE-11071 we are trying to split admin handlers away from ZK. However, a 
> ZKTable instance is being used in multiple places, hence it would be 
> beneficial to hide its implementation behind a well defined interface.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-10942) support parallel request cancellation for multi-get

2014-05-12 Thread Sergey Shelukhin (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995613#comment-13995613
 ] 

Sergey Shelukhin commented on HBASE-10942:
--

isReplica there is only used for logging; fixed. Will fix rest and update the 
patch

> support parallel request cancellation for multi-get
> ---
>
> Key: HBASE-10942
> URL: https://issues.apache.org/jira/browse/HBASE-10942
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Sergey Shelukhin
>Assignee: Sergey Shelukhin
> Fix For: hbase-10070
>
> Attachments: HBASE-10942.01.patch, HBASE-10942.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11108) Split ZKTable into interface and implementation

2014-05-12 Thread Mikhail Antonov (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mikhail Antonov updated HBASE-11108:


Attachment: HBASE-11108.patch

fixed broken test, patch tests on local machine.

> Split ZKTable into interface and implementation
> ---
>
> Key: HBASE-11108
> URL: https://issues.apache.org/jira/browse/HBASE-11108
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus, Zookeeper
>Affects Versions: 0.99.0
>Reporter: Konstantin Boudnik
>Assignee: Mikhail Antonov
> Attachments: HBASE-11108.patch, HBASE-11108.patch, HBASE-11108.patch
>
>
> In HBASE-11071 we are trying to split admin handlers away from ZK. However, a 
> ZKTable instance is being used in multiple places, hence it would be 
> beneficial to hide its implementation behind a well defined interface.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-11135:
--

Attachment: 11135.wip.txt

wip

Includes doc cleanup from HBASE-11109.

Does not do double ring buffer.  Instead, when we appendNoSync, we just wait 
for the append to complete before returning (early-binding instead of 
late-binding).  Also changed signature for HLog#appendNoSync so it takes a 
HLogKey.  We stick the region edit/sequence id into the HLogKey so it is 
available on return from appendNoSync.

Region sequence id is now always updated by the WAL system (Need to finish 
getting an id -- plumb it through).

Should fix HBASE-11109.  This patch includes the HBASE-11109 tests.

Need to see how much perf we are losing doing this wait on append and then a 
wait on sync.

Need to do cleanup too.





> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then if 
> async or no sync, we will just let the threads return ... updating mvcc just 
> before we let them go.  All other calls will go up on to the second ring 
> buffer to be serviced as now (batching, distribution out among the sync'ing 
> threads).  The first rb will have no friction and should turn at fast rates 
> compared to the second.  There should not be noticeable slowdown nor do I 
> foresee this refactor intefering w/ our multi-WAL plans.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-1811) Snapshot HFile and region statistics at compaction time and make info available to clients

2014-05-12 Thread Jesse Yates (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-1811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995307#comment-13995307
 ] 

Jesse Yates commented on HBASE-1811:


Yeah, 7958 looks like a dup (timeline wise). Looks like good ideas always come 
back around :)

We could make 7958 be the scan-time infrastructure and this one to be adding 
comprehensive stats?

> Snapshot HFile and region statistics at compaction time and make info 
> available to clients
> --
>
> Key: HBASE-1811
> URL: https://issues.apache.org/jira/browse/HBASE-1811
> Project: HBase
>  Issue Type: Improvement
>Reporter: Andrew Purtell
>Priority: Minor
>
> Consider snapshotting HFile and region statistics at major and minor 
> compaction time and making the info available to clients:
> * Key statistics
>  ** cardinality
>  ** length avg/min/max/stdev
>  ** information content measure (entropy, etc.)
>  ** histogram
> etc.
> * Value statistics
>  ** length avg/min/max/stdev
>  ** information content measure (entropy, etc.)
>  ** histogram
> etc.
> * Region statistics
>  ** density estimation
>  ** KV count
>  ** total storage size (on disk)
>  ** total storage size (uncompressed)
> etc. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11112) PerformanceEvaluation should document --multiGet option on its printUsage.

2014-05-12 Thread Jean-Marc Spaggiari (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-2?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jean-Marc Spaggiari updated HBASE-2:


Status: Open  (was: Patch Available)

> PerformanceEvaluation should document --multiGet option on its printUsage.
> --
>
> Key: HBASE-2
> URL: https://issues.apache.org/jira/browse/HBASE-2
> Project: HBase
>  Issue Type: Bug
>  Components: documentation, Performance
>Affects Versions: 0.98.2
>Reporter: Jean-Marc Spaggiari
>Assignee: Jean-Marc Spaggiari
> Attachments: HBASE-2-v0-trunk.patch, HBASE-2-v1-trunk.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11098) Improve documentation around our blockcache options

2014-05-12 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995343#comment-13995343
 ] 

stack commented on HBASE-11098:
---

[~ndimiduk] Mind if I commit this as is Nick (I'll not do the file removes, 
just the doc fixup?)  I'd like to get this in and backport.  It is just javadoc 
to help folks trying to navigate in here.

> Improve documentation around our blockcache options
> ---
>
> Key: HBASE-11098
> URL: https://issues.apache.org/jira/browse/HBASE-11098
> Project: HBase
>  Issue Type: Sub-task
>  Components: documentation
>Reporter: stack
>Assignee: stack
> Attachments: 11098.txt, 11098v2.txt
>
>
> Adding as a subtask on [~ndimiduk] necessary cleanup.  Trying to make sense 
> of this stuff I started adding doc (package-info files, javadoc).  I see this 
> as complementary to the parent task work.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11096) stop method of Master and RegionServer coprocessor is not invoked

2014-05-12 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995235#comment-13995235
 ] 

Ted Yu commented on HBASE-11096:


{code}
+LOG.error("on RegionCoprocessorEnvironment!!", new IOException(
+"in RegionCoprocessorEnvironment"));
{code}
Is your intention of creating IOException just for record keeping ?
Looks like IOException doesn't have to be created.

> stop method of Master and RegionServer coprocessor  is not invoked
> --
>
> Key: HBASE-11096
> URL: https://issues.apache.org/jira/browse/HBASE-11096
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.96.2, 0.98.1, 0.94.19
>Reporter: Qiang Tian
>Assignee: Qiang Tian
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: HBASE-11096-trunk-v0.patch, HBASE-11096-trunk-v0.patch, 
> HBASE-11096-trunk-v1.patch, HBASE-11096-trunk-v2.patch
>
>
> the stop method of coprocessor specified by 
> "hbase.coprocessor.master.classes" and  
> "hbase.coprocessor.regionserver.classes"  is not invoked.
> If coprocessor allocates OS resources, it could cause master/regionserver 
> resource leak or hang during exit.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (HBASE-2834) Deferred deletes

2014-05-12 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-2834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell resolved HBASE-2834.
---

Resolution: Duplicate

> Deferred deletes
> 
>
> Key: HBASE-2834
> URL: https://issues.apache.org/jira/browse/HBASE-2834
> Project: HBase
>  Issue Type: New Feature
>Reporter: Andrew Purtell
>
> Tangentally mentioned in a blog post, James Hamilton talks about deferred 
> deletes:
> {quote}
> If you have an application error, administrative error, or database 
> implementation bug that losses data, then it is simply gone unless you have 
> an offline copy. This, by the way, is why I'm a big fan of deferred delete.  
> This is a technique where deleted items are marked as deleted but not garbage 
> collected until some days or preferably weeks later.  Deferred delete is not 
> full protection but it has saved my butt more than once and I'm a believer. 
> See On Designing and Deploying Internet-Scale Services 
> (http://mvdirona.com/jrh/talksAndPapers/JamesRH_Lisa.pdf) for more detail.
> {quote}
> (See 
> http://perspectives.mvdirona.com/2010/04/07/StonebrakerOnCAPTheoremAndDatabases.aspx)
> Because deletes -- at least, after the initial write has been flushed from 
> memstore -- are tombstones, deferred delete in HBase could be supported if 
> somehow tombstones could be invalidated, an undelete operation in effect. 
> This could be accomplished by adding support for tombstones for deletes. 
> Would complicate major compaction but otherwise not touch much. A typical use 
> case might be "resurrect any data deleted from _ts1_ to _ts2_ ", a period of 
> 4 hours when an application error was operative. In this case a new write 
> would be issued to the table that is a tombstone covering any deletes over 
> that period of time. Users would defer major compactions until safe 
> checkpoint periods. 
> Such guarantees could optionally be extended to the memstoe by using 
> tombstones there as well. But it would probably be sufficient to provide 
> guidance that forcing a flush is  necessary to insure edits are persisted in 
> a way that allows for undeletion.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Jean-Daniel Cryans (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995525#comment-13995525
 ] 

Jean-Daniel Cryans commented on HBASE-11143:


I'm +1 for the trunk patch too.

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Jerry He (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995280#comment-13995280
 ] 

Jerry He commented on HBASE-11136:
--

Attached a patch.
The patch also includes a missing documentation on 
'hbase.coprocessor.regionserver.classes'.

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.96.2, 0.98.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995486#comment-13995486
 ] 

Ted Yu commented on HBASE-11136:


{code}
+requirePermission("preRollLogWriter", Permission.Action.ADMIN);
{code}
Should we consider Permission.Action.CREATE as well ?

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.96.2, 0.98.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Comment Edited] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995242#comment-13995242
 ] 

Lars Hofhansl edited comment on HBASE-11143 at 5/12/14 5:22 PM:


\+1, and can we get the new metric in 0.96+?


was (Author: jdcryans):
+1, and can we get the new metric in 0.96+?

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.94.20
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995355#comment-13995355
 ] 

Jeffrey Zhong commented on HBASE-11135:
---

Great! Should I wait for you to migrate my patch on top of your latest change?

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt, 11135v5.txt, 
> 11135v5.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then if 
> async or no sync, we will just let the threads return ... updating mvcc just 
> before we let them go.  All other calls will go up on to the second ring 
> buffer to be serviced as now (batching, distribution out among the sync'ing 
> threads).  The first rb will have no friction and should turn at fast rates 
> compared to the second.  There should not be noticeable slowdown nor do I 
> foresee this refactor intefering w/ our multi-WAL plans.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11140) LocalHBaseCluster should create ConsensusProvider per each server

2014-05-12 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995270#comment-13995270
 ] 

Ted Yu commented on HBASE-11140:


lgtm

> LocalHBaseCluster should create ConsensusProvider per each server
> -
>
> Key: HBASE-11140
> URL: https://issues.apache.org/jira/browse/HBASE-11140
> Project: HBase
>  Issue Type: Sub-task
>  Components: Consensus
>Affects Versions: 0.99.0
>Reporter: Mikhail Antonov
>Assignee: Mikhail Antonov
> Fix For: 0.99.0
>
> Attachments: HBASE-11140.patch
>
>
> Right now there's a bug there when single ConsensusProvider instance is 
> shared across all threads running region servers and masters within 
> processes, which breaks certain tests in patches which used to pass 
> successfully before.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-7912) HBase Backup/Restore Based on HBase Snapshot

2014-05-12 Thread Demai Ni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-7912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995431#comment-13995431
 ] 

Demai Ni commented on HBASE-7912:
-

the ping on May 9 was lost during apache email outage. so reposted here
--
hi, folks,

We have patches for both full backup (v4) and incremental backup (v1) uploaded 
today. With that, it is easy to apply this 
[HBASE-11085-trunk-v1-contains-HBASE-10900-trunk-v4.patch|https://issues.apache.org/jira/secure/attachment/12644215/HBASE-11085-trunk-v1-contains-HBASE-10900-trunk-v4.patch]
 directly on trunk, and give it a try.

Please see the example in both incremental backup jira : 
[HBASE-11085|https://issues.apache.org/jira/browse/HBASE-11085] and fullbackup 
jira : [HBASE-10900|https://issues.apache.org/jira/browse/HBASE-10900].

We will open more jiras and patches for other features (just as, merge, 
convert, delete, history, progress) in the coming weeks.

Also, thanks for the review comments from [~tedyu], [~mbertozzi], [~stack] and 
others. We will have a few follow-up improvements about zookeeper, protobuff, 
and leveraging the new snapshot manifest

Demai

> HBase Backup/Restore Based on HBase Snapshot
> 
>
> Key: HBASE-7912
> URL: https://issues.apache.org/jira/browse/HBASE-7912
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Richard Ding
>Assignee: Richard Ding
> Attachments: HBaseBackupRestore-Jira-7912-DesignDoc-v1.pdf, 
> HBase_BackupRestore-Jira-7912-CLI-v1.pdf
>
>
> Finally, we completed the implementation of our backup/restore solution, and 
> would like to share with community through this jira. 
> We are leveraging existing hbase snapshot feature, and provide a general 
> solution to common users. Our full backup is using snapshot to capture 
> metadata locally and using exportsnapshot to move data to another cluster; 
> the incremental backup is using offline-WALplayer to backup HLogs; we also 
> leverage global distribution rolllog and flush to improve performance; other 
> added-on values such as convert, merge, progress report, and CLI commands. So 
> that a common user can backup hbase data without in-depth knowledge of hbase. 
>  Our solution also contains some usability features for enterprise users. 
> The detail design document and CLI command will be attached in this jira. We 
> plan to use 10~12 subtasks to share each of the following features, and 
> document the detail implement in the subtasks: 
> * *Full Backup* : provide local and remote back/restore for a list of tables
> * *offline-WALPlayer* to convert HLog to HFiles offline (for incremental 
> backup)
> * *distributed* Logroll and distributed flush 
> * Backup *Manifest* and history
> * *Incremental* backup: to build on top of full backup as daily/weekly backup 
> * *Convert*  incremental backup WAL files into hfiles
> * *Merge* several backup images into one(like merge weekly into monthly)
> * *add and remove* table to and from Backup image
> * *Cancel* a backup process
> * backup progress *status*
> * full backup based on *existing snapshot*
> *-*
> *Below is the original description, to keep here as the history for the 
> design and discussion back in 2013*
> There have been attempts in the past to come up with a viable HBase 
> backup/restore solution (e.g., HBASE-4618).  Recently, there are many 
> advancements and new features in HBase, for example, FileLink, Snapshot, and 
> Distributed Barrier Procedure. This is a proposal for a backup/restore 
> solution that utilizes these new features to achieve better performance and 
> consistency. 
>  
> A common practice of backup and restore in database is to first take full 
> baseline backup, and then periodically take incremental backup that capture 
> the changes since the full baseline backup. HBase cluster can store massive 
> amount data.  Combination of full backups with incremental backups has 
> tremendous benefit for HBase as well.  The following is a typical scenario 
> for full and incremental backup.
> # The user takes a full backup of a table or a set of tables in HBase. 
> # The user schedules periodical incremental backups to capture the changes 
> from the full backup, or from last incremental backup.
> # The user needs to restore table data to a past point of time.
> # The full backup is restored to the table(s) or to different table name(s).  
> Then the incremental backups that are up to the desired point in time are 
> applied on top of the full backup. 
> We would support the following key features and capabilities.
> * Full backup uses HBase snapshot to capture HFiles.
> * Use HBase WALs to capture incremental changes, but we use bulk load of 
> HFiles for

[jira] [Updated] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-11143:
--

Attachment: 11143-trunk.txt

Trunk patch.

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11135) Change region sequenceid generation so happens earlier in the append cycle rather than just before added to file

2014-05-12 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995267#comment-13995267
 ] 

Jeffrey Zhong commented on HBASE-11135:
---

{quote}
One thought I had was could you update the mvcc in the ring buffer consumer 
thread just after we up the region edit/sequence id? If so, I could undo the 
latching.
{quote}
Yes, I can do that. Today I'll migrate my patch and run tests to see if there 
is any issue.

> Change region sequenceid generation so happens earlier in the append cycle 
> rather than just before added to file
> 
>
> Key: HBASE-11135
> URL: https://issues.apache.org/jira/browse/HBASE-11135
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Reporter: stack
>Assignee: stack
> Attachments: 11135.wip.txt, 11135v2.txt, 11135v5.txt, 11135v5.txt, 
> 11135v5.txt
>
>
> Currently we assign the region edit/sequence id just before we put it in the 
> WAL.  We do it in the single thread that feeds from the ring buffer.  Doing 
> it at this point, we can ensure order, that the edits will be in the file in 
> accordance w/ the ordering of the region sequence id.
> But the point at which region sequence id is assigned an edit is deep down in 
> the WAL system and there is a lag between our putting an edit into the WAL 
> system and the edit actually getting its edit/sequence id.
> This lag -- "late-binding" -- complicates the unification of mvcc and region 
> sequence id, especially around async WAL writes (and, related, for no-WAL 
> writes) -- the parent for this issue (For async, how you get the edit id in 
> our system when the threads have all gone home -- unless you make them wait?)
> Chatting w/ Jeffrey Zhong yesterday, we came up with a crazypants means of 
> getting the region sequence id near-immediately.  We'll run two ringbuffers.  
> The first will mesh all handler threads and the consumer will generate ids 
> (we will have order on other side of this first ring buffer), and then if 
> async or no sync, we will just let the threads return ... updating mvcc just 
> before we let them go.  All other calls will go up on to the second ring 
> buffer to be serviced as now (batching, distribution out among the sync'ing 
> threads).  The first rb will have no friction and should turn at fast rates 
> compared to the second.  There should not be noticeable slowdown nor do I 
> foresee this refactor intefering w/ our multi-WAL plans.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-7958) Statistics per-column family per-region

2014-05-12 Thread Jesse Yates (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-7958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995304#comment-13995304
 ] 

Jesse Yates commented on HBASE-7958:


[~apurtell] originally this was going to be done to support phoenix. so they 
could get a better idea of distribution of keys to do better region chunking. 
In phoenix we decided even sized chunks would work well enough, so the impetus 
to get this done fizzled a bit.

Other reasons for fizzle:
 * do we just expose the metric for things like otsdb? or just store it in 
HBase in our own format? both?
 * do we need to a UI components?
 * everyone wants all the metrics
 * should this even be part of core and instead just done via a scan-time 
coprocessor?

Those considerations aside, I think it still would still be great to do for 
core :)  We can always add more stats once we have a way to handle all the rest.

How to expose them is an open question (especially considering phoenix will 
want to read these stats too) - maybe a pluggable sink/use a custom tag for 
metrics2 so people can use their own sinks?

> Statistics per-column family per-region
> ---
>
> Key: HBASE-7958
> URL: https://issues.apache.org/jira/browse/HBASE-7958
> Project: HBase
>  Issue Type: New Feature
>Affects Versions: 0.95.2
>Reporter: Jesse Yates
>Assignee: Jesse Yates
> Attachments: hbase-7958-v0-parent.patch, hbase-7958-v0.patch, 
> hbase-7958_rough-cut-v0.patch
>
>
> Originating from this discussion on the dev list: 
> http://search-hadoop.com/m/coDKU1urovS/Simple+stastics+per+region/v=plain
> Essentially, we should have built-in statistics gathering for HBase tables. 
> This allows clients to have a better understanding of the distribution of 
> keys within a table and a given region. We could also surface this 
> information via the UI.
> There are a couple different proposals from the email, the overview is this:
> We add in something on compactions that gathers stats about the keys that are 
> written and then we surface them to a table.
> The possible proposals include:
> *How to implement it?*
> # Coprocessors - 
> ** advantage - it easily plugs in and people could pretty easily add their 
> own statistics. 
> ** disadvantage - UI elements would also require this, we get into dependent 
> loading, which leads down the OSGi path. Also, these CPs need to be installed 
> _after_ all the other CPs on compaction to ensure they see exactly what gets 
> written (doable, but a pain)
> # Built into HBase as a custom scanner
> ** advantage - always goes in the right place and no need to muck about with 
> loading CPs etc.
> ** disadvantage - less pluggable, at least for the initial cut
> *Where do we store data?*
> # .META.
> ** advantage - its an existing table, so we can jam it into another CF there
> ** disadvantage - this would make META much larger, possibly leading to 
> splits AND will make it much harder for other processes to read the info
> # A new stats table
> ** advantage - cleanly separates out the information from META
> ** disadvantage - should use a 'system table' idea to prevent accidental 
> deletion, manipulation by arbitrary clients, but still allow clients to read 
> it.
> Once we have this framework, we can then move to an actual implementation of 
> various statistics.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11136) Add permission check to roll WAL writer

2014-05-12 Thread Jerry He (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jerry He updated HBASE-11136:
-

Attachment: HBASE-11136-trunk-v1.patch

> Add permission check to roll WAL writer 
> 
>
> Key: HBASE-11136
> URL: https://issues.apache.org/jira/browse/HBASE-11136
> Project: HBase
>  Issue Type: Improvement
>  Components: regionserver, security
>Affects Versions: 0.96.2, 0.98.2
>Reporter: Jerry He
>Assignee: Jerry He
>Priority: Minor
> Fix For: 0.99.0
>
> Attachments: HBASE-11136-trunk-v1.patch
>
>
> Currently HBase provides HBaseAdmin.rollHLogWriter() and shell command to 
> roll WAL on a region server. But no permission check is done on this 
> operation in a secure cluster.
> We need to add permission check to prevent un-authorized user from running 
> this operation. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11145) Issue with HLog sync

2014-05-12 Thread Enis Soztutar (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Enis Soztutar updated HBASE-11145:
--

Fix Version/s: 0.99.0

> Issue with HLog sync
> 
>
> Key: HBASE-11145
> URL: https://issues.apache.org/jira/browse/HBASE-11145
> Project: HBase
>  Issue Type: Bug
>Reporter: Anoop Sam John
>Assignee: stack
> Fix For: 0.99.0
>
>
> Got the below Exceptions Log in case of a write heavy test
> {code}
> 2014-05-07 11:29:56,417 ERROR [main.append-pool1-t1] 
> wal.FSHLog$RingBufferEventHandler(1882): UNEXPECTED!!!
> java.lang.IllegalStateException: Queue full
>  at java.util.AbstractQueue.add(Unknown Source)
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$SyncRunner.offer(FSHLog.java:1227)
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1878)
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1)
>  at com.lmax.disruptor.BatchEventProcessor.run(BatchEventProcessor.java:133)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>  at java.lang.Thread.run(Unknown Source)
> 2014-05-07 11:29:56,418 ERROR [main.append-pool1-t1] 
> wal.FSHLog$RingBufferEventHandler(1882): UNEXPECTED!!!
> java.lang.ArrayIndexOutOfBoundsException: 5
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1838)
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1)
>  at com.lmax.disruptor.BatchEventProcessor.run(BatchEventProcessor.java:133)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>  at java.lang.Thread.run(Unknown Source)
> 2014-05-07 11:29:56,419 ERROR [main.append-pool1-t1] 
> wal.FSHLog$RingBufferEventHandler(1882): UNEXPECTED!!!
> java.lang.ArrayIndexOutOfBoundsException: 6
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1838)
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1)
>  at com.lmax.disruptor.BatchEventProcessor.run(BatchEventProcessor.java:133)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>  at java.lang.Thread.run(Unknown Source)
> 2014-05-07 11:29:56,419 ERROR [main.append-pool1-t1] 
> wal.FSHLog$RingBufferEventHandler(1882): UNEXPECTED!!!
> java.lang.ArrayIndexOutOfBoundsException: 7
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1838)
>  at 
> org.apache.hadoop.hbase.regionserver.wal.FSHLog$RingBufferEventHandler.onEvent(FSHLog.java:1)
>  at com.lmax.disruptor.BatchEventProcessor.run(BatchEventProcessor.java:133)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(Unknown Source)
>  at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)
>  at java.lang.Thread.run(Unknown Source)
>  {code}
> In FSHLog$SyncRunner.offer we do BlockingQueue.add() which throws Exception 
> as it is full. The problem is the below shown catch() we do not do any 
> cleanup.
> {code}
> this.syncRunners[index].offer(sequence, this.syncFutures, 
> this.syncFuturesCount);
> attainSafePoint(sequence);
> this.syncFuturesCount = 0;
>   } catch (Throwable t) {
> LOG.error("UNEXPECTED!!!", t);
>   }
> {code}
> syncFuturesCount is not getting reset to 0 and so the subsequent onEvent() 
> handling throws ArrayIndexOutOfBoundsException.
> I think we should do the below 
> 1. Handle the Exception and call cleanupOutstandingSyncsOnException() as in 
> other cases of Exception handling
> 2. Instead of BlockingQueue.add() use offer() (?)



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-11143:
--

Attachment: 11143-trunk.txt

Right patch this time.

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.99.0, 0.96.3, 0.94.20, 0.98.3
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt, 11143-trunk.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11129) Expose Scan conversion methods in TableMapReduceUtil as public methods

2014-05-12 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13992872#comment-13992872
 ] 

Nick Dimiduk commented on HBASE-11129:
--

Looking for some context here, I cannot find the commit that makes this 
interface private. Do you know off the top of your head 
([~yuzhih...@gmail.com], [~jdcryans], [~stack])?

> Expose Scan conversion methods in TableMapReduceUtil as public methods
> --
>
> Key: HBASE-11129
> URL: https://issues.apache.org/jira/browse/HBASE-11129
> Project: HBase
>  Issue Type: Task
>  Components: mapreduce
>Reporter: Ted Yu
>Assignee: Gustavo Anatoly
>Priority: Minor
> Attachments: HBASE-11129.patch
>
>
> Scan#readFields() from 0.92 has been removed.
> TableMapReduceUtil has the following package private methods:
> {code}
>   static String convertScanToString(Scan scan) throws IOException {
> {code}
> {code}
>   static Scan convertStringToScan(String base64) throws IOException {
> {code}
> We should consider exposing them as public methods so that user can interpret 
> Scan objects easily in mapreduce jobs.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Resolved] (HBASE-7030) Configurable whitelist for coprocessor class loader

2014-05-12 Thread Andrew Purtell (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-7030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell resolved HBASE-7030.
---

Resolution: Not a Problem

> Configurable whitelist for coprocessor class loader
> ---
>
> Key: HBASE-7030
> URL: https://issues.apache.org/jira/browse/HBASE-7030
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: 0.95.2
>Reporter: Andrew Purtell
>
> The coprocessor class loader should allow the user to supply a list of 
> packages or classes to resolve directly with the parent (RegionServer) 
> classloader.
> See HBASE-6843 for some background.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11143) ageOfLastShippedOp metric is confusing

2014-05-12 Thread Jean-Daniel Cryans (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13995242#comment-13995242
 ] 

Jean-Daniel Cryans commented on HBASE-11143:


+1, and can we get the new metric in 0.96+?

> ageOfLastShippedOp metric is confusing
> --
>
> Key: HBASE-11143
> URL: https://issues.apache.org/jira/browse/HBASE-11143
> Project: HBase
>  Issue Type: Bug
>  Components: Replication
>Reporter: Lars Hofhansl
>Assignee: Lars Hofhansl
> Fix For: 0.94.20
>
> Attachments: 11143-0.94-v2.txt, 11143-0.94.txt
>
>
> We are trying to report on replication lag and find that there is no good 
> single metric to do that.
> ageOfLastShippedOp is close, but unfortunately it is increased even when 
> there is nothing to ship on a particular RegionServer.
> I would like discuss a few options here:
> Add a new metric: replicationQueueTime (or something) with the above meaning. 
> I.e. if we have something to ship we set the age of that last shipped edit, 
> if we fail we increment that last time (just like we do now). But if there is 
> nothing to replicate we set it to current time (and hence that metric is 
> reported to close to 0).
> Alternatively we could change the meaning of ageOfLastShippedOp to mean to do 
> that. That might lead to surprises, but the current behavior is clearly weird 
> when there is nothing to replicate.
> Comments? [~jdcryans], [~stack].
> If approach sounds good, I'll make a patch for all branches.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


  1   2   >