[jira] [Created] (HBASE-25026) Create a metric to track scans that have no start row and/or stop row

2020-09-14 Thread ramkrishna.s.vasudevan (Jira)
ramkrishna.s.vasudevan created HBASE-25026:
--

 Summary: Create a metric to track scans that have no start row 
and/or stop row
 Key: HBASE-25026
 URL: https://issues.apache.org/jira/browse/HBASE-25026
 Project: HBase
  Issue Type: Improvement
Affects Versions: 3.0.0-alpha-1, 2.4.0
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan


A metric that indicates how many of the scan requests were without start row 
and/or stop row. Generally such queries may be wrongly written or may require 
better schema design and those may be some queries doing some sanity check to 
verify if their actual application logic has done the necessary updates and the 
all the expected rows are processed. 

We do have some logs at the RPC layer to see what queries take time but nothing 
as a metric. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


Re: [DISCUSS] Snapshot meta table or system tables

2020-09-14 Thread Nick Dimiduk
On Mon, Sep 14, 2020 at 19:24 Sean Busbey  wrote:

> meta isn't the only system table and not all system tables have a
>
> representation in HDFS. i.e. the ACL table.
>

+1

On Mon, Sep 14, 2020 at 5:13 PM Nick Dimiduk  wrote:
>
> >
>
> > Is it really a good idea to restore to a previous copy of meta? Wouldn't
> it
>
> > be better to fix meta using hbck2? The only thing we know for certain
> about
>
> > a past meta state is it's in the past.
>
> >
>
> > On Sat, Sep 12, 2020 at 9:27 AM Sean Busbey  wrote:
>
> >
>
> > > Now that we have snapshots with TTLs, it would be helpful if we had
>
> > > automated periodic snapshots of system tables that e.g. hbck2 could
> use to
>
> > > do a forcible restore in cases of severe system damage.
>
> > >
>
> > > On Sat, Sep 12, 2020, 08:49 张铎(Duo Zhang) 
> wrote:
>
> > >
>
> > > > When reviewing the snapshot related code, I found that we have some
> code
>
> > > > for treating meta table. For example:
>
> > > >
>
> > > >
>
> > > >
>
> > >
> https://github.com/apache/hbase/blob/e5ca9adc54f9f580f85d21d38217afa97aa79d68/hbase-server/src/main/java/org/apache/hadoop/hbase/master/snapshot/TakeSnapshotHandler.java#L195
>
> > > >
>
> > > > But when I tried to actually snapshot meta from client(in a UT), I
> got
>
> > > the
>
> > > > following error message:
>
> > > >
>
> > > > System table snapshots are not allowed
>
> > > >
>
> > > > It is thrown from this method:
>
> > > >
>
> > > >
>
> > > >
>
> > >
> https://github.com/apache/hbase/blob/e5ca9adc54f9f580f85d21d38217afa97aa79d68/hbase-client/src/main/java/org/apache/hadoop/hbase/snapshot/ClientSnapshotDescriptionUtils.java#L42
>
> > > >
>
> > > > So I'm wondering whether we want to support snapshotting system
> tables or
>
> > > > not.
>
> > > >
>
> > > > In general, at least for meta table, we can not disable it nor
> delete it,
>
> > > > so even if we could support snapshotting it, we can not restore the
>
> > > > snapshot...
>
> > > >
>
> > > > Thoughts? Thanks.
>
> > > >
>
> > >
>
>
>
>
>
>
>
> --
>
> Sean
>
>


Re: [DISCUSS] Snapshot meta table or system tables

2020-09-14 Thread Sean Busbey
meta isn't the only system table and not all system tables have a
representation in HDFS. i.e. the ACL table.

On Mon, Sep 14, 2020 at 5:13 PM Nick Dimiduk  wrote:
>
> Is it really a good idea to restore to a previous copy of meta? Wouldn't it
> be better to fix meta using hbck2? The only thing we know for certain about
> a past meta state is it's in the past.
>
> On Sat, Sep 12, 2020 at 9:27 AM Sean Busbey  wrote:
>
> > Now that we have snapshots with TTLs, it would be helpful if we had
> > automated periodic snapshots of system tables that e.g. hbck2 could use to
> > do a forcible restore in cases of severe system damage.
> >
> > On Sat, Sep 12, 2020, 08:49 张铎(Duo Zhang)  wrote:
> >
> > > When reviewing the snapshot related code, I found that we have some code
> > > for treating meta table. For example:
> > >
> > >
> > >
> > https://github.com/apache/hbase/blob/e5ca9adc54f9f580f85d21d38217afa97aa79d68/hbase-server/src/main/java/org/apache/hadoop/hbase/master/snapshot/TakeSnapshotHandler.java#L195
> > >
> > > But when I tried to actually snapshot meta from client(in a UT), I got
> > the
> > > following error message:
> > >
> > > System table snapshots are not allowed
> > >
> > > It is thrown from this method:
> > >
> > >
> > >
> > https://github.com/apache/hbase/blob/e5ca9adc54f9f580f85d21d38217afa97aa79d68/hbase-client/src/main/java/org/apache/hadoop/hbase/snapshot/ClientSnapshotDescriptionUtils.java#L42
> > >
> > > So I'm wondering whether we want to support snapshotting system tables or
> > > not.
> > >
> > > In general, at least for meta table, we can not disable it nor delete it,
> > > so even if we could support snapshotting it, we can not restore the
> > > snapshot...
> > >
> > > Thoughts? Thanks.
> > >
> >



-- 
Sean


Re: [DISCUSS] Snapshot meta table or system tables

2020-09-14 Thread Nick Dimiduk
Is it really a good idea to restore to a previous copy of meta? Wouldn't it
be better to fix meta using hbck2? The only thing we know for certain about
a past meta state is it's in the past.

On Sat, Sep 12, 2020 at 9:27 AM Sean Busbey  wrote:

> Now that we have snapshots with TTLs, it would be helpful if we had
> automated periodic snapshots of system tables that e.g. hbck2 could use to
> do a forcible restore in cases of severe system damage.
>
> On Sat, Sep 12, 2020, 08:49 张铎(Duo Zhang)  wrote:
>
> > When reviewing the snapshot related code, I found that we have some code
> > for treating meta table. For example:
> >
> >
> >
> https://github.com/apache/hbase/blob/e5ca9adc54f9f580f85d21d38217afa97aa79d68/hbase-server/src/main/java/org/apache/hadoop/hbase/master/snapshot/TakeSnapshotHandler.java#L195
> >
> > But when I tried to actually snapshot meta from client(in a UT), I got
> the
> > following error message:
> >
> > System table snapshots are not allowed
> >
> > It is thrown from this method:
> >
> >
> >
> https://github.com/apache/hbase/blob/e5ca9adc54f9f580f85d21d38217afa97aa79d68/hbase-client/src/main/java/org/apache/hadoop/hbase/snapshot/ClientSnapshotDescriptionUtils.java#L42
> >
> > So I'm wondering whether we want to support snapshotting system tables or
> > not.
> >
> > In general, at least for meta table, we can not disable it nor delete it,
> > so even if we could support snapshotting it, we can not restore the
> > snapshot...
> >
> > Thoughts? Thanks.
> >
>


[jira] [Resolved] (HBASE-25002) Create simple pattern matching query for retrieving metrics matching the pattern

2020-09-14 Thread ramkrishna.s.vasudevan (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-25002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan resolved HBASE-25002.

Hadoop Flags: Reviewed
  Resolution: Fixed

> Create simple pattern matching query for retrieving metrics matching the 
> pattern
> 
>
> Key: HBASE-25002
> URL: https://issues.apache.org/jira/browse/HBASE-25002
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: 2.2.5
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.4.0
>
>
> Currently we allow querying a metric with a specific metric name. But 
> generally say under the MBean sub=Tables under a server we might have 'n' 
> number of tables and we might be interested in a specific metric across all 
> tables. 
> So in such cases giving a simple pattern based query will help so that we can 
> retrieve all metrics that follow that pattern.
> The other side effect is that we can also reduce the size of the json we pull 
> from the server to the querying client. Which are generally smaller scripts 
> which might not be able to process the big sized response JSON.
> Thanks [~anoopsamjohn] for the suggestion. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (HBASE-25022) Remove 'hbase.testing.nocluster' config

2020-09-14 Thread Duo Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-25022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang resolved HBASE-25022.
---
Hadoop Flags: Reviewed
  Resolution: Fixed

Pushed to master and branch-2.

Thanks [~zghao] for reviewing.

> Remove 'hbase.testing.nocluster' config
> ---
>
> Key: HBASE-25022
> URL: https://issues.apache.org/jira/browse/HBASE-25022
> Project: HBase
>  Issue Type: Improvement
>  Components: test
>Reporter: Duo Zhang
>Assignee: Duo Zhang
>Priority: Major
> Fix For: 3.0.0-alpha-1, 2.4.0
>
>
> It is only used at two places in tests and easy to be removed, so let's just 
> remove it to make our code cleaner.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HBASE-25025) [Flaky Test][branch-1] TestFromClientSide#testCheckAndDeleteWithCompareOp

2020-09-14 Thread Reid Chan (Jira)
Reid Chan created HBASE-25025:
-

 Summary: [Flaky Test][branch-1] 
TestFromClientSide#testCheckAndDeleteWithCompareOp
 Key: HBASE-25025
 URL: https://issues.apache.org/jira/browse/HBASE-25025
 Project: HBase
  Issue Type: Improvement
Reporter: Reid Chan






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HBASE-25024) [Flake Test][branch-1] TestClientOperationInterrupt#testInterrupt50Percent

2020-09-14 Thread Reid Chan (Jira)
Reid Chan created HBASE-25024:
-

 Summary: [Flake Test][branch-1] 
TestClientOperationInterrupt#testInterrupt50Percent
 Key: HBASE-25024
 URL: https://issues.apache.org/jira/browse/HBASE-25024
 Project: HBase
  Issue Type: Improvement
Reporter: Reid Chan
 Fix For: 1.7.0


Sometimes threads could finish faster than interrupt() gets called. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (HBASE-25016) Should close ResultScanner in MetaTableAccessor.scanByRegionEncodedName

2020-09-14 Thread Viraj Jasani (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-25016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Viraj Jasani resolved HBASE-25016.
--
Resolution: Fixed

> Should close ResultScanner in MetaTableAccessor.scanByRegionEncodedName
> ---
>
> Key: HBASE-25016
> URL: https://issues.apache.org/jira/browse/HBASE-25016
> Project: HBase
>  Issue Type: Bug
>  Components: master, meta
>Reporter: Duo Zhang
>Assignee: niuyulin
>Priority: Critical
> Fix For: 3.0.0-alpha-1, 2.4.0, 2.3.2
>
>
> It will cause resource leak.
> Th good news is that this method is only used for cluster maintainance, such 
> as assign or unassign region manually, so it is less hurt. Anyway, we should 
> fix it ASAP.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (HBASE-25023) NPE while shutdown master node

2020-09-14 Thread lujie (Jira)
lujie created HBASE-25023:
-

 Summary: NPE while shutdown master node
 Key: HBASE-25023
 URL: https://issues.apache.org/jira/browse/HBASE-25023
 Project: HBase
  Issue Type: Bug
Reporter: lujie


while shutdown the master node, we can see the exception:
{code:java}
2020-09-14 06:48:29,530 ERROR [PEWorker-16] procedure2.ProcedureExecutor: 
CODE-BUG: Uncaught runtime exception: pid=111, ppid=64, state=RUNNABLE, 
locked=true; org.apache.hadoop.hbase.master.assignment.OpenRegionProcedure
java.lang.NullPointerException
at 
org.apache.hadoop.hbase.master.assignment.RegionRemoteProcedureBase.execute(RegionRemoteProcedureBase.java:276)
at 
org.apache.hadoop.hbase.master.assignment.RegionRemoteProcedureBase.execute(RegionRemoteProcedureBase.java:58)
at 
org.apache.hadoop.hbase.procedure2.Procedure.doExecute(Procedure.java:962)
at 
org.apache.hadoop.hbase.procedure2.ProcedureExecutor.execProcedure(ProcedureExecutor.java:1648)
at 
org.apache.hadoop.hbase.procedure2.ProcedureExecutor.executeProcedure(ProcedureExecutor.java:1395)
at 
org.apache.hadoop.hbase.procedure2.ProcedureExecutor.access$1100(ProcedureExecutor.java:78)
at 
org.apache.hadoop.hbase.procedure2.ProcedureExecutor$WorkerThread.run(ProcedureExecutor.java:1965)
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)