[jira] [Assigned] (HBASE-23864) No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge is disabled

2020-02-18 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23864:
--

Assignee: Guanghao Zhang

> No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when 
> split/merge is disabled
> ---
>
> Key: HBASE-23864
> URL: https://issues.apache.org/jira/browse/HBASE-23864
> Project: HBase
>  Issue Type: Improvement
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Attachments: HBASE-23864.PNG
>
>
> Currently, even when split/merge is disabled, the master still submits a 
> SplitTableRegionProcedure and then rolls it back when execution fails. As I 
> understand it, the split/merge switch is a cluster-level switch. The master 
> can check it early and skip submitting a 
> SplitTableRegionProcedure/MergeTableRegionsProcedure when the split/merge 
> switch is disabled.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-23864) No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge is disabled

2020-02-18 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23864:
---
Summary: No need to submit 
SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge is 
disabled  (was: No need to submit 
SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge switch is 
disabled)

> No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when 
> split/merge is disabled
> ---
>
> Key: HBASE-23864
> URL: https://issues.apache.org/jira/browse/HBASE-23864
> Project: HBase
>  Issue Type: Improvement
>Reporter: Guanghao Zhang
>Priority: Major
> Attachments: HBASE-23864.PNG
>
>
> Currently, even when split/merge is disabled, the master still submits a 
> SplitTableRegionProcedure and then rolls it back when execution fails. As I 
> understand it, the split/merge switch is a cluster-level switch. The master 
> can check it early and skip submitting a 
> SplitTableRegionProcedure/MergeTableRegionsProcedure when the split/merge 
> switch is disabled.





[jira] [Updated] (HBASE-23864) No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge switch is disabled

2020-02-17 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23864:
---
Attachment: HBASE-23864.PNG

> No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when 
> split/merge switch is disabled
> --
>
> Key: HBASE-23864
> URL: https://issues.apache.org/jira/browse/HBASE-23864
> Project: HBase
>  Issue Type: Improvement
>Reporter: Guanghao Zhang
>Priority: Major
> Attachments: HBASE-23864.PNG
>
>
> Currently, even when split/merge is disabled, the master still submits a 
> SplitTableRegionProcedure and then rolls it back when execution fails. As I 
> understand it, the split/merge switch is a cluster-level switch. The master 
> can check it early and skip submitting a 
> SplitTableRegionProcedure/MergeTableRegionsProcedure when the split/merge 
> switch is disabled.





[jira] [Created] (HBASE-23864) No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge switch is disabled

2020-02-17 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23864:
--

 Summary: No need to submit 
SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge switch is 
disabled
 Key: HBASE-23864
 URL: https://issues.apache.org/jira/browse/HBASE-23864
 Project: HBase
  Issue Type: Bug
Reporter: Guanghao Zhang


Currently, even when split/merge is disabled, the master still submits a 
SplitTableRegionProcedure and then rolls it back when execution fails. As I 
understand it, the split/merge switch is a cluster-level switch. The master can 
check it early and skip submitting a 
SplitTableRegionProcedure/MergeTableRegionsProcedure when the split/merge 
switch is disabled.
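
A minimal sketch of the early check described above. It is a simplified model, 
not HBase code: SwitchType, MasterSketch and the string "procedures" are 
hypothetical names used only for illustration. The point is that the switch is 
consulted before anything is queued, so no procedure has to be rolled back.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for the cluster-level split/merge switch. */
enum SwitchType { SPLIT, MERGE }

/** Hypothetical, simplified model of a master that queues procedures. */
final class MasterSketch {
  private boolean splitEnabled = true;
  private boolean mergeEnabled = true;
  private final List<String> submitted = new ArrayList<>();

  void setSwitch(SwitchType type, boolean enabled) {
    if (type == SwitchType.SPLIT) {
      splitEnabled = enabled;
    } else {
      mergeEnabled = enabled;
    }
  }

  boolean isEnabled(SwitchType type) {
    return type == SwitchType.SPLIT ? splitEnabled : mergeEnabled;
  }

  /** Rejects the request up front instead of queueing work that would roll back. */
  long submitSplit(String regionName) {
    if (!isEnabled(SwitchType.SPLIT)) {
      throw new IllegalStateException("Split is disabled; nothing submitted for " + regionName);
    }
    submitted.add("SplitTableRegionProcedure:" + regionName);
    return submitted.size(); // stand-in for a procedure id
  }

  /** Same early check for merges. */
  long submitMerge(String regionA, String regionB) {
    if (!isEnabled(SwitchType.MERGE)) {
      throw new IllegalStateException("Merge is disabled; nothing submitted");
    }
    submitted.add("MergeTableRegionsProcedure:" + regionA + "," + regionB);
    return submitted.size();
  }

  List<String> submittedProcedures() {
    return submitted;
  }
}
```

With the switch off, submitSplit fails fast and submittedProcedures() stays 
unchanged, which is the behaviour this issue asks for.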





[jira] [Updated] (HBASE-23864) No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when split/merge switch is disabled

2020-02-17 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23864:
---
Issue Type: Improvement  (was: Bug)

> No need to submit SplitTableRegionProcedure/MergeTableRegionsProcedure when 
> split/merge switch is disabled
> --
>
> Key: HBASE-23864
> URL: https://issues.apache.org/jira/browse/HBASE-23864
> Project: HBase
>  Issue Type: Improvement
>Reporter: Guanghao Zhang
>Priority: Major
>
> Currently, even when split/merge is disabled, the master still submits a 
> SplitTableRegionProcedure and then rolls it back when execution fails. As I 
> understand it, the split/merge switch is a cluster-level switch. The master 
> can check it early and skip submitting a 
> SplitTableRegionProcedure/MergeTableRegionsProcedure when the split/merge 
> switch is disabled.





[jira] [Resolved] (HBASE-23859) Modify "Block locality" of RegionServer Web UI to human readable percentage

2020-02-17 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23859.

Fix Version/s: 2.1.10
   2.2.4
   2.3.0
   3.0.0
   Resolution: Fixed

Pushed to branch-2.1+. Thanks [~GeorryHuang] for contributing.

> Modify "Block locality" of RegionServer Web UI to human readable percentage
> ---
>
> Key: HBASE-23859
> URL: https://issues.apache.org/jira/browse/HBASE-23859
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: 3.0.0, 2.3.0, 2.2.4, 2.1.10
>
> Attachments: image-2020-02-17-20-36-50-084.png, 
> image-2020-02-17-20-38-52-990.png
>
>
> The "Block locality" value in the Web UI is currently displayed as in picture 1:
> !image-2020-02-17-20-36-50-084.png!
> I think we should display it as a percentage, as in picture 2:
> !image-2020-02-17-20-38-52-990.png!
>  
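
A small sketch of the kind of conversion proposed here. It assumes the raw 
locality value is a fraction between 0 and 1, which this thread does not state; 
LocalityFormat and asPercent are hypothetical names, not part of the HBase 
Web UI code.

```java
import java.util.Locale;

/** Hypothetical helper; assumes locality is reported as a fraction in [0, 1]. */
final class LocalityFormat {
  /** Formats a fraction as a percentage with one decimal place. */
  static String asPercent(double fraction) {
    // Locale.ROOT keeps the decimal separator stable across server locales.
    return String.format(Locale.ROOT, "%.1f%%", fraction * 100.0);
  }
}
```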





[jira] [Resolved] (HBASE-23855) Change bytes size to human readable size for Server Metrics of RegionServer Web UI

2020-02-16 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23855.

Resolution: Fixed

Pushed to master and branch-2. Thanks [~GeorryHuang] for contributing.

> Change bytes size to human readable size for Server Metrics of RegionServer 
> Web UI
> --
>
> Key: HBASE-23855
> URL: https://issues.apache.org/jira/browse/HBASE-23855
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: 3.0.0, 2.3.0
>
> Attachments: image-2020-02-16-19-23-39-432.png, 
> image-2020-02-16-19-28-23-449.png
>
>
> I found that the “BytesBufferAllocator Status” section in the RegionServer 
> Web UI still uses "Bytes" as a fixed unit. I think we should use "MB" or "GB" 
> when the size is large. !image-2020-02-16-19-23-39-432.png!
>  
> The Web UI after improvement:
> !image-2020-02-16-19-28-23-449.png!
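
A minimal sketch of a bytes-to-readable-size conversion of the kind described 
above. HumanReadableSize is a hypothetical helper written for illustration; it 
is not the code that was committed for this issue, and it uses binary units 
(1 KB = 1024 bytes) as an assumption.

```java
import java.util.Locale;

/** Hypothetical helper that renders a byte count with a binary unit suffix. */
final class HumanReadableSize {
  private static final String[] UNITS = {"B", "KB", "MB", "GB", "TB", "PB", "EB"};

  static String format(long bytes) {
    if (bytes < 0) {
      throw new IllegalArgumentException("bytes must be non-negative: " + bytes);
    }
    if (bytes < 1024) {
      return bytes + " B"; // small values stay exact
    }
    double value = bytes;
    int unit = 0;
    // Divide by 1024 until the value fits the next unit.
    while (value >= 1024 && unit < UNITS.length - 1) {
      value /= 1024;
      unit++;
    }
    return String.format(Locale.ROOT, "%.1f %s", value, UNITS[unit]);
  }
}
```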





[jira] [Updated] (HBASE-23855) Change bytes size to human readable size for Server Metrics of RegionServer Web UI

2020-02-16 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23855:
---
Fix Version/s: (was: master)
   2.3.0
   3.0.0

> Change bytes size to human readable size for Server Metrics of RegionServer 
> Web UI
> --
>
> Key: HBASE-23855
> URL: https://issues.apache.org/jira/browse/HBASE-23855
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: 3.0.0, 2.3.0
>
> Attachments: image-2020-02-16-19-23-39-432.png, 
> image-2020-02-16-19-28-23-449.png
>
>
> I found that the “BytesBufferAllocator Status” section in the RegionServer 
> Web UI still uses "Bytes" as a fixed unit. I think we should use "MB" or "GB" 
> when the size is large. !image-2020-02-16-19-23-39-432.png!
>  
> The Web UI after improvement:
> !image-2020-02-16-19-28-23-449.png!





[jira] [Resolved] (HBASE-23822) Fix typo in procedures.jsp

2020-02-09 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23822.

Resolution: Fixed

Pushed to master, branch-2 and branch-2.2. Thanks [~GeorryHuang] for 
contributing.

> Fix typo in procedures.jsp
> --
>
> Key: HBASE-23822
> URL: https://issues.apache.org/jira/browse/HBASE-23822
> Project: HBase
>  Issue Type: Improvement
>  Components: website
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: master
>
> Attachments: procedures.jsp
>
>
> I think there's a typo in 
> hbase-server/src/main/resources/hbase-webapps/master/procedures.jsp. The 
> sentence "We do not list Procedures that have completed SUCCESSfully; their 
> number makes it hard to spot the problematics." is misspelled and should be 
> corrected to "We do not list procedures that have completed successfully; 
> their number makes it hard to spot the problematics."





[jira] [Updated] (HBASE-23822) Fix typo in procedures.jsp

2020-02-09 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23822:
---
Fix Version/s: (was: master)
   2.2.4
   2.3.0
   3.0.0

> Fix typo in procedures.jsp
> --
>
> Key: HBASE-23822
> URL: https://issues.apache.org/jira/browse/HBASE-23822
> Project: HBase
>  Issue Type: Improvement
>  Components: website
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: 3.0.0, 2.3.0, 2.2.4
>
> Attachments: procedures.jsp
>
>
> I think there's a typo in 
> hbase-server/src/main/resources/hbase-webapps/master/procedures.jsp. The 
> sentence "We do not list Procedures that have completed SUCCESSfully; their 
> number makes it hard to spot the problematics." is misspelled and should be 
> corrected to "We do not list procedures that have completed successfully; 
> their number makes it hard to spot the problematics."





[jira] [Updated] (HBASE-23822) Fix typo in procedures.jsp

2020-02-09 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23822:
---
Issue Type: Improvement  (was: Bug)

> Fix typo in procedures.jsp
> --
>
> Key: HBASE-23822
> URL: https://issues.apache.org/jira/browse/HBASE-23822
> Project: HBase
>  Issue Type: Improvement
>  Components: website
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: master
>
> Attachments: procedures.jsp
>
>
> I think there's a typo in 
> hbase-server/src/main/resources/hbase-webapps/master/procedures.jsp. The 
> sentence "We do not list Procedures that have completed SUCCESSfully; their 
> number makes it hard to spot the problematics." is misspelled and should be 
> corrected to "We do not list procedures that have completed successfully; 
> their number makes it hard to spot the problematics."





[jira] [Assigned] (HBASE-23822) Fix typo in procedures.jsp

2020-02-09 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23822:
--

Assignee: Zhuoyue Huang

> Fix typo in procedures.jsp
> --
>
> Key: HBASE-23822
> URL: https://issues.apache.org/jira/browse/HBASE-23822
> Project: HBase
>  Issue Type: Bug
>  Components: website
>Affects Versions: master
>Reporter: Zhuoyue Huang
>Assignee: Zhuoyue Huang
>Priority: Trivial
> Fix For: master
>
> Attachments: procedures.jsp
>
>
> I think there's a typo in 
> hbase-server/src/main/resources/hbase-webapps/master/procedures.jsp. The 
> sentence "We do not list Procedures that have completed SUCCESSfully; their 
> number makes it hard to spot the problematics." is misspelled and should be 
> corrected to "We do not list procedures that have completed successfully; 
> their number makes it hard to spot the problematics."





[jira] [Commented] (HBASE-21110) Remove `Illegal reflective access` warning for `java.nio.Bits.unaligned()`

2020-02-02 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-21110?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028313#comment-17028313
 ] 

Guanghao Zhang commented on HBASE-21110:


Port the Spark fix here.

> Remove `Illegal reflective access` warning for `java.nio.Bits.unaligned()`
> --
>
> Key: HBASE-21110
> URL: https://issues.apache.org/jira/browse/HBASE-21110
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Mike Drob
>Assignee: Guanghao Zhang
>Priority: Major
>  Labels: jdk11
> Fix For: 3.0.0
>
>
> Using Java 11 RC 1, I get the following warning. We probably need to add the 
> suggested flag to our scripts?
> {noformat}
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ java -version
> java version "11" 2018-09-25
> Java(TM) SE Runtime Environment 18.9 (build 11+28)
> Java HotSpot(TM) 64-Bit Server VM 18.9 (build 11+28, mixed mode)
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ bin/start-hbase.sh
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ cat 
> /Users/mdrob/IdeaProjects/hbase/bin/../logs/hbase-mdrob-master-mdrob-MBP.local.out
> WARNING: An illegal reflective access operation has occurred
> WARNING: Illegal reflective access by 
> org.apache.hadoop.hbase.util.UnsafeAvailChecker 
> (file:/Users/mdrob/IdeaProjects/hbase/hbase-common/target/hbase-common-3.0.0-SNAPSHOT.jar)
>  to method java.nio.Bits.unaligned()
> WARNING: Please consider reporting this to the maintainers of 
> org.apache.hadoop.hbase.util.UnsafeAvailChecker
> WARNING: Use --illegal-access=warn to enable warnings of further illegal 
> reflective access operations
> WARNING: All illegal access operations will be denied in a future release
> {noformat}





[jira] [Assigned] (HBASE-21110) Remove `Illegal reflective access` warning for `java.nio.Bits.unaligned()`

2020-02-02 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-21110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-21110:
--

Assignee: Guanghao Zhang

> Remove `Illegal reflective access` warning for `java.nio.Bits.unaligned()`
> --
>
> Key: HBASE-21110
> URL: https://issues.apache.org/jira/browse/HBASE-21110
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Mike Drob
>Assignee: Guanghao Zhang
>Priority: Major
>  Labels: jdk11
> Fix For: 3.0.0
>
>
> Using Java 11 RC 1, I get the following warning. We probably need to add the 
> suggested flag to our scripts?
> {noformat}
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ java -version
> java version "11" 2018-09-25
> Java(TM) SE Runtime Environment 18.9 (build 11+28)
> Java HotSpot(TM) 64-Bit Server VM 18.9 (build 11+28, mixed mode)
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ bin/start-hbase.sh
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ cat 
> /Users/mdrob/IdeaProjects/hbase/bin/../logs/hbase-mdrob-master-mdrob-MBP.local.out
> WARNING: An illegal reflective access operation has occurred
> WARNING: Illegal reflective access by 
> org.apache.hadoop.hbase.util.UnsafeAvailChecker 
> (file:/Users/mdrob/IdeaProjects/hbase/hbase-common/target/hbase-common-3.0.0-SNAPSHOT.jar)
>  to method java.nio.Bits.unaligned()
> WARNING: Please consider reporting this to the maintainers of 
> org.apache.hadoop.hbase.util.UnsafeAvailChecker
> WARNING: Use --illegal-access=warn to enable warnings of further illegal 
> reflective access operations
> WARNING: All illegal access operations will be denied in a future release
> {noformat}





[jira] [Updated] (HBASE-21110) Remove `Illegal reflective access` warning for `java.nio.Bits.unaligned()`

2020-02-02 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-21110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-21110:
---
Summary: Remove `Illegal reflective access` warning for 
`java.nio.Bits.unaligned()`  (was: Issues with Unsafe and JDK 11)

> Remove `Illegal reflective access` warning for `java.nio.Bits.unaligned()`
> --
>
> Key: HBASE-21110
> URL: https://issues.apache.org/jira/browse/HBASE-21110
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Mike Drob
>Priority: Major
>  Labels: jdk11
> Fix For: 3.0.0
>
>
> Using Java 11 RC 1, I get the following warning. We probably need to add the 
> suggested flag to our scripts?
> {noformat}
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ java -version
> java version "11" 2018-09-25
> Java(TM) SE Runtime Environment 18.9 (build 11+28)
> Java HotSpot(TM) 64-Bit Server VM 18.9 (build 11+28, mixed mode)
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ bin/start-hbase.sh
> mdrob@mdrob-MBP:~/IdeaProjects/hbase$ cat 
> /Users/mdrob/IdeaProjects/hbase/bin/../logs/hbase-mdrob-master-mdrob-MBP.local.out
> WARNING: An illegal reflective access operation has occurred
> WARNING: Illegal reflective access by 
> org.apache.hadoop.hbase.util.UnsafeAvailChecker 
> (file:/Users/mdrob/IdeaProjects/hbase/hbase-common/target/hbase-common-3.0.0-SNAPSHOT.jar)
>  to method java.nio.Bits.unaligned()
> WARNING: Please consider reporting this to the maintainers of 
> org.apache.hadoop.hbase.util.UnsafeAvailChecker
> WARNING: Use --illegal-access=warn to enable warnings of further illegal 
> reflective access operations
> WARNING: All illegal access operations will be denied in a future release
> {noformat}





[jira] [Commented] (HBASE-22972) [JDK11] Support JDK11 LTS in HBase

2020-02-02 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028312#comment-17028312
 ] 

Guanghao Zhang commented on HBASE-22972:


When running HBase with Hadoop 2.8.5 and JDK 11, another warning is "WARNING: 
Illegal reflective access by 
org.apache.hadoop.security.authentication.util.KerberosUtil 
(file:/home/xiaohao/code/open_source/hbase/hbase-assembly/target/hbase-3.0.0-SNAPSHOT/lib/hadoop-auth-2.8.5.jar)
 to method sun.security.krb5.Config.getInstance()". We may only be able to 
support Hadoop 3.x when running with JDK 11.

> [JDK11] Support JDK11 LTS in HBase
> --
>
> Key: HBASE-22972
> URL: https://issues.apache.org/jira/browse/HBASE-22972
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Duo Zhang
>Priority: Blocker
>  Labels: jdk11
> Fix For: 3.0.0, 2.3.0
>
>
> This is an umbrella issue for tracking all the problems for JDK11 in HBase. 
> And we also rely on hadoop.





[jira] [Comment Edited] (HBASE-22972) [JDK11] Support JDK11 LTS in HBase

2020-02-01 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028262#comment-17028262
 ] 

Guanghao Zhang edited comment on HBASE-22972 at 2/2/20 3:29 AM:


The "unaligned" member is a method in JDK 8 but a field since JDK 9. It also 
seems to have a different name in JDK 9/10 and JDK 11/12.


was (Author: zghaobac):
The unaligned member is a method in JDK 8 but a field since JDK 9. It also 
seems to have a different name in JDK 9/10 and JDK 11/12.

> [JDK11] Support JDK11 LTS in HBase
> --
>
> Key: HBASE-22972
> URL: https://issues.apache.org/jira/browse/HBASE-22972
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Duo Zhang
>Priority: Blocker
>  Labels: jdk11
> Fix For: 3.0.0, 2.3.0
>
>
> This is an umbrella issue for tracking all the problems for JDK11 in HBase. 
> And we also rely on hadoop.





[jira] [Commented] (HBASE-22972) [JDK11] Support JDK11 LTS in HBase

2020-02-01 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028262#comment-17028262
 ] 

Guanghao Zhang commented on HBASE-22972:


The unaligned member is a method in JDK 8 but a field since JDK 9. It also 
seems to have a different name in JDK 9/10 and JDK 11/12.
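
A rough sketch of how the lookup could be chosen per JDK version. The member 
names used for JDK 9/10 and JDK 11+ are not given in this thread; they are 
filled in from my recollection of the Spark change (SPARK-27981) and should be 
verified against the JDK sources. UnalignedMember is a hypothetical helper, 
and the sketch only selects the member to look up on java.nio.Bits; it does 
not read the value.

```java
/** Hypothetical helper: which member of java.nio.Bits reports unaligned access. */
final class UnalignedMember {
  enum Kind { METHOD, FIELD }

  final Kind kind;
  final String name;

  private UnalignedMember(Kind kind, String name) {
    this.kind = kind;
    this.name = name;
  }

  /** Parses "1.8" to 8 and "11" or "11.0.2" to 11. */
  static int majorVersion(String specVersion) {
    String[] parts = specVersion.split("\\.");
    int first = Integer.parseInt(parts[0]);
    return (first == 1 && parts.length > 1) ? Integer.parseInt(parts[1]) : first;
  }

  /** Member names for 9/10 and 11+ are assumptions, see the note above. */
  static UnalignedMember forMajorVersion(int major) {
    if (major <= 8) {
      return new UnalignedMember(Kind.METHOD, "unaligned");
    }
    if (major <= 10) {
      return new UnalignedMember(Kind.FIELD, "unaligned");
    }
    return new UnalignedMember(Kind.FIELD, "UNALIGNED");
  }
}
```

A caller could pass System.getProperty("java.specification.version") to 
majorVersion and then branch on the returned kind.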

> [JDK11] Support JDK11 LTS in HBase
> --
>
> Key: HBASE-22972
> URL: https://issues.apache.org/jira/browse/HBASE-22972
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Duo Zhang
>Priority: Blocker
>  Labels: jdk11
> Fix For: 3.0.0, 2.3.0
>
>
> This is an umbrella issue for tracking all the problems for JDK11 in HBase. 
> And we also rely on hadoop.





[jira] [Commented] (HBASE-22972) [JDK11] Support JDK11 LTS in HBase

2020-02-01 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028255#comment-17028255
 ] 

Guanghao Zhang commented on HBASE-22972:


The warning message is "WARNING: Illegal reflective access by 
org.apache.hadoop.hbase.util.UnsafeAvailChecker 
(file:/home/zhangguanghao1/code/hbase/hbase-common/target/hbase-common-2.3.0-SNAPSHOT.jar)
 to method java.nio.Bits.unaligned()". Spark had the same problem, which was 
resolved by https://issues.apache.org/jira/browse/SPARK-27981

> [JDK11] Support JDK11 LTS in HBase
> --
>
> Key: HBASE-22972
> URL: https://issues.apache.org/jira/browse/HBASE-22972
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Duo Zhang
>Priority: Blocker
>  Labels: jdk11
> Fix For: 3.0.0, 2.3.0
>
>
> This is an umbrella issue for tracking all the problems for JDK11 in HBase. 
> And we also rely on hadoop.





[jira] [Comment Edited] (HBASE-22972) [JDK11] Support JDK11 LTS in HBase

2020-02-01 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028254#comment-17028254
 ] 

Guanghao Zhang edited comment on HBASE-22972 at 2/2/20 3:02 AM:


What does "the runtime" mean? I built a package from branch-2 and ran it with 
JDK 11. There are some warnings, but it works. I used PE to write/read 1M rows 
and it worked. The Hadoop version is 2.8.5.


was (Author: zghaobac):
What does "the runtime" mean? I built a package from branch-2 and ran it with 
JDK 11. There are some warnings, but it works. I used PE to write/read 1M rows 
and it worked.

> [JDK11] Support JDK11 LTS in HBase
> --
>
> Key: HBASE-22972
> URL: https://issues.apache.org/jira/browse/HBASE-22972
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Duo Zhang
>Priority: Blocker
>  Labels: jdk11
> Fix For: 3.0.0, 2.3.0
>
>
> This is an umbrella issue for tracking all the problems for JDK11 in HBase. 
> And we also rely on hadoop.





[jira] [Commented] (HBASE-22972) [JDK11] Support JDK11 LTS in HBase

2020-02-01 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-22972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028254#comment-17028254
 ] 

Guanghao Zhang commented on HBASE-22972:


What does "the runtime" mean? I built a package from branch-2 and ran it with 
JDK 11. There are some warnings, but it works. I used PE to write/read 1M rows 
and it worked.

> [JDK11] Support JDK11 LTS in HBase
> --
>
> Key: HBASE-22972
> URL: https://issues.apache.org/jira/browse/HBASE-22972
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Duo Zhang
>Priority: Blocker
>  Labels: jdk11
> Fix For: 3.0.0, 2.3.0
>
>
> This is an umbrella issue for tracking all the problems for JDK11 in HBase. 
> And we also rely on hadoop.





[jira] [Commented] (HBASE-23633) Find a way to handle the corrupt recovered hfiles

2020-02-01 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17028252#comment-17028252
 ] 

Guanghao Zhang commented on HBASE-23633:


[~pankajkumar] Are you working on this now?

> Find a way to handle the corrupt recovered hfiles
> -
>
> Key: HBASE-23633
> URL: https://issues.apache.org/jira/browse/HBASE-23633
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Priority: Major
>
> Copied the comments from the PR review.
>  
> If the file is a corrupt HFile, an exception will be thrown here, which will 
> cause the region to fail to open.
> Maybe we can add a new parameter to control whether to skip the exception, 
> similar to recovered edits, which has the parameter 
> "hbase.hregion.edits.replay.skip.errors".
>  
> Regions that can't be opened because of detached References or corrupt hfiles 
> are a fact of life. We need to work on this issue. This will be a new variant 
> of the problem, i.e. bad recovered hfiles.
> Adding a config to ignore bad files and just open the region is a bit 
> dangerous, as per @infraio, because it could mean silent data loss.
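
A minimal sketch of the trade-off discussed above, assuming a boolean 
skip-errors setting. RecoveredHFileLoader and HFileOpener are hypothetical 
names and no real configuration key is implied. By default a corrupt file 
fails the open; with the flag set, the file is skipped but recorded so that 
the loss is at least visible.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical loader for recovered HFiles with an opt-in skip-errors mode. */
final class RecoveredHFileLoader {
  /** Hypothetical opener; throws IOException when a file is corrupt. */
  interface HFileOpener {
    void open(String path) throws IOException;
  }

  private final boolean skipErrors;
  private final List<String> skipped = new ArrayList<>();

  RecoveredHFileLoader(boolean skipErrors) {
    this.skipErrors = skipErrors;
  }

  /** Returns the files that opened cleanly; rethrows on corruption unless skipping. */
  List<String> load(List<String> paths, HFileOpener opener) throws IOException {
    List<String> loaded = new ArrayList<>();
    for (String path : paths) {
      try {
        opener.open(path);
        loaded.add(path);
      } catch (IOException e) {
        if (!skipErrors) {
          throw e; // default: fail the region open rather than lose data silently
        }
        skipped.add(path); // keep a record so an operator can follow up
      }
    }
    return loaded;
  }

  List<String> skippedFiles() {
    return skipped;
  }
}
```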





[jira] [Updated] (HBASE-23634) Enable "Split WAL to HFile" by default

2020-01-27 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23634:
---
Parent: HBASE-23286
Issue Type: Sub-task  (was: Improvement)

> Enable "Split WAL to HFile" by default
> --
>
> Key: HBASE-23634
> URL: https://issues.apache.org/jira/browse/HBASE-23634
> Project: HBase
>  Issue Type: Sub-task
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Priority: Blocker
>






[jira] [Updated] (HBASE-23633) Find a way to handle the corrupt recovered hfiles

2020-01-27 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23633:
---
Parent: HBASE-23286
Issue Type: Sub-task  (was: Improvement)

> Find a way to handle the corrupt recovered hfiles
> -
>
> Key: HBASE-23633
> URL: https://issues.apache.org/jira/browse/HBASE-23633
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Priority: Major
>
> Copied the comments from the PR review.
>  
> If the file is a corrupt HFile, an exception will be thrown here, which will 
> cause the region to fail to open.
> Maybe we can add a new parameter to control whether to skip the exception, 
> similar to recovered edits, which has the parameter 
> "hbase.hregion.edits.replay.skip.errors".
>  
> Regions that can't be opened because of detached References or corrupt hfiles 
> are a fact of life. We need to work on this issue. This will be a new variant 
> of the problem, i.e. bad recovered hfiles.
> Adding a config to ignore bad files and just open the region is a bit 
> dangerous, as per @infraio, because it could mean silent data loss.





[jira] [Commented] (HBASE-23741) Data loss when WAL split to HFile enabled

2020-01-27 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17024202#comment-17024202
 ] 

Guanghao Zhang commented on HBASE-23741:


Sorry, I didn't run ITBLL, so there may be some cases that are not handled. I 
will make this feature stable before we release 2.3.0. Thanks for testing.

> Data loss when WAL split to HFile enabled
> -
>
> Key: HBASE-23741
> URL: https://issues.apache.org/jira/browse/HBASE-23741
> Project: HBase
>  Issue Type: Sub-task
>  Components: MTTR
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Pankaj Kumar
>Priority: Critical
>
> Very simple steps as below,
> 1. Create table with 1 region
> 2. Insert 1 record 
> 3. Flush the table 
> 4. Scan table and observe timestamp of the inserted row
> 5. Insert same row key with same timestamp as previously inserted but with 
> different value
> 6. Kill -9 RS where table region is online
> 7. Start RS
> Scan the table and check the result, latest cell must be returned.
> Thanks [~sreenivasulureddy] for finding this issue.
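
A small in-memory model of the expected outcome of the steps above. It is not 
HBase client code. CellSketch is hypothetical, and the per-write sequence 
number used to order two cells with the same row key and timestamp is a 
modelling assumption made for this illustration.

```java
import java.util.Comparator;
import java.util.List;

/** Hypothetical, simplified cell: row key, timestamp, write order and value. */
final class CellSketch {
  final String row;
  final long timestamp;
  final long sequenceId; // order in which the write arrived (assumption of this model)
  final String value;

  CellSketch(String row, long timestamp, long sequenceId, String value) {
    this.row = row;
    this.timestamp = timestamp;
    this.sequenceId = sequenceId;
    this.value = value;
  }

  /** Newest timestamp first; for equal timestamps, the later write first. */
  static final Comparator<CellSketch> NEWEST_FIRST =
      Comparator.comparingLong((CellSketch c) -> c.timestamp).reversed()
          .thenComparing(Comparator.comparingLong((CellSketch c) -> c.sequenceId).reversed());

  /** Returns the value a scan is expected to show for the row, or null if absent. */
  static String read(List<CellSketch> cells, String row) {
    return cells.stream()
        .filter(c -> c.row.equals(row))
        .min(NEWEST_FIRST)
        .map(c -> c.value)
        .orElse(null);
  }
}
```

In this model the second write to the same row key and timestamp is the one 
returned, both before and after a restart, which matches the expectation that 
the latest cell must be returned.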





[jira] [Created] (HBASE-23739) BoundedRecoveredHFilesOutputSink should read the table descriptor directly

2020-01-26 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23739:
--

 Summary: BoundedRecoveredHFilesOutputSink should read the table 
descriptor directly
 Key: HBASE-23739
 URL: https://issues.apache.org/jira/browse/HBASE-23739
 Project: HBase
  Issue Type: Sub-task
Reporter: Guanghao Zhang


Read from meta or filesystem?





[jira] [Commented] (HBASE-23286) Improve MTTR: Split WAL to HFile

2020-01-26 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17024056#comment-17024056
 ] 

Guanghao Zhang commented on HBASE-23286:


{quote}how BoundedRecoveredHFilesOutputSink will retrieve the table descriptor 
when HMaster hasn't finished initialization
{quote}
Good question. This is likely a problem. Let me open a new issue for it.

> Improve MTTR: Split WAL to HFile
> 
>
> Key: HBASE-23286
> URL: https://issues.apache.org/jira/browse/HBASE-23286
> Project: HBase
>  Issue Type: Improvement
>  Components: MTTR
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0
>
>
> After HBASE-20724, the compaction event marker is no longer used during 
> failover. So our new proposal is to split the WAL to HFiles to improve MTTR. 
> It has 3 steps:
>  # Read WAL and write HFile to region’s column family’s recovered.hfiles 
> directory.
>  # Open region.
>  # Bulkload the recovered.hfiles for every column family.
> The design doc is attached as a Google doc. Any suggestions are welcome.
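
A simplified sketch of step 1 above, with steps 2 and 3 noted in comments. 
WalSplitSketch and Edit are hypothetical, and the directory layout 
tableDir/region/family/recovered.hfiles is an assumption based on the 
description; the real implementation writes HFiles rather than in-memory 
lists.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical model of splitting WAL edits into per-family recovered files. */
final class WalSplitSketch {
  /** Hypothetical, simplified WAL entry. */
  static final class Edit {
    final String region;
    final String family;
    final String row;
    final String value;

    Edit(String region, String family, String row, String value) {
      this.region = region;
      this.family = family;
      this.row = row;
      this.value = value;
    }
  }

  /** Step 1: bucket edits by the directory their recovered file would go to. */
  static Map<String, List<Edit>> groupByRecoveredDir(String tableDir, List<Edit> edits) {
    Map<String, List<Edit>> byDir = new TreeMap<>();
    for (Edit e : edits) {
      String dir = tableDir + "/" + e.region + "/" + e.family + "/recovered.hfiles";
      byDir.computeIfAbsent(dir, k -> new ArrayList<>()).add(e);
    }
    // HFiles hold cells in sorted order, so each bucket is sorted by row here.
    for (List<Edit> bucket : byDir.values()) {
      bucket.sort((a, b) -> a.row.compareTo(b.row));
    }
    return byDir;
  }
  // Step 2 (open the region) and step 3 (bulk load each family's
  // recovered.hfiles directory) would consume the buckets built above.
}
```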



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-19 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23156:
---
Release Note: Introduces a new 
hbase-assembly/src/main/assembly/hadoop-three-compat.xml for building with 
Hadoop 3.x.

> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Resolved] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-19 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23156.

Resolution: Fixed

Pushed to branch-2 and master. Thanks [~zhangduo] for reviewing.

> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Commented] (HBASE-23347) Pluggable RPC authentication

2020-01-19 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17019203#comment-17019203
 ] 

Guanghao Zhang commented on HBASE-23347:


TestThriftSpnegoHttpFallbackServer and TestSecureRESTServer failed. This may 
be related to this issue.

> Pluggable RPC authentication
> 
>
> Key: HBASE-23347
> URL: https://issues.apache.org/jira/browse/HBASE-23347
> Project: HBase
>  Issue Type: Improvement
>  Components: rpc, security
>Reporter: Josh Elser
>Assignee: Josh Elser
>Priority: Major
> Fix For: 3.0.0, 2.3.0
>
>
> Today in HBase, we rely on SASL to implement Kerberos and delegation token 
> authentication. The RPC client and server logic is very tightly coupled to 
> our three authentication mechanisms (the two previously mentioned plus simple 
> auth'n) for no good reason (other than "that's how it was built", best as I 
> can tell).
> SASL's function is to decouple the "application" from how a request is being 
> authenticated, which means that, to support a variety of other authentication 
> approaches, we just need to be a little more flexible in letting developers 
> create their own authentication mechanism for HBase.
> This is less for the "average joe" user to write their own authentication 
> plugin (eek), but more to allow us HBase developers to start iterating, see 
> what is possible.
> I'll attach a full write-up on what I have today as to how I think we can add 
> these abstractions, as well as an initial implementation of this idea, with a 
> unit test that shows an end-to-end authentication solution against HBase.
> cc/ [~wchevreuil] as he's been working with me behind the scenes, giving lots 
> of great feedback and support.





[jira] [Commented] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-19 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018850#comment-17018850
 ] 

Guanghao Zhang commented on HBASE-23156:


There is another problem: hbase.unsafe.stream.capability.enforce needs to be 
set to false when starting standalone HBase. Because LocalFileSystem does not 
support hflush and hsync, the CommonFSUtils.hasCapability check will fail...

> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Assigned] (HBASE-23693) Split failure may cause region hole and data loss when use zk assign

2020-01-18 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23693:
--

Assignee: tianhang tang

> Split failure may cause region hole and data loss when use zk assign
> 
>
> Key: HBASE-23693
> URL: https://issues.apache.org/jira/browse/HBASE-23693
> Project: HBase
>  Issue Type: Bug
>  Components: master
>Affects Versions: 1.4.8
>Reporter: tianhang tang
>Assignee: tianhang tang
>Priority: Critical
> Attachments: HBASE-23693.branch-1.001.patch
>
>
> To mock this case, I added a sleep in SplitTransactionImpl.execute after 
> the PONR (point of no return) and before openDaughters:
> {code:java}
> public PairOfSameType<Region> execute(final Server server,
>     final RegionServerServices services, User user) throws IOException {
>   this.server = server;
>   this.rsServices = services;
>   useZKForAssignment = server == null ? true :
>     ConfigUtil.useZKForAssignment(server.getConfiguration());
>   if (useCoordinatedStateManager(server)) {
>     std =
>       ((BaseCoordinatedStateManager) server.getCoordinatedStateManager())
>         .getSplitTransactionCoordination().getDefaultDetails();
>   }
>   PairOfSameType<Region> regions = createDaughters(server, services, user);
>   if (this.parent.getCoprocessorHost() != null) {
>     if (user == null) {
>       parent.getCoprocessorHost().preSplitAfterPONR();
>     } else {
>       try {
>         user.getUGI().doAs(new PrivilegedExceptionAction<Void>() {
>           @Override
>           public Void run() throws Exception {
>             parent.getCoprocessorHost().preSplitAfterPONR();
>             return null;
>           }
>         });
>       } catch (InterruptedException ie) {
>         InterruptedIOException iioe = new InterruptedIOException();
>         iioe.initCause(ie);
>         throw iioe;
>       }
>     }
>   }
>
>   // Sleep here for one hour (added only to reproduce the problem).
>   try {
>     Thread.sleep(1000 * 60 * 60);
>   } catch (InterruptedException e) {
>     e.printStackTrace();
>   }
>   regions = stepsAfterPONR(server, services, regions, user);
>   transition(SplitTransactionPhase.COMPLETED);
>   return regions;
> }
> {code}
> so the split transaction will hang.
> Then I tried to reproduce the problem:
> 1. Create a test table and move it into a test rsgroup that contains only one RS.
> 2. Trigger a region split.
> 3. The split transaction passes the PONR and then sleeps; the regioninfo in meta has 
> already been updated.
> 4. Kill the RS process to mock a machine crash.
> 5. ServerCrashProcedure cleans up the SPLITTING_NEW regions, so the daughter regions 
> are deleted.
> 6. ServerCrashProcedure tries to assign the parent region. Because the RS is down, 
> the assign fails, the region state is set to FAILED_OPEN, and the region is put back 
> into regionsInTransition. But at this time, due to the RS crash, the node of the 
> region under the ZK region-in-transition path no longer exists.
> 7. The CatalogJanitor thread is blocked because of the RIT.
> 8. Switch the active master.
> 9. The CatalogJanitor thread on the new master executes normally, and the 
> parent region is cleaned up because split = true && offline = true in the 
> meta table.
> 10. We now have a hole in the test table and have lost data.
>  
> I modified the code so that when ServerCrashProcedure cleans up the child regions, it 
> also updates the parent regioninfo in the meta table, and the problem no 
> longer reproduces.
> I will upload the patch later.
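
The failure sequence described above can be illustrated with a toy model. This is only a sketch under stated assumptions: the class, the `RegionRow` type, and all method names are hypothetical and are not HBase code. The meta table is reduced to a map, and the janitor rule is reduced to the "split = true && offline = true" condition quoted in the report.

```java
import java.util.HashMap;
import java.util.Map;

// Toy model only: none of these names exist in HBase.
final class SplitRollbackSketch {
    /** Hypothetical stand-in for the split/offline flags of a region row in meta. */
    static final class RegionRow {
        boolean split;
        boolean offline;

        RegionRow(boolean split, boolean offline) {
            this.split = split;
            this.offline = offline;
        }
    }

    /** Meta state after the PONR: parent marked split and offline, daughters present. */
    static Map<String, RegionRow> metaAfterPonr() {
        Map<String, RegionRow> meta = new HashMap<>();
        meta.put("parent", new RegionRow(true, true));
        meta.put("daughterA", new RegionRow(false, false));
        meta.put("daughterB", new RegionRow(false, false));
        return meta;
    }

    /** Crash cleanup: delete the daughters; optionally reset the parent flags too. */
    static void cleanUpDaughters(Map<String, RegionRow> meta, boolean resetParent) {
        meta.remove("daughterA");
        meta.remove("daughterB");
        if (resetParent) {
            RegionRow parent = meta.get("parent");
            parent.split = false;
            parent.offline = false;
        }
    }

    /** Simplified janitor rule from the report: split && offline means the parent is removed. */
    static boolean janitorWouldDeleteParent(Map<String, RegionRow> meta) {
        RegionRow parent = meta.get("parent");
        return parent != null && parent.split && parent.offline;
    }

    public static void main(String[] args) {
        Map<String, RegionRow> withoutReset = metaAfterPonr();
        cleanUpDaughters(withoutReset, false);
        System.out.println("without reset: " + janitorWouldDeleteParent(withoutReset));

        Map<String, RegionRow> withReset = metaAfterPonr();
        cleanUpDaughters(withReset, true);
        System.out.println("with reset: " + janitorWouldDeleteParent(withReset));
    }
}
```

In this model, deleting the daughters without touching the parent row leaves the parent eligible for cleanup, which corresponds to the hole described in step 10; resetting the parent flags during cleanup avoids it.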





[jira] [Commented] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-18 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018778#comment-17018778
 ] 

Guanghao Zhang commented on HBASE-23156:


It has failed since HBASE-22264; the difference is that two jars are removed from the exclude set:

com.fasterxml.woodstox:woodstox-core
org.codehaus.woodstox:stax2-api
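
One way to confirm whether these two artifacts actually ended up on the classpath is a small probe like the one below. It is a hypothetical diagnostic, not part of HBase: the class name `ClasspathProbe` is made up, and the two probed class names are assumed to be representative of woodstox-core (taken from the stack trace) and stax2-api.

```java
// Hypothetical diagnostic, not part of HBase.
final class ClasspathProbe {
    /** Returns true if the named class can be located by this class's class loader. */
    static boolean isLoadable(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers.
            Class.forName(className, false, ClasspathProbe.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String[] names = {
            "com.ctc.wstx.io.InputBootstrapper",   // class named in the stack trace (woodstox-core)
            "org.codehaus.stax2.XMLInputFactory2"  // assumed representative class from stax2-api
        };
        for (String name : names) {
            System.out.println(name + " loadable: " + isLoadable(name));
        }
    }
}
```

Running it with the same classpath that start-hbase.sh builds should show whether the jars are present in the assembled distribution.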

> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Work started] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-18 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-23156 started by Guanghao Zhang.
--
> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Updated] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-18 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23156:
---
Affects Version/s: 2.3.0
   3.0.0

> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Assigned] (HBASE-23156) start-hbase.sh failed with ClassNotFoundException when build with hadoop3

2020-01-18 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23156:
--

Assignee: Guanghao Zhang

> start-hbase.sh failed with ClassNotFoundException when build with hadoop3
> -
>
> Key: HBASE-23156
> URL: https://issues.apache.org/jira/browse/HBASE-23156
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> {code:java}
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.util.HBaseConfTool.main(HBaseConfTool.java:39)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> Exception in thread "main" java.lang.NoClassDefFoundError: com/ctc/wstx/io/InputBootstrapper
>   at org.apache.hadoop.hbase.zookeeper.ZKServerTool.main(ZKServerTool.java:63)
> Caused by: java.lang.ClassNotFoundException: com.ctc.wstx.io.InputBootstrapper
>   at java.net.URLClassLoader.findClass(URLClassLoader.java:382)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
>   at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:349)
>   at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
>   ... 1 more
> {code}





[jira] [Resolved] (HBASE-23547) Release 2.2.3

2020-01-15 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23547.

  Assignee: Guanghao Zhang
Resolution: Fixed

> Release 2.2.3
> -
>
> Key: HBASE-23547
> URL: https://issues.apache.org/jira/browse/HBASE-23547
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>






[jira] [Resolved] (HBASE-23691) Add 2.2.3 to download page

2020-01-15 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23691.

  Assignee: Guanghao Zhang
Resolution: Fixed

Pushed to master. Thanks [~janh] for reviewing.

> Add 2.2.3 to download page
> --
>
> Key: HBASE-23691
> URL: https://issues.apache.org/jira/browse/HBASE-23691
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>






[jira] [Updated] (HBASE-23691) Add 2.2.3 to download page

2020-01-15 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23691:
---
Fix Version/s: 3.0.0

> Add 2.2.3 to download page
> --
>
> Key: HBASE-23691
> URL: https://issues.apache.org/jira/browse/HBASE-23691
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0
>
>






[jira] [Resolved] (HBASE-23692) Set version as 2.2.4-SNAPSHOT in branch-2.2

2020-01-15 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23692.

Fix Version/s: 2.2.4
 Assignee: Guanghao Zhang
   Resolution: Fixed

> Set version as 2.2.4-SNAPSHOT in branch-2.2
> ---
>
> Key: HBASE-23692
> URL: https://issues.apache.org/jira/browse/HBASE-23692
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 2.2.4
>
>






[jira] [Resolved] (HBASE-23638) Set version to 2.2.3 in branch-2.2 for first RC of 2.2.3

2020-01-15 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23638.

  Assignee: Guanghao Zhang
Resolution: Fixed

> Set version to 2.2.3 in branch-2.2 for first RC of 2.2.3
> 
>
> Key: HBASE-23638
> URL: https://issues.apache.org/jira/browse/HBASE-23638
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 2.2.3
>
>






[jira] [Resolved] (HBASE-23637) Generate CHANGES.md and RELEASENOTES.md for 2.2.3

2020-01-15 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23637.

  Assignee: Guanghao Zhang
Resolution: Fixed

> Generate CHANGES.md and RELEASENOTES.md for 2.2.3
> -
>
> Key: HBASE-23637
> URL: https://issues.apache.org/jira/browse/HBASE-23637
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 2.2.3
>
>






[jira] [Created] (HBASE-23692) Set version as 2.2.4-SNAPSHOT in branch-2.2

2020-01-15 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23692:
--

 Summary: Set version as 2.2.4-SNAPSHOT in branch-2.2
 Key: HBASE-23692
 URL: https://issues.apache.org/jira/browse/HBASE-23692
 Project: HBase
  Issue Type: Sub-task
Reporter: Guanghao Zhang








[jira] [Created] (HBASE-23691) Add 2.2.3 to download page

2020-01-15 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23691:
--

 Summary: Add 2.2.3 to download page
 Key: HBASE-23691
 URL: https://issues.apache.org/jira/browse/HBASE-23691
 Project: HBase
  Issue Type: Sub-task
Reporter: Guanghao Zhang








[jira] [Commented] (HBASE-23684) NPE HFilesOutputSink

2020-01-14 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23684?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17015558#comment-17015558
 ] 

Guanghao Zhang commented on HBASE-23684:


Either tableDesc or tableName is null. [~stack] How many tables are in your test cluster?

> NPE HFilesOutputSink
> 
>
> Key: HBASE-23684
> URL: https://issues.apache.org/jira/browse/HBASE-23684
> Project: HBase
>  Issue Type: Bug
>  Components: wal
>Affects Versions: 2.3.0
>Reporter: Michael Stack
>Priority: Major
>
> Enabling the new split to hfiles feature, HBASE-23286, running branch-2 tip, 
> I see this out on RegionServers:
> {code}
> 2020-01-13 17:37:08,204 INFO org.apache.hadoop.hbase.wal.OutputSink: 3 split writer threads finished
> 2020-01-13 17:37:08,233 INFO org.apache.hadoop.hbase.wal.WALSplitter: Processed 1007 edits across 0 regions cost 284 ms; edits skipped=76; WAL=hdfs://nameservice1/hbase/genie/WALs/hbasedn101.example.org,16020,1578934806382-splitting/hbasedn101.example.org%2C16020%2C1578934806382.1578937008832, size=128.5 M, length=134708720, corrupted=false, progress failed=true
> 2020-01-13 17:37:08,234 WARN org.apache.hadoop.hbase.regionserver.SplitLogWorker: log splitting of WALs/hbasedn101.example.org,16020,1578934806382-splitting/hbasedn101.example.org%2C16020%2C1578934806382.1578937008832 failed, returning error
> java.io.IOException: java.lang.NullPointerException
>   at org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.writeRemainingEntryBuffers(BoundedRecoveredHFilesOutputSink.java:173)
>   at org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.close(BoundedRecoveredHFilesOutputSink.java:140)
>   at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:339)
>   at org.apache.hadoop.hbase.wal.WALSplitter.splitLogFile(WALSplitter.java:181)
>   at org.apache.hadoop.hbase.regionserver.SplitLogWorker.splitLog(SplitLogWorker.java:105)
>   at org.apache.hadoop.hbase.regionserver.SplitLogWorker.lambda$new$0(SplitLogWorker.java:84)
>   at org.apache.hadoop.hbase.regionserver.handler.WALSplitterHandler.process(WALSplitterHandler.java:70)
>   at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:104)
>   at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>   at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>   at java.base/java.lang.Thread.run(Thread.java:834)
> Caused by: java.lang.NullPointerException
>   at org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.configContextForNonMetaWriter(BoundedRecoveredHFilesOutputSink.java:225)
>   at org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.createRecoveredHFileWriter(BoundedRecoveredHFilesOutputSink.java:213)
>   at org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.append(BoundedRecoveredHFilesOutputSink.java:117)
>   at org.apache.hadoop.hbase.wal.BoundedRecoveredHFilesOutputSink.lambda$writeRemainingEntryBuffers$3(BoundedRecoveredHFilesOutputSink.java:155)
>   at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
>   at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
>   at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> {code}
> It is a bit odd because log says there were zero regions. Not sure what that 
> was about.





[jira] [Reopened] (HBASE-23662) Replace HColumnDescriptor(String cf) with ColumnFamilyDescriptor

2020-01-12 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reopened HBASE-23662:


> Replace HColumnDescriptor(String cf) with ColumnFamilyDescriptor
> 
>
> Key: HBASE-23662
> URL: https://issues.apache.org/jira/browse/HBASE-23662
> Project: HBase
>  Issue Type: Task
>Affects Versions: 3.0.0
>Reporter: Viraj Jasani
>Assignee: Viraj Jasani
>Priority: Major
> Fix For: 3.0.0
>
>
> We have deprecated HColumnDescriptor and HTableDescriptor some time back in 
> 2.0.0. Although there are many use-cases still using them, we can start 
> replacing them with the builder pattern: ColumnFamilyDescriptorBuilder and 
> TableDescriptorBuilder.
> In an attempt to do so, we start by replacing a specific constructor of 
> HColumnDescriptor:
> {code:java}
> public HTableDescriptor(final TableName name) {
>   this(new ModifyableTableDescriptor(name));
> }
> {code}





[jira] [Commented] (HBASE-23662) Replace HColumnDescriptor(String cf) with ColumnFamilyDescriptor

2020-01-12 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17013931#comment-17013931
 ] 

Guanghao Zhang commented on HBASE-23662:


[https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5277/testReport/junit/org.apache.hadoop.hbase.client/TestShell/testRunShellTests/]

> Replace HColumnDescriptor(String cf) with ColumnFamilyDescriptor
> 
>
> Key: HBASE-23662
> URL: https://issues.apache.org/jira/browse/HBASE-23662
> Project: HBase
>  Issue Type: Task
>Affects Versions: 3.0.0
>Reporter: Viraj Jasani
>Assignee: Viraj Jasani
>Priority: Major
> Fix For: 3.0.0
>
>
> We have deprecated HColumnDescriptor and HTableDescriptor some time back in 
> 2.0.0. Although there are many use-cases still using them, we can start 
> replacing them with the builder pattern: ColumnFamilyDescriptorBuilder and 
> TableDescriptorBuilder.
> In an attempt to do so, we start by replacing a specific constructor of 
> HColumnDescriptor:
> {code:java}
> public HTableDescriptor(final TableName name) {
>   this(new ModifyableTableDescriptor(name));
> }
> {code}





[jira] [Commented] (HBASE-23662) Replace HColumnDescriptor(String cf) with ColumnFamilyDescriptor

2020-01-12 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17013930#comment-17013930
 ] 

Guanghao Zhang commented on HBASE-23662:


All shell unit tests failed because the shell code still needs 
HColumnDescriptor(String cf). Please upload an addendum patch to update the 
shell use case too. Thanks.

> Replace HColumnDescriptor(String cf) with ColumnFamilyDescriptor
> 
>
> Key: HBASE-23662
> URL: https://issues.apache.org/jira/browse/HBASE-23662
> Project: HBase
>  Issue Type: Task
>Affects Versions: 3.0.0
>Reporter: Viraj Jasani
>Assignee: Viraj Jasani
>Priority: Major
> Fix For: 3.0.0
>
>
> We have deprecated HColumnDescriptor and HTableDescriptor some time back in 
> 2.0.0. Although there are many use-cases still using them, we can start 
> replacing them with the builder pattern: ColumnFamilyDescriptorBuilder and 
> TableDescriptorBuilder.
> In an attempt to do so, we start by replacing a specific constructor of 
> HColumnDescriptor:
> {code:java}
> public HTableDescriptor(final TableName name) {
>   this(new ModifyableTableDescriptor(name));
> }
> {code}





[jira] [Commented] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-12 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17013929#comment-17013929
 ] 

Guanghao Zhang commented on HBASE-23659:


|master.assignment.TestRegionReplicaSplit|20.0% (6 / 30)|

> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently the wouldLowerAvailability method only considers the primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it 
> can be assigned to the same server as other replicas of that region. 
>  
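
The check the description asks for can be sketched as follows. This is a minimal illustration, assuming a simplified model: the `Replica` type and this method signature are hypothetical and do not match the real BaseLoadBalancer code. The point is only that a candidate is rejected when any other replica of the same region, primary or not, is already hosted on the server.

```java
import java.util.Arrays;
import java.util.List;

// Simplified model; not the BaseLoadBalancer implementation.
final class ReplicaPlacementSketch {
    /** Hypothetical stand-in for a region replica: region name plus replica id (0 = primary). */
    static final class Replica {
        final String regionName;
        final int replicaId;

        Replica(String regionName, int replicaId) {
            this.regionName = regionName;
            this.replicaId = replicaId;
        }
    }

    /**
     * Returns true if placing the candidate on a server that already hosts the
     * given replicas would co-locate two replicas of the same region.
     */
    static boolean wouldLowerAvailability(List<Replica> hostedOnServer, Replica candidate) {
        for (Replica hosted : hostedOnServer) {
            boolean sameRegion = hosted.regionName.equals(candidate.regionName);
            if (sameRegion && hosted.replicaId != candidate.replicaId) {
                return true; // another replica of this region is already here
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // The server hosts a non-primary replica (id 1) of regionA.
        List<Replica> server = Arrays.asList(new Replica("regionA", 1));
        System.out.println(wouldLowerAvailability(server, new Replica("regionA", 2))); // true
        System.out.println(wouldLowerAvailability(server, new Replica("regionB", 0))); // false
    }
}
```

A check that compared only against primaries (replica id 0) would return false for the first call, which is the gap the issue describes.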





[jira] [Commented] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-12 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17013928#comment-17013928
 ] 

Guanghao Zhang commented on HBASE-23658:


|master.cleaner.TestSnapshotFromMaster|20.0% (6 / 30)|

Not flaky anymore. See 
[https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html].

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3, 2.1.9
>
> Attachments: HBASE-23658.branch-2.2.001.patch
>
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Commented] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-12 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17013927#comment-17013927
 ] 

Guanghao Zhang commented on HBASE-23655:


|rsgroup.TestRSGroupsKillRS|3.3% (1 / 30)|

Not flaky anymore. See 
[https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> teardownAfterMethod calls moveTables and deletes RSGroups. This fails 
> if it does not wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Commented] (HBASE-23601) OutputSink.WriterThread exception gets stuck and repeated indefinietly

2020-01-10 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17012637#comment-17012637
 ] 

Guanghao Zhang commented on HBASE-23601:


I plan to release 2.2.3 now. Let me revert it from branch-2.2, as this is not 
resolved yet. I will merge it again after the release. Thanks.

> OutputSink.WriterThread exception gets stuck and repeated indefinietly
> --
>
> Key: HBASE-23601
> URL: https://issues.apache.org/jira/browse/HBASE-23601
> Project: HBase
>  Issue Type: Bug
>  Components: read replicas
>Affects Versions: 2.2.2
>Reporter: Szabolcs Bukros
>Assignee: Szabolcs Bukros
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.1.9, 2.2.4
>
>
> When a WriterThread runs into an exception (e.g. NotServingRegionException), 
> the exception is stored in the controller. It is never removed and cannot be 
> overwritten either.
>  
> {code:java}
> public void run() {
>   try {
>     doRun();
>   } catch (Throwable t) {
>     LOG.error("Exiting thread", t);
>     controller.writerThreadError(t);
>   }
> }{code}
> Because of this, every time PipelineController.checkForErrors() is called, the 
> same old exception is rethrown.
>  
> For example, RegionReplicaReplicationEndpoint.replicate has a while 
> loop that does the actual replicating. Every time it loops, it calls 
> checkForErrors(), catches the rethrown exception, and logs it, but does nothing 
> about it. In my experience this produces ~2GB of log files in ~5 minutes.
>  
> My proposal is to clear the stored exception when it reaches 
> RegionReplicaReplicationEndpoint.replicate and to make sure we restart the 
> WriterThread that died throwing it.
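
The difference between the sticky behaviour described above and the proposed clean-up can be illustrated with a small sketch. This is not the HBase controller: the class ErrorHolderSketch and the method names checkForErrorsSticky and checkForErrorsAndClear are hypothetical, and restarting the writer thread is left out.

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical stand-in for the controller described above; not the HBase class.
class ErrorHolderSketch {
  private final AtomicReference<Throwable> thrown = new AtomicReference<>();

  // Called by a writer thread when it dies; only the first error is kept.
  void writerThreadError(Throwable t) {
    thrown.compareAndSet(null, t);
  }

  // Sticky variant: the stored error is rethrown on every call.
  void checkForErrorsSticky() throws Exception {
    Throwable t = thrown.get();
    if (t != null) {
      throw new Exception("writer thread failed", t);
    }
  }

  // Proposed variant (assumption): the error is handed out once and then cleared,
  // so the caller can restart the writer instead of logging the same error forever.
  void checkForErrorsAndClear() throws Exception {
    Throwable t = thrown.getAndSet(null);
    if (t != null) {
      throw new Exception("writer thread failed", t);
    }
  }
}
```

With the sticky variant, a loop that calls the check on every iteration sees the same exception each time; with the clearing variant it sees it once.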





[jira] [Updated] (HBASE-23666) Backport "HBASE-23660 hbase:meta's table.jsp ref to wrong rs address" to branch-2

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23666:
---
Fix Version/s: 2.2.3

> Backport "HBASE-23660 hbase:meta's table.jsp ref to wrong rs address" to 
> branch-2
> -
>
> Key: HBASE-23666
> URL: https://issues.apache.org/jira/browse/HBASE-23666
> Project: HBase
>  Issue Type: Bug
>  Components: master
>Affects Versions: 2.2.2
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 2.3.0, 2.2.3
>
>






[jira] [Updated] (HBASE-23666) Backport "HBASE-23660 hbase:meta's table.jsp ref to wrong rs address" to branch-2

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23666:
---
Fix Version/s: (was: 2.2.3)

> Backport "HBASE-23660 hbase:meta's table.jsp ref to wrong rs address" to 
> branch-2
> -
>
> Key: HBASE-23666
> URL: https://issues.apache.org/jira/browse/HBASE-23666
> Project: HBase
>  Issue Type: Bug
>  Components: master
>Affects Versions: 2.2.2
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 2.3.0
>
>






[jira] [Updated] (HBASE-23651) Region balance throttling can be disabled

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23651:
---
Fix Version/s: (was: 2.2.4)
   2.2.3

> Region balance throttling can be disabled
> -
>
> Key: HBASE-23651
> URL: https://issues.apache.org/jira/browse/HBASE-23651
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: 2.2.3
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> HBASE-17178 added region balance throttling, but it cannot be disabled. 
> Sometimes we need to balance the cluster as fast as possible, with no throttling.





[jira] [Updated] (HBASE-23663) Allow dot and hyphen in Profiler's URL

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23663:
---
Fix Version/s: (was: 2.2.4)
   2.2.3

> Allow dot and hyphen in Profiler's URL
> --
>
> Key: HBASE-23663
> URL: https://issues.apache.org/jira/browse/HBASE-23663
> Project: HBase
>  Issue Type: Bug
>  Components: profiler
>Affects Versions: 3.0.0, 2.2.0, 2.3.0, 2.1.5
>Reporter: Peter Somogyi
>Assignee: Peter Somogyi
>Priority: Minor
> Fix For: 3.0.0, 2.3.0, 2.2.3, 2.1.9
>
>
> ProfileOutputServlet sanitizes the input URL but fails when the query string 
> contains a dot or a hyphen. These are valid characters in a hostname.
> Example URL part:
> /prof-output/async-prof-pid-122466-cpu-1.svg?host=myhost-1.example.com=16010
>  
> In this case the user gets the following error message:
> {noformat}
> HTTP ERROR: 500
> Problem accessing /prof-output/async-prof-pid-122466-cpu-1.svg. Reason:
> java.lang.RuntimeException: Non-alphanumeric data found in input, 
> aborting.
> Powered by Jetty:// 9.3.27.v20190418 {noformat}
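
A minimal sketch of the kind of check involved, assuming the sanitizer is a whitelist regular expression. The class SanitizeSketch and both method names are hypothetical and do not reproduce the ProfileOutputServlet code; only the error message is taken from the report above.

```java
import java.util.regex.Pattern;

// Hypothetical illustration of a whitelist check; not the ProfileOutputServlet code.
class SanitizeSketch {
  // Strict whitelist: letters and digits only.
  private static final Pattern ALPHA_NUMERIC = Pattern.compile("[a-zA-Z0-9]*");
  // Relaxed whitelist: also accepts '.' and '-', which are valid in hostnames.
  private static final Pattern HOSTNAME_CHARS = Pattern.compile("[a-zA-Z0-9.-]*");

  static String sanitizeStrict(String input) {
    return check(ALPHA_NUMERIC, input);
  }

  static String sanitizeRelaxed(String input) {
    return check(HOSTNAME_CHARS, input);
  }

  private static String check(Pattern allowed, String input) {
    if (!allowed.matcher(input).matches()) {
      throw new RuntimeException("Non-alphanumeric data found in input, aborting.");
    }
    return input;
  }
}
```

Here sanitizeStrict("myhost-1.example.com") throws, while sanitizeRelaxed accepts the same value and still rejects characters such as '/'. Note that a relaxed whitelist like this also accepts "..", so the value should not be used to build file paths.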





[jira] [Resolved] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23658.

Resolution: Fixed

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3, 2.1.9
>
> Attachments: HBASE-23658.branch-2.2.001.patch
>
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Resolved] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23659.

Resolution: Fixed

> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently, the wouldLowerAvailability method only considers primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it can be 
> assigned to the same server as other replica regions. 
>  





[jira] [Resolved] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23655.

Resolution: Fixed

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> teardownAfterMethod calls moveTables and deletes RSGroups. This fails 
> if it does not wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Commented] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-10 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17012552#comment-17012552
 ] 

Guanghao Zhang commented on HBASE-23659:


Pushed to branch-2.2+. Thanks all for reviewing.

> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently, the wouldLowerAvailability method only considers primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it can be 
> assigned to the same server as other replica regions. 
>  





[jira] [Updated] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-10 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23659:
---
Fix Version/s: 2.2.3
   2.3.0
   3.0.0

> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently, the wouldLowerAvailability method only considers primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it can be 
> assigned to the same server as other replica regions. 
>  





[jira] [Commented] (HBASE-23286) Improve MTTR: Split WAL to HFile

2020-01-10 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17012543#comment-17012543
 ] 

Guanghao Zhang commented on HBASE-23286:


{quote}bq.  I think [~zghao] has already ported the patch to our internal 
branch-2 based branch and tested the feature?
{quote}
Yes. I tested this on our internal branch.

> Improve MTTR: Split WAL to HFile
> 
>
> Key: HBASE-23286
> URL: https://issues.apache.org/jira/browse/HBASE-23286
> Project: HBase
>  Issue Type: Improvement
>  Components: MTTR
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0
>
>
> After HBASE-20724, the compaction event marker is no longer used during 
> failover. So our new proposal is to split the WAL to HFiles to improve MTTR. It has 3 
> steps:
>  # Read the WAL and write HFiles to the recovered.hfiles directory of each 
> column family of the region.
>  # Open the region.
>  # Bulk load the recovered.hfiles for every column family.
> The design doc is attached as a Google doc. Any suggestions are welcome.





[jira] [Commented] (HBASE-23286) Improve MTTR: Split WAL to HFile

2020-01-09 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17012401#comment-17012401
 ] 

Guanghao Zhang commented on HBASE-23286:


[~stack]  Let me take a look at this.

> Improve MTTR: Split WAL to HFile
> 
>
> Key: HBASE-23286
> URL: https://issues.apache.org/jira/browse/HBASE-23286
> Project: HBase
>  Issue Type: Improvement
>  Components: MTTR
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0
>
>
> After HBASE-20724, the compaction event marker is no longer used during 
> failover. So our new proposal is to split the WAL to HFiles to improve MTTR. It has 3 
> steps:
>  # Read the WAL and write HFiles to the recovered.hfiles directory of each 
> column family of the region.
>  # Open the region.
>  # Bulk load the recovered.hfiles for every column family.
> The design doc is attached as a Google doc. Any suggestions are welcome.





[jira] [Commented] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-09 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011717#comment-17011717
 ] 

Guanghao Zhang commented on HBASE-23655:


Pushed to branch-2.2+.

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> teardownAfterMethod calls moveTables and deletes RSGroups. This fails 
> if it does not wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Assigned] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-09 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23658:
--

Assignee: Guanghao Zhang

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3, 2.1.9
>
> Attachments: HBASE-23658.branch-2.2.001.patch
>
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Commented] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-09 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011637#comment-17011637
 ] 

Guanghao Zhang commented on HBASE-23658:


Pushed to branch-2.1+. Thanks [~zhangduo] for reviewing.

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Priority: Major
> Attachments: HBASE-23658.branch-2.2.001.patch
>
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Updated] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-09 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23658:
---
Fix Version/s: 2.1.9
   2.2.3
   2.3.0
   3.0.0

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3, 2.1.9
>
> Attachments: HBASE-23658.branch-2.2.001.patch
>
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Commented] (HBASE-23651) Region balance throttling can be disabled

2020-01-08 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011324#comment-17011324
 ] 

Guanghao Zhang commented on HBASE-23651:


[~binlijin] Please update the Release Note. Thanks.

> Region balance throttling can be disabled
> -
>
> Key: HBASE-23651
> URL: https://issues.apache.org/jira/browse/HBASE-23651
> Project: HBase
>  Issue Type: Improvement
>Affects Versions: 2.2.3
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.4
>
>
> HBASE-17178 added region balance throttling, but it cannot be disabled. 
> Sometimes we need to balance the cluster as fast as possible, with no throttling.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-08 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23659:
--

Assignee: Guanghao Zhang

> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently, the wouldLowerAvailability method only considers primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it can be 
> assigned to the same server as other replica regions. 
>  





[jira] [Commented] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-08 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010594#comment-17010594
 ] 

Guanghao Zhang commented on HBASE-23659:


Found a bug while trying to fix this. The primary index may be -1, in which case 
wouldLowerAvailability will always return true.

 
{code:java}
int primary = regionIndexToPrimaryIndex[region]; // may be -1

if (contains(primariesOfRegionsPerServer[server], primary)) {
  // check for whether there are other servers that we can place this region
  for (int i = 0; i < primariesOfRegionsPerServer.length; i++) {
    if (i != server && !contains(primariesOfRegionsPerServer[i], primary)) {
      LOG.debug("Better server return true, region " + regionInfo + " server "
        + serverName);
      return true; // meaning there is a better server
    }
  }
  LOG.debug("No better server return false, region " + regionInfo + " server "
    + serverName);
  return false; // there is not a better server to place this
}
{code}
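
One way to guard against the -1 case can be sketched on a simplified model of the arrays used above. This is an assumption for illustration and not the patch that was committed: the class PrimaryIndexGuardSketch, the method betterServerExists, and the reading of a negative index as "primary unknown" are all hypothetical.

```java
import java.util.Arrays;

// Simplified, hypothetical model of the check quoted above; not the HBase implementation.
class PrimaryIndexGuardSketch {
  // Each per-server array is assumed to be sorted, so binary search can be used.
  static boolean contains(int[] sorted, int value) {
    return Arrays.binarySearch(sorted, value) >= 0;
  }

  // Returns true if 'server' already hosts 'primary' and some other server does not.
  static boolean betterServerExists(int[][] primariesPerServer, int server, int primary) {
    if (primary < 0) {
      // Assumption: a negative index means the primary is unknown, so the
      // primary-based check cannot conclude anything and is skipped.
      return false;
    }
    if (!contains(primariesPerServer[server], primary)) {
      return false;
    }
    for (int i = 0; i < primariesPerServer.length; i++) {
      if (i != server && !contains(primariesPerServer[i], primary)) {
        return true; // another server does not host this primary yet
      }
    }
    return false;
  }
}
```

The early return keeps a placeholder value such as -1 from being matched against per-server arrays that may themselves contain -1 entries.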
 

> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Priority: Major
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently, the wouldLowerAvailability method only considers primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it can be 
> assigned to the same server as other replica regions. 
>  





[jira] [Updated] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-08 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23659:
---
Description: 
Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
It may fail with 

java.lang.AssertionError: Splitted regions should not be assigned to same 
region server.

See 
[https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].

 

Currently, the wouldLowerAvailability method only considers primary regions. A 
replica region can't be assigned to the same server as its primary region, but it can be 
assigned to the same server as other replica regions. 

 

  was:
Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
It may fail with 

java.lang.AssertionError: Splitted regions should not be assigned to same 
region server.

See 
[https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].

 

 


> BaseLoadBalancer#wouldLowerAvailability should consider region replicas
> ---
>
> Key: HBASE-23659
> URL: https://issues.apache.org/jira/browse/HBASE-23659
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Priority: Major
>
> Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
> It may fail with 
> java.lang.AssertionError: Splitted regions should not be assigned to same 
> region server.
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].
>  
> Currently, the wouldLowerAvailability method only considers primary regions. A 
> replica region can't be assigned to the same server as its primary region, but it can be 
> assigned to the same server as other replica regions. 
>  
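
The gap described in this issue can be shown with a toy model. Everything here is hypothetical (the Replica class, both method names, and the use of replica id 0 for the primary are assumptions made for the sketch); it only contrasts a primary-only check with a replica-aware one.

```java
import java.util.List;

// Toy model of replica placement; not the BaseLoadBalancer code.
class ReplicaPlacementSketch {
  static final class Replica {
    final String region; // name of the region this copy belongs to
    final int replicaId; // assumption for this sketch: 0 is the primary
    Replica(String region, int replicaId) {
      this.region = region;
      this.replicaId = replicaId;
    }
  }

  // Primary-only check: a conflict is seen only if the primary of the same region is hosted.
  static boolean conflictsPrimaryOnly(List<Replica> hosted, Replica candidate) {
    for (Replica r : hosted) {
      if (r.region.equals(candidate.region) && r.replicaId == 0) {
        return true;
      }
    }
    return false;
  }

  // Replica-aware check: any hosted copy of the same region counts as a conflict.
  static boolean conflictsAnyReplica(List<Replica> hosted, Replica candidate) {
    for (Replica r : hosted) {
      if (r.region.equals(candidate.region)) {
        return true;
      }
    }
    return false;
  }
}
```

For a server that hosts replica 1 of a region, the primary-only check reports no conflict when replica 2 of the same region is placed there, while the replica-aware check does.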





[jira] [Created] (HBASE-23659) BaseLoadBalancer#wouldLowerAvailability should consider region replicas

2020-01-08 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23659:
--

 Summary: BaseLoadBalancer#wouldLowerAvailability should consider 
region replicas
 Key: HBASE-23659
 URL: https://issues.apache.org/jira/browse/HBASE-23659
 Project: HBase
  Issue Type: Bug
Reporter: Guanghao Zhang


Found this issue while trying to fix the flaky unit test TestRegionReplicaSplit. 
It may fail with 

java.lang.AssertionError: Splitted regions should not be assigned to same 
region server.

See 
[https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.assignment/TestRegionReplicaSplit/testRegionReplicaSplitRegionAssignment/].

 

 





[jira] [Updated] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-07 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23658:
---
Attachment: HBASE-23658.branch-2.2.001.patch

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Priority: Major
> Attachments: HBASE-23658.branch-2.2.001.patch
>
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Commented] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-07 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010229#comment-17010229
 ] 

Guanghao Zhang commented on HBASE-23658:


Use waitFor to make the unit test more stable.
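
A minimal sketch of the polling idea, as opposed to a fixed Thread.sleep. The helper below is hypothetical: it is not the HBase test utility, and the class name WaitForSketch and the parameter names are assumptions.

```java
import java.util.function.BooleanSupplier;

// Hypothetical polling helper; HBase's own test utilities are not reproduced here.
class WaitForSketch {
  // Polls the condition until it holds or the timeout elapses.
  // Returns true if the condition held, false on timeout or interruption.
  static boolean waitFor(long timeoutMs, long intervalMs, BooleanSupplier condition) {
    long deadline = System.nanoTime() + timeoutMs * 1_000_000L;
    while (!condition.getAsBoolean()) {
      if (System.nanoTime() - deadline >= 0) {
        return false; // timed out
      }
      try {
        Thread.sleep(intervalMs);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt(); // preserve the interrupt status
        return false;
      }
    }
    return true;
  }
}
```

A test would then assert on the result of something like waitFor(timeout, interval, () -> !isTakingAnySnapshot()), so a slow machine only makes the test slower and does not make it fail.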

> Fix flaky TestSnapshotFromMaster
> 
>
> Key: HBASE-23658
> URL: https://issues.apache.org/jira/browse/HBASE-23658
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Priority: Major
>
> testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
> fail.
> {code:java}
> assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
> future.get(); // in branch-2.2, here is Thread.sleep
> assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
> {code}
> See 
> [https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]
>  
> [https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Created] (HBASE-23658) Fix flaky TestSnapshotFromMaster

2020-01-07 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23658:
--

 Summary: Fix flaky TestSnapshotFromMaster
 Key: HBASE-23658
 URL: https://issues.apache.org/jira/browse/HBASE-23658
 Project: HBase
  Issue Type: Bug
Reporter: Guanghao Zhang


testAsyncSnapshotWillNotBlockSnapshotHFileCleaner is flaky.  The assert may 
fail.
{code:java}
assertTrue(master.getSnapshotManager().isTakingAnySnapshot());
future.get(); // in branch-2.2, here is Thread.sleep
assertFalse(master.getSnapshotManager().isTakingAnySnapshot());
{code}
See 
[https://builds.apache.org/job/HBase-Flaky-Tests/job/master/5227/testReport/junit/org.apache.hadoop.hbase.master.cleaner/TestSnapshotFromMaster/testAsyncSnapshotWillNotBlockSnapshotHFileCleaner/]

 

[https://builds.apache.org/view/H-L/view/HBase/job/HBase-Find-Flaky-Tests/job/branch-2.2/lastSuccessfulBuild/artifact/dashboard.html]





[jira] [Updated] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-07 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23655:
---
Fix Version/s: 2.2.3
   2.3.0
   3.0.0

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> In teardownAfterMethod, the test moves tables and deletes RSGroups. This 
> fails if it does not first wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Commented] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-07 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010214#comment-17010214
 ] 

Guanghao Zhang commented on HBASE-23655:


Ping [~zhangduo] for review.

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> In teardownAfterMethod, the test moves tables and deletes RSGroups. This 
> fails if it does not first wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Commented] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-07 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010213#comment-17010213
 ] 

Guanghao Zhang commented on HBASE-23655:


Tested it more than 10 times locally and it passed every time.

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> In teardownAfterMethod, the test moves tables and deletes RSGroups. This 
> fails if it does not first wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Assigned] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-07 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reassigned HBASE-23655:
--

Assignee: Guanghao Zhang

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
>
> In teardownAfterMethod, the test moves tables and deletes RSGroups. This 
> fails if it does not first wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Updated] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-07 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23655:
---
Description: 
In teardownAfterMethod, the test moves tables and deletes RSGroups. This fails 
if it does not first wait for the SCP (ServerCrashProcedure) to finish.
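
Waiting for a background procedure before teardown is commonly done with a 
bounded polling loop. The helper below is a generic sketch: WaitUtil and its 
waitFor signature are hypothetical names for illustration, not the HBase test 
utility API, and the condition passed in would be whatever check reports that 
the SCP has finished.

```java
import java.util.function.BooleanSupplier;

// Hypothetical helper for illustration; not part of HBase.
final class WaitUtil {
  // Polls the condition until it holds or the timeout elapses.
  // Returns true if the condition became true, false on timeout or interrupt.
  static boolean waitFor(long timeoutMillis, long intervalMillis, BooleanSupplier condition) {
    long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
    while (true) {
      if (condition.getAsBoolean()) {
        return true;
      }
      if (System.nanoTime() - deadline >= 0) {
        return false; // timed out without the condition holding
      }
      try {
        Thread.sleep(intervalMillis);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
        return false;
      }
    }
  }
}
```

A teardown method would call waitFor with a generous timeout and fail the test 
if it returns false, rather than proceeding to move tables while the procedure 
is still running.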

 

 

> Fix flaky TestRSGroupsKillRS: should wait the SCP to finish
> ---
>
> Key: HBASE-23655
> URL: https://issues.apache.org/jira/browse/HBASE-23655
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.2.2
>Reporter: Guanghao Zhang
>Priority: Major
>
> In teardownAfterMethod, the test moves tables and deletes RSGroups. This 
> fails if it does not first wait for the SCP (ServerCrashProcedure) to finish.
>  
>  





[jira] [Created] (HBASE-23655) Fix flaky TestRSGroupsKillRS: should wait the SCP to finish

2020-01-07 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23655:
--

 Summary: Fix flaky TestRSGroupsKillRS: should wait the SCP to 
finish
 Key: HBASE-23655
 URL: https://issues.apache.org/jira/browse/HBASE-23655
 Project: HBase
  Issue Type: Bug
Affects Versions: 2.2.2
Reporter: Guanghao Zhang








[jira] [Updated] (HBASE-23638) Set version to 2.2.3 in branch-2.2 for first RC of 2.2.3

2020-01-06 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23638:
---
Fix Version/s: 2.2.3

> Set version to 2.2.3 in branch-2.2 for first RC of 2.2.3
> 
>
> Key: HBASE-23638
> URL: https://issues.apache.org/jira/browse/HBASE-23638
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Priority: Major
> Fix For: 2.2.3
>
>






[jira] [Updated] (HBASE-23637) Generate CHANGES.md and RELEASENOTES.md for 2.2.3

2020-01-06 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23637:
---
Fix Version/s: 2.2.3

> Generate CHANGES.md and RELEASENOTES.md for 2.2.3
> -
>
> Key: HBASE-23637
> URL: https://issues.apache.org/jira/browse/HBASE-23637
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Guanghao Zhang
>Priority: Major
> Fix For: 2.2.3
>
>






[jira] [Commented] (HBASE-23636) Disable table may hang when regionserver stop or abort.

2020-01-05 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17008555#comment-17008555
 ] 

Guanghao Zhang commented on HBASE-23636:


I plan to include this fix in 2.2.3. [~binlijin], please help commit it. Thanks.

> Disable table may hang when regionserver stop or abort.
> ---
>
> Key: HBASE-23636
> URL: https://issues.apache.org/jira/browse/HBASE-23636
> Project: HBase
>  Issue Type: Bug
>  Components: amv2
>Affects Versions: 2.2.3
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> ServerCrashProcedure skips a region when its table is being disabled and no 
> procedure is attached to the region, so when DisableTableProcedure later 
> tries to close that region it never succeeds.





[jira] [Updated] (HBASE-23636) Disable table may hang when regionserver stop or abort.

2020-01-05 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23636:
---
Fix Version/s: 2.3.0
   3.0.0

> Disable table may hang when regionserver stop or abort.
> ---
>
> Key: HBASE-23636
> URL: https://issues.apache.org/jira/browse/HBASE-23636
> Project: HBase
>  Issue Type: Bug
>  Components: amv2
>Affects Versions: 2.2.3
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> ServerCrashProcedure skips a region when its table is being disabled and no 
> procedure is attached to the region, so when DisableTableProcedure later 
> tries to close that region it never succeeds.





[jira] [Updated] (HBASE-23636) Disable table may hang when regionserver stop or abort.

2020-01-05 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23636:
---
Issue Type: Bug  (was: Improvement)

> Disable table may hang when regionserver stop or abort.
> ---
>
> Key: HBASE-23636
> URL: https://issues.apache.org/jira/browse/HBASE-23636
> Project: HBase
>  Issue Type: Bug
>  Components: amv2
>Affects Versions: 2.2.3
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
>
> ServerCrashProcedure skips a region when its table is being disabled and no 
> procedure is attached to the region, so when DisableTableProcedure later 
> tries to close that region it never succeeds.





[jira] [Updated] (HBASE-23636) Disable table may hang when regionserver stop or abort.

2020-01-05 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23636:
---
Fix Version/s: 2.2.3

> Disable table may hang when regionserver stop or abort.
> ---
>
> Key: HBASE-23636
> URL: https://issues.apache.org/jira/browse/HBASE-23636
> Project: HBase
>  Issue Type: Bug
>  Components: amv2
>Affects Versions: 2.2.3
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 2.2.3
>
>
> ServerCrashProcedure skips a region when its table is being disabled and no 
> procedure is attached to the region, so when DisableTableProcedure later 
> tries to close that region it never succeeds.





[jira] [Commented] (HBASE-23642) Reintroduce ReplicationUtils.contains as deprecated

2020-01-03 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17007849#comment-17007849
 ] 

Guanghao Zhang commented on HBASE-23642:


I searched the issues yesterday but did not find this one. Thanks for the 
reminder. :) 

> Reintroduce ReplicationUtils.contains as deprecated
> ---
>
> Key: HBASE-23642
> URL: https://issues.apache.org/jira/browse/HBASE-23642
> Project: HBase
>  Issue Type: Task
>  Components: Replication
>Affects Versions: 2.3.0, 2.2.3, 2.1.9
>Reporter: Peter Somogyi
>Assignee: Peter Somogyi
>Priority: Major
> Fix For: 2.3.0, 2.2.3, 2.1.9
>
>
> ReplicationUtils.contains was removed in HBASE-23345. This is an IA.Private 
> class but HBCK2 relied on this method.
> The plan is to add back the method as deprecated and also move HBCK2 off of 
> this to the new ReplicationPeerConfig#needToReplicate(TableName) instead 
> (HBASE-23641).
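
The plan described above, keeping the old entry point as a deprecated 
delegate while callers move to the new method, can be sketched as follows. 
PeerConfigSketch and ReplicationUtilsSketch are hypothetical stand-ins that 
use a plain String for the table name; they are not the real 
ReplicationPeerConfig or ReplicationUtils classes, and the real signatures may 
differ.

```java
import java.util.Set;

// Hypothetical stand-in for the peer config class; not HBase code.
final class PeerConfigSketch {
  private final Set<String> replicatedTables;

  PeerConfigSketch(Set<String> replicatedTables) {
    this.replicatedTables = replicatedTables;
  }

  // The new method that callers should use.
  boolean needToReplicate(String table) {
    return replicatedTables.contains(table);
  }
}

// Hypothetical stand-in for the utility class that lost its method.
final class ReplicationUtilsSketch {
  /**
   * Kept only so existing callers keep compiling.
   *
   * @deprecated use {@link PeerConfigSketch#needToReplicate(String)} instead
   */
  @Deprecated
  static boolean contains(PeerConfigSketch peerConfig, String table) {
    return peerConfig.needToReplicate(table); // delegate, no duplicated logic
  }
}
```

Because the deprecated method only delegates, both entry points always agree, 
and the old one can be removed once no caller depends on it.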
>  





[jira] [Commented] (HBASE-23642) Reintroduce ReplicationUtils.contains as deprecated

2020-01-03 Thread Guanghao Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/HBASE-23642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17007844#comment-17007844
 ] 

Guanghao Zhang commented on HBASE-23642:


OK. +1 for branch-2.2.

> Reintroduce ReplicationUtils.contains as deprecated
> ---
>
> Key: HBASE-23642
> URL: https://issues.apache.org/jira/browse/HBASE-23642
> Project: HBase
>  Issue Type: Task
>  Components: Replication
>Affects Versions: 2.3.0, 2.2.3, 2.1.9
>Reporter: Peter Somogyi
>Assignee: Peter Somogyi
>Priority: Major
> Fix For: 2.3.0, 2.2.3, 2.1.9
>
>
> ReplicationUtils.contains was removed in HBASE-23345. This is an IA.Private 
> class but HBCK2 relied on this method.
> The plan is to add back the method as deprecated and also move HBCK2 off of 
> this to the new ReplicationPeerConfig#needToReplicate(TableName) instead 
> (HBASE-23641).
>  





[jira] [Created] (HBASE-23638) Set version to 2.2.3 in branch-2.2 for first RC of 2.2.3

2020-01-03 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23638:
--

 Summary: Set version to 2.2.3 in branch-2.2 for first RC of 2.2.3
 Key: HBASE-23638
 URL: https://issues.apache.org/jira/browse/HBASE-23638
 Project: HBase
  Issue Type: Sub-task
Reporter: Guanghao Zhang








[jira] [Created] (HBASE-23637) Generate CHANGES.md and RELEASENOTES.md for 2.2.3

2020-01-03 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23637:
--

 Summary: Generate CHANGES.md and RELEASENOTES.md for 2.2.3
 Key: HBASE-23637
 URL: https://issues.apache.org/jira/browse/HBASE-23637
 Project: HBase
  Issue Type: Sub-task
Reporter: Guanghao Zhang








[jira] [Resolved] (HBASE-23175) Yarn unable to acquire delegation token for HBase Spark jobs

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23175.

Resolution: Fixed

Pushed to branch-2.2.

> Yarn unable to acquire delegation token for HBase Spark jobs
> 
>
> Key: HBASE-23175
> URL: https://issues.apache.org/jira/browse/HBASE-23175
> Project: HBase
>  Issue Type: Bug
>  Components: security, spark
>Affects Versions: 2.0.0
>Reporter: Ankit Singhal
>Assignee: Ankit Singhal
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3, 2.1.8
>
> Attachments: HBASE-23175.master.001.patch
>
>
> Spark relies on the TokenUtil.obtainToken(conf) API, which was removed in 
> HBase-2.0. SPARK-26432 switches Spark to the new API, but that fix is planned 
> for Spark-3.0, so we need the fix in HBase until Spark-3.0 is released and we 
> upgrade to it.
> {code}
> 18/03/20 20:39:07 ERROR ApplicationMaster: User class threw exception: 
> org.apache.hadoop.hbase.HBaseIOException: 
> com.google.protobuf.ServiceException: Error calling method 
> hbase.pb.AuthenticationService.GetAuthenticationToken
> org.apache.hadoop.hbase.HBaseIOException: 
> com.google.protobuf.ServiceException: Error calling method 
> hbase.pb.AuthenticationService.GetAuthenticationToken
> at 
> org.apache.hadoop.hbase.shaded.protobuf.ProtobufUtil.makeIOExceptionOfException(ProtobufUtil.java:360)
> at 
> org.apache.hadoop.hbase.shaded.protobuf.ProtobufUtil.handleRemoteException(ProtobufUtil.java:346)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:86)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil$1.run(TokenUtil.java:121)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil$1.run(TokenUtil.java:118)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:422)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1682)
> at 
> org.apache.hadoop.hbase.security.User$SecureHadoopUser.runAs(User.java:313)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:118)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.addTokenForJob(TokenUtil.java:272)
> at 
> org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil.initCredentials(TableMapReduceUtil.java:533)
> at 
> org.apache.hadoop.hbase.spark.HBaseContext.<init>(HBaseContext.scala:73)
> at 
> org.apache.hadoop.hbase.spark.JavaHBaseContext.<init>(JavaHBaseContext.scala:46)
> at 
> org.apache.hadoop.hbase.spark.example.hbasecontext.JavaHBaseBulkDeleteExample.main(JavaHBaseBulkDeleteExample.java:64)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at 
> org.apache.spark.deploy.yarn.ApplicationMaster$$anon$4.run(ApplicationMaster.scala:706)
> Caused by: com.google.protobuf.ServiceException: Error calling method 
> hbase.pb.AuthenticationService.GetAuthenticationToken
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callBlockingMethod(SyncCoprocessorRpcChannel.java:71)
> at 
> org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$BlockingStub.getAuthenticationToken(AuthenticationProtos.java:4512)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:81)
> ... 17 more
> {code}





[jira] [Reopened] (HBASE-23175) Yarn unable to acquire delegation token for HBase Spark jobs

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang reopened HBASE-23175:


Forgot to push this to branch-2.2. Reopening it.

> Yarn unable to acquire delegation token for HBase Spark jobs
> 
>
> Key: HBASE-23175
> URL: https://issues.apache.org/jira/browse/HBASE-23175
> Project: HBase
>  Issue Type: Bug
>  Components: security, spark
>Affects Versions: 2.0.0
>Reporter: Ankit Singhal
>Assignee: Ankit Singhal
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.1.8, 2.2.3
>
> Attachments: HBASE-23175.master.001.patch
>
>
> Spark relies on the TokenUtil.obtainToken(conf) API, which was removed in 
> HBase-2.0. SPARK-26432 switches Spark to the new API, but that fix is planned 
> for Spark-3.0, so we need the fix in HBase until Spark-3.0 is released and we 
> upgrade to it.
> {code}
> 18/03/20 20:39:07 ERROR ApplicationMaster: User class threw exception: 
> org.apache.hadoop.hbase.HBaseIOException: 
> com.google.protobuf.ServiceException: Error calling method 
> hbase.pb.AuthenticationService.GetAuthenticationToken
> org.apache.hadoop.hbase.HBaseIOException: 
> com.google.protobuf.ServiceException: Error calling method 
> hbase.pb.AuthenticationService.GetAuthenticationToken
> at 
> org.apache.hadoop.hbase.shaded.protobuf.ProtobufUtil.makeIOExceptionOfException(ProtobufUtil.java:360)
> at 
> org.apache.hadoop.hbase.shaded.protobuf.ProtobufUtil.handleRemoteException(ProtobufUtil.java:346)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:86)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil$1.run(TokenUtil.java:121)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil$1.run(TokenUtil.java:118)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:422)
> at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1682)
> at 
> org.apache.hadoop.hbase.security.User$SecureHadoopUser.runAs(User.java:313)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:118)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.addTokenForJob(TokenUtil.java:272)
> at 
> org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil.initCredentials(TableMapReduceUtil.java:533)
> at 
> org.apache.hadoop.hbase.spark.HBaseContext.<init>(HBaseContext.scala:73)
> at 
> org.apache.hadoop.hbase.spark.JavaHBaseContext.<init>(JavaHBaseContext.scala:46)
> at 
> org.apache.hadoop.hbase.spark.example.hbasecontext.JavaHBaseBulkDeleteExample.main(JavaHBaseBulkDeleteExample.java:64)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:498)
> at 
> org.apache.spark.deploy.yarn.ApplicationMaster$$anon$4.run(ApplicationMaster.scala:706)
> Caused by: com.google.protobuf.ServiceException: Error calling method 
> hbase.pb.AuthenticationService.GetAuthenticationToken
> at 
> org.apache.hadoop.hbase.client.SyncCoprocessorRpcChannel.callBlockingMethod(SyncCoprocessorRpcChannel.java:71)
> at 
> org.apache.hadoop.hbase.protobuf.generated.AuthenticationProtos$AuthenticationService$BlockingStub.getAuthenticationToken(AuthenticationProtos.java:4512)
> at 
> org.apache.hadoop.hbase.security.token.TokenUtil.obtainToken(TokenUtil.java:81)
> ... 17 more
> {code}





[jira] [Resolved] (HBASE-23553) Snapshot referenced data files are deleted in some case

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23553.

Fix Version/s: 2.2.3
   2.3.0
   3.0.0
   Resolution: Fixed

> Snapshot referenced data files are deleted in some case
> ---
>
> Key: HBASE-23553
> URL: https://issues.apache.org/jira/browse/HBASE-23553
> Project: HBase
>  Issue Type: Bug
>Reporter: Yi Mei
>Assignee: Yi Mei
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.2.3
>
>
> We scanned a snapshot in our cluster and got the following exception:
> {code:java}
> java.io.IOException: java.io.IOException: java.io.FileNotFoundException: 
> Unable to open link: org.apache.hadoop.hbase.io.HFileLink 
> locations=[hdfs://tjwqsrv-galaxy98/hbase/tjwqsrv-galaxy98/data/default/galaxy_online_fds_object_table/06dd90d8540b56343859b63a6134450c/A/4a6cf05f419a9f61059cb05a962f,
>  
> hdfs://tjwqsrv-galaxy98/hbase/tjwqsrv-galaxy98/.tmp/data/default/galaxy_online_fds_object_table/06dd90d8540b56343859b63a6134450c/A/4a6cf05f419a9f61059cb05a962f,
>  
> hdfs://tjwqsrv-galaxy98/hbase/tjwqsrv-galaxy98/archive/data/default/galaxy_online_fds_object_table/06dd90d8540b56343859b63a6134450c/A/4a6cf05f419a9f61059cb05a962f]
>  
> at 
> org.apache.hadoop.hbase.regionserver.HRegion.initializeRegionStores(HRegion.java:867)
>  
> at 
> org.apache.hadoop.hbase.regionserver.HRegion.initializeRegionInternals(HRegion.java:778)
>  at org.apache.hadoop.hbase.regionserver.HRegion.initialize(HRegion.java:749) 
> at 
> org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:5306) 
> at 
> org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:5271) 
> at 
> org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:5243) 
> at 
> org.apache.hadoop.hbase.client.ClientSideRegionScanner.<init>(ClientSideRegionScanner.java:72)
>  
> at 
> org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormatImpl$RecordReader.initialize(TableSnapshotInputFormatImpl.java:239)
>  
> at 
> org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat$TableSnapshotRegionRecordReader.initialize(TableSnapshotInputFormat.java:150)
>  
> at 
> org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:552)
>  at {code}
> I checked the namenode logs and found that this file was deleted by the HBase 
> cleaner even though a snapshot still referenced it.
>  





[jira] [Updated] (HBASE-23231) ReplicationSource do not update metrics after refresh

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23231:
---
Fix Version/s: (was: 2.2.2)
   2.2.3

> ReplicationSource do not update metrics after refresh
> -
>
> Key: HBASE-23231
> URL: https://issues.apache.org/jira/browse/HBASE-23231
> Project: HBase
>  Issue Type: Bug
>  Components: wal
>Affects Versions: 2.2.2
>Reporter: Lijin Bin
>Assignee: Lijin Bin
>Priority: Major
> Fix For: 3.0.0, 2.3.0, 2.1.8, 2.2.3
>
>
> When a replication peer is refreshed to a new state, a new source is created, 
> the old source is replaced by the new one, and the old source is terminated.
> {code}
>   public void refreshSources(String peerId) throws IOException {
>     String terminateMessage = "Peer " + peerId +
>       " state or config changed. Will close the previous replication source and open a new one";
>     ReplicationPeer peer = replicationPeers.getPeer(peerId);
>     ReplicationSourceInterface src = createSource(peerId, peer);
>     // synchronized on latestPaths to avoid missing the new log
>     synchronized (this.latestPaths) {
>       ReplicationSourceInterface toRemove = this.sources.put(peerId, src);
>       if (toRemove != null) {
>         LOG.info("Terminate replication source for " + toRemove.getPeerId());
>         toRemove.terminate(terminateMessage);
>       }
>       for (NavigableSet<String> walsByGroup : walsById.get(peerId).values()) {
>         walsByGroup.forEach(wal -> src.enqueueLog(new Path(this.logDir, wal)));
>       }
>     }
>     LOG.info("Startup replication source for " + src.getPeerId());
>     src.startup();
> {code}
> Terminating a replication source removes all of its metrics. Currently the 
> old source is terminated after the new source has been created and has 
> initialized its metrics, so after a refresh there are no metrics for the 
> replication source.
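
The ordering problem described above can be reproduced with a toy model. 
MetricsOrderSketch, Registry and Source are hypothetical names; the sketch 
assumes, as the report states, that creating a source registers metrics for 
its peer and that terminating a source removes all metrics for that peer. 
Terminating before creating is only one possible ordering that avoids the 
problem, not necessarily the fix that was committed.

```java
import java.util.HashSet;
import java.util.Set;

// Toy model for illustration; not HBase code.
final class MetricsOrderSketch {
  // Peer ids that currently have metrics registered.
  static final class Registry {
    final Set<String> peers = new HashSet<>();
  }

  static final class Source {
    private final String peerId;
    private final Registry registry;

    Source(String peerId, Registry registry) {
      this.peerId = peerId;
      this.registry = registry;
      registry.peers.add(peerId); // creating a source initializes its metrics
    }

    void terminate() {
      registry.peers.remove(peerId); // terminating removes all metrics for the peer
    }
  }

  // Order described in the report: create the new source, then terminate the old one.
  static boolean metricsPresentAfterCreateThenTerminate(String peerId) {
    Registry registry = new Registry();
    Source oldSource = new Source(peerId, registry);
    new Source(peerId, registry);
    oldSource.terminate(); // also wipes the metrics the new source just registered
    return registry.peers.contains(peerId);
  }

  // Alternative order: terminate the old source first, then create the new one.
  static boolean metricsPresentAfterTerminateThenCreate(String peerId) {
    Registry registry = new Registry();
    Source oldSource = new Source(peerId, registry);
    oldSource.terminate();
    new Source(peerId, registry);
    return registry.peers.contains(peerId);
  }
}
```

In this model the first ordering leaves the registry empty for the peer, while 
the second leaves the new source's metrics in place.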





[jira] [Resolved] (HBASE-23286) Improve MTTR: Split WAL to HFile

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang resolved HBASE-23286.

Resolution: Fixed

Pushed to branch-2 and master. Thanks all for reviewing. I have also opened two 
follow-up issues.

> Improve MTTR: Split WAL to HFile
> 
>
> Key: HBASE-23286
> URL: https://issues.apache.org/jira/browse/HBASE-23286
> Project: HBase
>  Issue Type: Improvement
>  Components: MTTR
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0
>
>
> After HBASE-20724, the compaction event marker is no longer used during 
> failover. So our new proposal is to split the WAL to HFiles to improve MTTR. 
> It has 3 steps:
>  # Read WAL and write HFile to region’s column family’s recovered.hfiles 
> directory.
>  # Open region.
>  # Bulkload the recovered.hfiles for every column family.
> The design doc is attached as a Google doc. Any suggestions are welcome.





[jira] [Created] (HBASE-23634) Enable "Split WAL to HFile" by default

2020-01-03 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23634:
--

 Summary: Enable "Split WAL to HFile" by default
 Key: HBASE-23634
 URL: https://issues.apache.org/jira/browse/HBASE-23634
 Project: HBase
  Issue Type: Improvement
Affects Versions: 3.0.0, 2.3.0
Reporter: Guanghao Zhang








[jira] [Created] (HBASE-23633) Find a way to handle the corrupt recovered hfiles

2020-01-03 Thread Guanghao Zhang (Jira)
Guanghao Zhang created HBASE-23633:
--

 Summary: Find a way to handle the corrupt recovered hfiles
 Key: HBASE-23633
 URL: https://issues.apache.org/jira/browse/HBASE-23633
 Project: HBase
  Issue Type: Umbrella
Reporter: Guanghao Zhang


Copied the comments from the PR review.

 

If the file is a corrupt HFile, an exception will be thrown here, which will 
cause the region to fail to open.
Maybe we can add a new parameter to control whether to skip the exception, 
similar to recovered edits, which has the parameter 
"hbase.hregion.edits.replay.skip.errors".

 

Regions that can't be opened because of detached References or corrupt hfiles 
are a fact of life. We need to work on this issue. This will be a new variant 
of the problem, i.e. bad recovered hfiles.

On adding a config to ignore bad files and just open: that's a bit dangerous, 
as per @infraio, as it could mean silent data loss.
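
The trade-off discussed above can be made concrete with a small sketch. 
RecoveredFileLoaderSketch, FileInfo and the skipErrors flag are hypothetical 
names for illustration only; no such parameter exists for recovered hfiles in 
this issue. The sketch shows the two behaviours: failing the open on a corrupt 
file, or skipping it and recording what was skipped so the loss is not silent.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical loader for illustration; not HBase code.
final class RecoveredFileLoaderSketch {
  // A file name plus a flag that marks the file as corrupt in this toy model.
  static final class FileInfo {
    final String name;
    final boolean corrupt;

    FileInfo(String name, boolean corrupt) {
      this.name = name;
      this.corrupt = corrupt;
    }
  }

  // Loads files in order and returns the names that were loaded.
  // With skipErrors=false a corrupt file aborts the whole load.
  // With skipErrors=true the corrupt file is recorded in 'skipped' and ignored,
  // which means the data in that file is not recovered.
  static List<String> load(List<FileInfo> files, boolean skipErrors, List<String> skipped) {
    List<String> loaded = new ArrayList<>();
    for (FileInfo file : files) {
      if (file.corrupt) {
        if (!skipErrors) {
          throw new UncheckedIOException(new IOException("corrupt file: " + file.name));
        }
        skipped.add(file.name); // keep a record so the loss is visible
        continue;
      }
      loaded.add(file.name);
    }
    return loaded;
  }
}
```

Returning the skipped names to the caller is one way to surface the data loss 
instead of hiding it behind a successful region open.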





[jira] [Updated] (HBASE-23633) Find a way to handle the corrupt recovered hfiles

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23633:
---
Issue Type: Improvement  (was: Umbrella)

> Find a way to handle the corrupt recovered hfiles
> -
>
> Key: HBASE-23633
> URL: https://issues.apache.org/jira/browse/HBASE-23633
> Project: HBase
>  Issue Type: Improvement
>Reporter: Guanghao Zhang
>Priority: Major
>
> Copied the comments from the PR review.
>  
> If the file is a corrupt HFile, an exception will be thrown here, which will 
> cause the region to fail to open.
> Maybe we can add a new parameter to control whether to skip the exception, 
> similar to recovered edits, which has the parameter 
> "hbase.hregion.edits.replay.skip.errors".
>  
> Regions that can't be opened because of detached References or corrupt hfiles 
> are a fact of life. We need to work on this issue. This will be a new variant 
> of the problem, i.e. bad recovered hfiles.
> On adding a config to ignore bad files and just open: that's a bit dangerous, 
> as per @infraio, as it could mean silent data loss.





[jira] [Updated] (HBASE-23286) Improve MTTR: Split WAL to HFile

2020-01-03 Thread Guanghao Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-23286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-23286:
---
Release Note: 
Adds a new feature to improve MTTR. Failover now has 3 steps:
1. Read WAL and write HFile to region’s column family’s recovered.hfiles 
directory.
2. Open region.
3. Bulkload the recovered.hfiles for every column family.

Compared to DLS (distributed log splitting), this feature will reduce region 
open time significantly.

Set hbase.wal.split.to.hfile to true to enable this feature.

> Improve MTTR: Split WAL to HFile
> 
>
> Key: HBASE-23286
> URL: https://issues.apache.org/jira/browse/HBASE-23286
> Project: HBase
>  Issue Type: Improvement
>  Components: MTTR
>Affects Versions: 3.0.0, 2.3.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Major
> Fix For: 3.0.0, 2.3.0
>
>
> After HBASE-20724, the compaction event marker is no longer used during 
> failover. So our new proposal is to split the WAL to HFiles to improve MTTR. 
> It has 3 steps:
>  # Read WAL and write HFile to region’s column family’s recovered.hfiles 
> directory.
>  # Open region.
>  # Bulkload the recovered.hfiles for every column family.
> The design doc is attached as a Google doc. Any suggestions are welcome.




