[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13268874#comment-13268874 ]

Hudson commented on HBASE-5873:
-------------------------------

Integrated in HBase-0.92-security #106 (See [https://builds.apache.org/job/HBase-0.92-security/106/])
HBASE-5873 TimeOut Monitor thread should be started after atleast one region server registers. (Revision 1330558)

Result = SUCCESS
larsh :
Files :
* /hbase/branches/0.92/CHANGES.txt
* /hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* /hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/master/HMaster.java

> TimeOut Monitor thread should be started after atleast one region server registers.
> -----------------------------------------------------------------------------------
>
> Key: HBASE-5873
> URL: https://issues.apache.org/jira/browse/HBASE-5873
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.6
> Reporter: ramkrishna.s.vasudevan
> Assignee: rajeshbabu
> Priority: Minor
> Fix For: 0.90.7, 0.92.2, 0.94.0, 0.96.0
>
> Attachments: 5873-trunk.txt, HBASE-5873.patch
>
>
> Currently the timeout monitor thread is started even before any region server has registered with the master.
> In the timeout monitor we depend on the region servers being online:
> {code}
> boolean allRSsOffline = this.serverManager.getOnlineServersList().
>     isEmpty();
> {code}
> When the master starts up, it sees that there are no online servers and hence sets allRSsOffline to true.
> {code}
> setAllRegionServersOffline(allRSsOffline);
> {code}
> So this.allRegionServersOffline is also true.
> By this time an RS has come up. When the timeout monitor runs again in the next cycle (after 10 seconds), it sees allRSsOffline as false.
> Hence
> {code}
> else if (this.allRegionServersOffline && !allRSsOffline) {
>   // if some RSs just came back online, we can start the
>   // the assignment right away
>   actOnTimeOut(regionState);
> {code}
> This condition makes it act as if the region had timed out.
> Because of this, even while an assignment of ROOT is already in progress, this piece of code triggers another assignment, and we get a RegionAlreadyInTransitionException. We then have to wait 30 minutes for ROOT itself to be assigned.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
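The sequence described in the report can be reproduced with a small stand-alone model. This is only a sketch, not HBase code: the class `TimeoutMonitorRace` and its methods `registerServer`, `chore`, and `forcedAssignments` are hypothetical names, and it assumes the monitor updates `allRegionServersOffline` at the end of each cycle, as the quoted snippets suggest. It tracks one region that is in transition but has not yet timed out.

```java
import java.util.ArrayList;
import java.util.List;

/** Simplified, hypothetical model of the per-cycle check quoted in the report. */
public class TimeoutMonitorRace {
    // Mirrors this.allRegionServersOffline: the value remembered from the previous cycle.
    private boolean allRegionServersOffline = false;
    private final List<String> onlineServers = new ArrayList<>();
    private int forcedAssignments = 0;

    void registerServer(String name) {
        onlineServers.add(name);
    }

    /** One monitor cycle for a single region that has NOT reached its timeout. */
    void chore() {
        boolean allRSsOffline = onlineServers.isEmpty();
        if (this.allRegionServersOffline && !allRSsOffline) {
            // "Some RSs just came back online": act as if the region timed out.
            forcedAssignments++;
        }
        // Remember what this cycle saw (assumed to happen at the end of the cycle).
        this.allRegionServersOffline = allRSsOffline;
    }

    int forcedAssignments() {
        return forcedAssignments;
    }

    public static void main(String[] args) {
        // Monitor already running before the first region server registers.
        TimeoutMonitorRace early = new TimeoutMonitorRace();
        early.chore();                 // sees no servers, remembers "all offline"
        early.registerServer("rs1");
        early.chore();                 // sees a server, forces a second assignment
        System.out.println("started early: " + early.forcedAssignments());   // 1

        // Monitor started only after the first region server has registered.
        TimeoutMonitorRace late = new TimeoutMonitorRace();
        late.registerServer("rs1");
        late.chore();
        late.chore();
        System.out.println("started late: " + late.forcedAssignments());     // 0
    }
}
```

In this model the spurious assignment appears only when a cycle runs while the server list is still empty, which is the ordering the issue title proposes to rule out.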
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13263135#comment-13263135 ]

Hudson commented on HBASE-5873:
-------------------------------

Integrated in HBase-0.94-security #22 (See [https://builds.apache.org/job/HBase-0.94-security/22/])
HBASE-5873 TimeOut Monitor thread should be started after atleast one region server registers. (Revision 1330549)

Result = FAILURE
larsh :
Files :
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262360#comment-13262360 ]

ramkrishna.s.vasudevan commented on HBASE-5873:
-----------------------------------------------

@Lars
Thanks for taking care of committing to the other branches also. :)
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262275#comment-13262275 ]

Hudson commented on HBASE-5873:
-------------------------------

Integrated in HBase-TRUNK-security #184 (See [https://builds.apache.org/job/HBase-TRUNK-security/184/])
HBASE-5873 TimeOut Monitor thread should be started after atleast one region server registers. (Revision 1330551)

Result = FAILURE
larsh :
Files :
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262236#comment-13262236 ]

Hudson commented on HBASE-5873:
-------------------------------

Integrated in HBase-0.92 #391 (See [https://builds.apache.org/job/HBase-0.92/391/])
HBASE-5873 TimeOut Monitor thread should be started after atleast one region server registers. (Revision 1330558)

Result = SUCCESS
larsh :
Files :
* /hbase/branches/0.92/CHANGES.txt
* /hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* /hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262164#comment-13262164 ]

Hudson commented on HBASE-5873:
-------------------------------

Integrated in HBase-0.94 #149 (See [https://builds.apache.org/job/HBase-0.94/149/])
HBASE-5873 TimeOut Monitor thread should be started after atleast one region server registers. (Revision 1330549)

Result = SUCCESS
larsh :
Files :
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262159#comment-13262159 ]

Hudson commented on HBASE-5873:
-------------------------------

Integrated in HBase-TRUNK #2814 (See [https://builds.apache.org/job/HBase-TRUNK/2814/])
HBASE-5873 TimeOut Monitor thread should be started after atleast one region server registers. (Revision 1330551)

Result = FAILURE
larsh :
Files :
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/master/AssignmentManager.java
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13262061#comment-13262061 ]

Lars Hofhansl commented on HBASE-5873:
--------------------------------------

Committed to 0.94 and trunk.
Creating 0.90 and 0.92 patches now.
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261940#comment-13261940 ]

Lars Hofhansl commented on HBASE-5873:
--------------------------------------

This change does violate encapsulation a bit. I double-checked where in the code we create instances of AssignmentManager. Besides the HMaster, it is only created from tests (and since they all pass, that's fine).
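The encapsulation trade-off mentioned in this comment can be illustrated with a generic pattern: the component creates its monitor thread in the constructor but leaves starting it to the owner, which does so once its precondition holds. The patch itself is not quoted in this thread, so the following is a sketch under assumptions, not the committed change: `DeferredMonitorStart`, `Manager`, `startTimeOutMonitor`, `isMonitorStarted`, and `startUp` are hypothetical names, and a `CountDownLatch` stands in for "at least one region server has registered".

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class DeferredMonitorStart {

    /** Hypothetical stand-in for the component that owns the monitor thread. */
    static class Manager {
        private final Thread monitor;
        private boolean started = false;

        Manager(Runnable monitorBody) {
            // The constructor only creates the thread; it does not start it.
            this.monitor = new Thread(monitorBody, "timeoutMonitor");
            this.monitor.setDaemon(true);
        }

        /** Every code path that constructs a Manager must now call this itself. */
        synchronized void startTimeOutMonitor() {
            if (!started) {
                started = true;
                monitor.start();
            }
        }

        synchronized boolean isMonitorStarted() {
            return started;
        }
    }

    /**
     * Hypothetical stand-in for the owner's startup sequence: wait until the
     * first server has registered, then start the monitor. Returns false if
     * no server registered within the timeout.
     */
    static boolean startUp(Manager manager, CountDownLatch firstServerRegistered, long timeoutMs) {
        try {
            if (!firstServerRegistered.await(timeoutMs, TimeUnit.MILLISECONDS)) {
                return false;
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        manager.startTimeOutMonitor();
        return true;
    }

    public static void main(String[] args) {
        CountDownLatch registered = new CountDownLatch(1);
        Manager manager = new Manager(() -> { /* periodic timeout checks would run here */ });
        System.out.println(manager.isMonitorStarted());   // false

        registered.countDown();                            // first server registers
        startUp(manager, registered, 1000);
        System.out.println(manager.isMonitorStarted());   // true
    }
}
```

The cost of this arrangement is the one raised in the comment: a caller that constructs the component and forgets to start the monitor gets no timeout handling at all, so every construction site has to be audited.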
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261908#comment-13261908 ]

Lars Hofhansl commented on HBASE-5873:
--------------------------------------

Looks good, tests pass. +1 from me.
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261802#comment-13261802 ]

Hadoop QA commented on HBASE-5873:
----------------------------------

-1 overall. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12524297/5873-trunk.txt
against trunk revision .

+1 @author. The patch does not contain any @author tags.

-1 tests included. The patch doesn't appear to include any new or modified tests.
Please justify why no new tests are needed for this patch.
Also please list what manual steps were performed to verify this patch.

+1 javadoc. The javadoc tool did not generate any warning messages.

+1 javac. The applied patch does not increase the total number of javac compiler warnings.

-1 findbugs. The patch appears to introduce 5 new Findbugs (version 1.3.9) warnings.

+1 release audit. The applied patch does not increase the total number of release audit warnings.

+1 core tests. The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/1647//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/1647//artifact/trunk/patchprocess/newPatchFindbugsWarnings.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/1647//console

This message is automatically generated.
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261763#comment-13261763 ]

Zhihong Yu commented on HBASE-5873:
-----------------------------------

+1 if tests pass.
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261755#comment-13261755 ]

Lars Hofhansl commented on HBASE-5873:
--------------------------------------

Oops, rajesh, we crossed comments.
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261742#comment-13261742 ]

rajeshbabu commented on HBASE-5873:
-----------------------------------

This patch was uploaded for 0.94.
[jira] [Commented] (HBASE-5873) TimeOut Monitor thread should be started after atleast one region server registers.
[ https://issues.apache.org/jira/browse/HBASE-5873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13261713#comment-13261713 ]

Hadoop QA commented on HBASE-5873:
----------------------------------

-1 overall. Here are the results of testing the latest attachment
  http://issues.apache.org/jira/secure/attachment/12524287/HBASE-5873.patch
  against trunk revision .

    +1 @author. The patch does not contain any @author tags.

    -1 tests included. The patch doesn't appear to include any new or modified tests.
       Please justify why no new tests are needed for this patch.
       Also please list what manual steps were performed to verify this patch.

    -1 patch. The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/1645//console

This message is automatically generated.