[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-18 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Attachment: 5200-v4-092.txt

This is what I applied to 0.92.  There is no TestAssignmentManager in 0.92, and 
one rejected hunk took a little work to fix.

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 
> 5200-v4-092.txt, 5200-v4.txt, 5200-v4no-prefix.txt, HBASE-5200.patch, 
> HBASE-5200_1.patch, HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch, hbase-5200_90_latest_new.patch
>
>
> This is the scenario:
> Consider a case where the balancer is running and is trying to close regions 
> on an RS.
> Before the regions could be closed, a master switch happens.  
> On master switch, the set of nodes that are in RIT is collected; for each node 
> we first get its data and start watching it.
> Only after that is the node data added to RIT.
> If, before the node is added to RIT, the RS that was asked to close the region 
> does a transition, AM.handleRegion() skips the event because the RIT state is 
> null.
> {code}
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> a66d281d231dfcaea97c270698b26b6f from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> c12e53bfd48ddc5eec507d66821c4d23 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 59ae13de8c1eb325a0dd51f4902d2052 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> f45bc9614d7575f35244849af85aa078 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> cc3ecd7054fe6cd4a1159ed92fd62641 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 3af40478a17fee96b4a192b22c90d5a2 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> e6096a8466e730463e10d3d61f809b92 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 4806781a1a23066f7baed22b4d237e24 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> d69e104131accaefe21dcc01fddc7629 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> {code}
> In branch, the CLOSING node is created by the RS, which leads to more 
> inconsistency.
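
The ordering problem in the description can be shown with a small single-threaded model. This is only a sketch under stated assumptions: `RitSketch`, its methods, and the use of plain strings for region states are hypothetical stand-ins, not the AssignmentManager code and not the committed patch. It shows that a CLOSED event arriving before the region is recorded in RIT is dropped, while the same event arriving afterwards is accepted.

```java
import java.util.HashMap;
import java.util.Map;

/** Toy model of regions-in-transition (RIT) bookkeeping. Hypothetical, not HBase code. */
class RitSketch {
    // Encoded region name -> state name, guarded by the instance lock.
    private final Map<String, String> regionsInTransition = new HashMap<>();

    /** Failover path: record the state that was read from the region's znode. */
    synchronized void processRegionInTransition(String encodedName, String stateFromZk) {
        regionsInTransition.put(encodedName, stateFromZk);
    }

    /**
     * Event path: returns true if the CLOSED event was accepted, false if it was
     * dropped because the region was not in PENDING_CLOSE or CLOSING (for example,
     * because it had not been added to RIT yet and the state is null).
     */
    synchronized boolean handleClosed(String encodedName) {
        String state = regionsInTransition.get(encodedName);
        boolean expected = "PENDING_CLOSE".equals(state) || "CLOSING".equals(state);
        if (!expected) {
            return false; // corresponds to the WARN lines in the log above
        }
        regionsInTransition.put(encodedName, "CLOSED");
        return true;
    }

    /** Current recorded state, or null if the region is not in RIT. */
    synchronized String stateOf(String encodedName) {
        return regionsInTransition.get(encodedName);
    }
}
```

Calling `handleClosed` on a fresh `RitSketch` returns false, whereas calling `processRegionInTransition(name, "CLOSING")` first makes the same call return true. In the real master the two paths run on different threads, so which order occurs depends on timing.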

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-18 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Resolution: Fixed
Fix Version/s: (was: 0.90.7)
Status: Resolved  (was: Patch Available)

Committed trunk and 0.92 branch.

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 
> 5200-v4-092.txt, 5200-v4.txt, 5200-v4no-prefix.txt, HBASE-5200.patch, 
> HBASE-5200_1.patch, HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch, hbase-5200_90_latest_new.patch
>
>




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-17 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Status: Patch Available  (was: Open)

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 5200-v4.txt, 
> 5200-v4no-prefix.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch, hbase-5200_90_latest_new.patch
>
>




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-17 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Status: Open  (was: Patch Available)

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 5200-v4.txt, 
> 5200-v4no-prefix.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch, hbase-5200_90_latest_new.patch
>
>




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-17 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Attachment: 5200-v4no-prefix.txt

v4 for hadoopqa

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 5200-v4.txt, 
> 5200-v4no-prefix.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch, hbase-5200_90_latest_new.patch
>
>




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-15 Thread ramkrishna.s.vasudevan (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-5200:
--

Attachment: hbase-5200_90_latest_new.patch

Updated Patch for 0.90

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 5200-v4.txt, 
> HBASE-5200.patch, HBASE-5200_1.patch, 
> HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch, hbase-5200_90_latest_new.patch
>
>




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-14 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Attachment: 5200-v4.txt

Fixes the issue Zhihong found: look up the HRI if the map has no entry for the 
encoded region name OR if the entry's value is null.
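
A minimal sketch of the lookup rule described in this comment, assuming a plain `Map` keyed by encoded region name. The class name, the `metaLookup` function, and the use of `String` in place of HRegionInfo are all hypothetical; this is not the code in 5200-v4.txt. The point it illustrates is that `Map.get` returns null both when the key is absent and when the key is mapped to null, so a single null check covers both cases, whereas `containsKey` alone would miss the second.

```java
import java.util.Map;
import java.util.function.Function;

/** Hypothetical helper; String stands in for HRegionInfo. */
class FailoverRegionLookup {
    /**
     * Returns the cached region info for encodedName, falling back to
     * metaLookup when the map has no entry OR the entry's value is null.
     */
    static String resolve(Map<String, String> failoverRegions,
                          String encodedName,
                          Function<String, String> metaLookup) {
        String hri = failoverRegions.get(encodedName);
        if (hri == null) {
            // Absent key and null value look the same here; both need the fallback.
            hri = metaLookup.apply(encodedName);
        }
        return hri;
    }
}
```

With a `HashMap` that holds `"abc" -> null`, `resolve(map, "abc", fallback)` calls the fallback, exactly as it would for a key that was never inserted.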

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 5200-v4.txt, 
> HBASE-5200.patch, HBASE-5200_1.patch, 
> HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch
>
>




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-14 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Attachment: 5200-v3.txt

I like your test additions.  I refactored the tests so that less state is kept 
between tests.

I also added a Map of failover regions w/ HRI as the value, so if a region is 
present we don't have to go to .META.
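
As a rough illustration of the failover-regions map idea, a minimal sketch might look like the following. It is not taken from the attached patch: the class name FailoverRegionCache, its methods, and the use of a String in place of HRegionInfo are all hypothetical, and the catalog lookup is passed in as a plain function so the example stays self-contained.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

/** Hypothetical cache of regions seen during master failover (not from the patch). */
final class FailoverRegionCache {
    // Encoded region name -> region info. A String stands in for HRegionInfo here.
    private final Map<String, String> failoverRegions = new ConcurrentHashMap<>();
    // Counts how often we had to fall back to the catalog lookup.
    private final AtomicInteger metaLookups = new AtomicInteger();

    /** Remembers the region info found while processing failover. */
    void put(String encodedName, String regionInfo) {
        failoverRegions.put(encodedName, regionInfo);
    }

    /** Returns the cached info if present; otherwise asks the supplied catalog lookup. */
    String lookup(String encodedName, Function<String, String> metaLookup) {
        String cached = failoverRegions.get(encodedName);
        if (cached != null) {
            return cached;
        }
        metaLookups.incrementAndGet();
        return metaLookup.apply(encodedName);
    }

    int metaLookups() {
        return metaLookups.get();
    }
}
```

With this shape, a region recorded via put() is answered from memory, and only unknown regions trigger the slower catalog read.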

I'm thinking though that we can avoid a bunch of the issues we're seeing here 
if we go the fb route of not handling events during failover.  Not handling 
them simplifies failover considerably.  Otherwise, all kinds of events can be 
triggered when we're not ready to handle them.  Let me have a go at the forward 
port of HBASE-5344 and then run these unit tests to see how we do w/ it in 
place.
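
One possible reading of "not handling events during failover" is to hold incoming events until failover processing has finished and only then act on them. The sketch below shows that reading only; it is an assumption and may not match what HBASE-5344 actually does. The class FailoverEventGate and its methods are hypothetical names, and events are plain strings.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

/** Hypothetical gate that buffers events until failover processing is done. */
final class FailoverEventGate {
    private final Queue<String> deferred = new ArrayDeque<>();
    private final List<String> handled = new ArrayList<>();
    private boolean failoverInProgress = true;

    /** Buffers the event while failover is running; handles it directly afterwards. */
    synchronized void onEvent(String event) {
        if (failoverInProgress) {
            deferred.add(event);
        } else {
            handled.add(event);
        }
    }

    /** Marks failover as finished and replays buffered events in arrival order. */
    synchronized void failoverComplete() {
        failoverInProgress = false;
        String event;
        while ((event = deferred.poll()) != null) {
            handled.add(event);
        }
    }

    /** Returns a copy of the events handled so far. */
    synchronized List<String> handled() {
        return new ArrayList<>(handled);
    }
}
```

Nothing is acted on until failoverComplete() is called, so a handler never sees an event before the in-memory state has been rebuilt.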

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, 5200-v3.txt, 
> HBASE-5200.patch, HBASE-5200_1.patch, 
> HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch
>
>
> This is the scenario:
> Consider a case where the balancer is running and is trying to close regions 
> on a RS.
> Before the close completes, a master switch happens.  
> On master switch, the set of nodes that are in RIT is collected; we first 
> get the data and start watching the node.
> After that, the node data is added into RIT.
> If, in this window (before the node is added to RIT), the RS on which close 
> was called does a transition, AM.handleRegion() misses the handling, reporting 
> that the RIT state was null.
> {code}
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> a66d281d231dfcaea97c270698b26b6f from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> c12e53bfd48ddc5eec507d66821c4d23 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 59ae13de8c1eb325a0dd51f4902d2052 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> f45bc9614d7575f35244849af85aa078 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> cc3ecd7054fe6cd4a1159ed92fd62641 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 3af40478a17fee96b4a192b22c90d5a2 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> e6096a8466e730463e10d3d61f809b92 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 4806781a1a23066f7baed22b4d237e24 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> d69e104131accaefe21dcc01fddc7629 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> {code}
> In branch the CLOSING node is created by RS thus leading to more 
> inconsistency.
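
The interleaving described in the scenario can be reproduced deterministically in a toy model. The sketch below is an illustration only and is not AssignmentManager code: the class RitRaceSketch and its methods are hypothetical, and region states are plain strings. It shows that a CLOSED event arriving before the region is added to RIT is dropped, which matches the "state null" warnings in the log.

```java
import java.util.HashMap;
import java.util.Map;

/** Toy model of the regions-in-transition (RIT) map; all names are hypothetical. */
final class RitRaceSketch {
    private final Map<String, String> regionsInTransition = new HashMap<>();

    /** Stand-in for the failover path adding a watched node's state to RIT. */
    synchronized void addToRit(String region, String state) {
        regionsInTransition.put(region, state);
    }

    /**
     * Stand-in for handling a CLOSED event. Returns false when the event is
     * dropped because RIT has no entry, or an unexpected one, for the region.
     */
    synchronized boolean handleClosed(String region) {
        String state = regionsInTransition.get(region);
        if (!"PENDING_CLOSE".equals(state) && !"CLOSING".equals(state)) {
            return false; // also covers the "state null" case
        }
        regionsInTransition.put(region, "CLOSED");
        return true;
    }

    /** Returns the current RIT state for the region, or null if absent. */
    synchronized String stateOf(String region) {
        return regionsInTransition.get(region);
    }
}
```

Calling handleClosed("r1") before addToRit("r1", "CLOSING") returns false, and the region then stays in CLOSING because no further event arrives to move it on.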


[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-14 Thread ramkrishna.s.vasudevan (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-5200:
--

Status: Patch Available  (was: Open)

I will submit the 0.90 patch tomorrow.

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, HBASE-5200.patch, 
> HBASE-5200_1.patch, HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-14 Thread ramkrishna.s.vasudevan (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-5200:
--

Status: Open  (was: Patch Available)

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, HBASE-5200.patch, 
> HBASE-5200_1.patch, HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-14 Thread ramkrishna.s.vasudevan (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-5200:
--

Attachment: HBASE-5200_trunk_latest_with_test_2.patch

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, HBASE-5200.patch, 
> HBASE-5200_1.patch, HBASE-5200_trunk_latest_with_test_2.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-13 Thread stack (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-5200:
-

Attachment: 5200-test.txt

A unit test for this condition.

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-test.txt, 5200-v2.txt, HBASE-5200.patch, 
> HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-13 Thread ramkrishna.s.vasudevan (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-5200:
--

Attachment: hbase-5200_90_latest.patch

Patch for 0.90

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-v2.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml, 
> hbase-5200_90_latest.patch





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-03 Thread Zhihong Yu (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhihong Yu updated HBASE-5200:
--

Comment: was deleted

(was: -1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12513158/5200-v2.txt
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

-1 javadoc.  The javadoc tool appears to have generated -136 warning 
messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

-1 findbugs.  The patch appears to introduce 155 new Findbugs (version 
1.3.9) warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

-1 core tests.  The patch failed these unit tests:
  org.apache.hadoop.hbase.TestInfoServers
  org.apache.hadoop.hbase.client.TestAdmin
  org.apache.hadoop.hbase.io.hfile.TestHFileBlock
  org.apache.hadoop.hbase.master.TestZKBasedOpenCloseRegion
  org.apache.hadoop.hbase.mapreduce.TestImportTsv
  org.apache.hadoop.hbase.mapred.TestTableMapReduce
  org.apache.hadoop.hbase.mapreduce.TestHFileOutputFormat

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/898//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/898//artifact/trunk/patchprocess/newPatchFindbugsWarnings.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/898//console

This message is automatically generated.)

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-v2.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml

[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-03 Thread Zhihong Yu (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhihong Yu updated HBASE-5200:
--

Attachment: 5200-v2.txt

Re-attach patch v2 for Hadoop QA.

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-v2.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml
>
>
> This is the scenario:
> Consider a case where the balancer is running and is trying to close regions 
> on a RS.
> Before the regions could be closed, a master switch (failover) happens.
> On the master switch, the set of nodes that are in RIT is collected; for each 
> node we first get its data and start watching the node.
> Only after that is the node data added to RIT.
> If, in this window (before the node is added to RIT), the RS that was asked to 
> close the region makes a transition, AM.handleRegion() skips the event 
> because the RIT state is null.
> {code}
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> a66d281d231dfcaea97c270698b26b6f from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> c12e53bfd48ddc5eec507d66821c4d23 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,358 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 59ae13de8c1eb325a0dd51f4902d2052 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> f45bc9614d7575f35244849af85aa078 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> cc3ecd7054fe6cd4a1159ed92fd62641 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 3af40478a17fee96b4a192b22c90d5a2 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> e6096a8466e730463e10d3d61f809b92 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> 4806781a1a23066f7baed22b4d237e24 from server 
> HOST-192-168-47-204,20020,1326342744518 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> 2012-01-13 10:50:46,359 WARN 
> org.apache.hadoop.hbase.master.AssignmentManager: Received CLOSED for region 
> d69e104131accaefe21dcc01fddc7629 from server 
> HOST-192-168-47-205,20020,1326363111288 but region was in  the state null and 
> not in expected PENDING_CLOSE or CLOSING states
> {code}
> In the branch, the CLOSING node is created by the RS, which leads to further 
> inconsistency.
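
The window described in the scenario can be replayed deterministically with a small model. This is only a sketch under stated assumptions, not the actual AssignmentManager code: the class RitRaceSketch, its methods addToRit(), handleClosed() and replay(), and the two-value State enum are hypothetical names made up for illustration, and ZooKeeper is not involved at all. It shows that when the CLOSED event is handled before the node data is added to RIT, the event is dropped and RIT keeps the stale CLOSING state.

```java
import java.util.HashMap;
import java.util.Map;

/** Simplified, hypothetical model of the race; NOT the real HBase AssignmentManager. */
class RitRaceSketch {
    enum State { CLOSING, CLOSED }

    // Stand-in for the master's in-memory regions-in-transition (RIT) map.
    private final Map<String, State> rit = new HashMap<>();

    /** Failover path, second step: record the state that was read from the node. */
    void addToRit(String region, State stateReadFromNode) {
        rit.put(region, stateReadFromNode);
    }

    /** Event path: returns true if the CLOSED event was applied, false if it was dropped. */
    boolean handleClosed(String region) {
        State current = rit.get(region);
        if (current == null) {
            // Corresponds to the WARN in the log: the state is null, so the event is skipped.
            return false;
        }
        rit.put(region, State.CLOSED);
        return true;
    }

    /** Replays the failover for one region and returns the final state held in RIT. */
    static State replay(boolean closedArrivesBeforeRitUpdate) {
        RitRaceSketch am = new RitRaceSketch();
        String region = "a66d281d231dfcaea97c270698b26b6f";
        // Failover path, first step: get the node data and set the watch (modelled as a read).
        State readFromNode = State.CLOSING;
        if (closedArrivesBeforeRitUpdate) {
            am.handleClosed(region);            // the watch fires inside the window
            am.addToRit(region, readFromNode);  // stale data is added afterwards
        } else {
            am.addToRit(region, readFromNode);
            am.handleClosed(region);            // the event finds the region in RIT
        }
        return am.rit.get(region);
    }

    public static void main(String[] args) {
        System.out.println("CLOSED before RIT update: RIT ends as " + replay(true));
        System.out.println("CLOSED after RIT update:  RIT ends as " + replay(false));
    }
}
```

In this model the first interleaving leaves the region in CLOSING in RIT even though the RS has already reported CLOSED, which is the inconsistency the log excerpt points to. The sketch only reproduces the problem; it does not represent what the attached patches do.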

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-03 Thread Zhihong Yu (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhihong Yu updated HBASE-5200:
--

Attachment: (was: 5200-v2.txt)

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
> Issue Type: Bug
> Affects Versions: 0.90.5
> Reporter: ramkrishna.s.vasudevan
> Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-v2.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-02-03 Thread Zhihong Yu (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhihong Yu updated HBASE-5200:
--

Attachment: 5200-v2.txt

Patch v2 incorporates review comments

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.90.5
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: 5200-v2.txt, HBASE-5200.patch, HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml





[jira] [Updated] (HBASE-5200) AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the region assignment inconsistent

2012-01-31 Thread Zhihong Yu (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-5200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhihong Yu updated HBASE-5200:
--

Fix Version/s: 0.94.0
 Hadoop Flags: Reviewed
  Summary: AM.ProcessRegionInTransition() and AM.handleRegion() race 
thus leaving the region assignment inconsistent  (was: 
AM.ProcessRegionInTransition() and AM.handleRegion() races thus leaving the 
region assignment inconsistent.)

> AM.ProcessRegionInTransition() and AM.handleRegion() race thus leaving the 
> region assignment inconsistent
> -
>
> Key: HBASE-5200
> URL: https://issues.apache.org/jira/browse/HBASE-5200
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 0.90.5
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 0.94.0, 0.90.7, 0.92.1
>
> Attachments: HBASE-5200.patch, HBASE-5200_1.patch, 
> TEST-org.apache.hadoop.hbase.master.TestRestartCluster.xml
