[jira] [Updated] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-4379: -- Status: Open (was: Patch Available) [hbck] Does not complain about tables with no end region [Z,] - Key: HBASE-4379 URL: https://issues.apache.org/jira/browse/HBASE-4379 Project: HBase Issue Type: Bug Components: hbck Affects Versions: 0.92.0, 0.90.5 Reporter: Jonathan Hsieh Assignee: Anoop Sam John Attachments: 0001-HBASE-4379-hbck-does-not-complain-about-tables-with-.patch, HBASE-4379_94.patch, hbase-4379.v2.patch hbck does not detect or have an error condition when the last region of a table is missing (end key != ''). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-6267) hbase.store.delete.expired.storefile should be true by default
[ https://issues.apache.org/jira/browse/HBASE-6267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401194#comment-13401194 ] Hudson commented on HBASE-6267: --- Integrated in HBase-0.94 #281 (See [https://builds.apache.org/job/HBase-0.94/281/]) HBASE-6267. hbase.store.delete.expired.storefile should be true by default (Revision 1353813) Result = FAILURE apurtell : Files :
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java
* /hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/io/hfile/TestScannerSelectionUsingTTL.java
hbase.store.delete.expired.storefile should be true by default -- Key: HBASE-6267 URL: https://issues.apache.org/jira/browse/HBASE-6267 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.96.0, 0.94.1 Attachments: HBASE-6267-0.94.patch, HBASE-6267.patch HBASE-5199 introduces this logic into Store:
{code}
+    // Delete the expired store files before the compaction selection.
+    if (conf.getBoolean("hbase.store.delete.expired.storefile", false)
+        && (ttl != Long.MAX_VALUE) && (this.scanInfo.minVersions == 0)) {
+      CompactSelection expiredSelection = compactSelection
+          .selectExpiredStoreFilesToCompact(
+              EnvironmentEdgeManager.currentTimeMillis() - this.ttl);
+
+      // If there are any expired store files, delete them by compaction.
+      if (expiredSelection != null) {
+        return expiredSelection;
+      }
+    }
{code}
Is there any reason why that should not be default {{true}}?
[jira] [Updated] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-4379: -- Attachment: HBASE-4379_94_V2.patch HBASE-4379_Trunk.patch
[jira] [Commented] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401195#comment-13401195 ] Jieshan Bean commented on HBASE-6200: - Ya... You reminded me. I have one idea to optimize this: 1. If the left family length != the right family length, comparing the column families alone is enough. 2. If the left family length == the right family length, we can compare family and qualifier together. So no matter which case, only one comparison will happen. I will test it right now. KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, HBASE-6200-90-v2.patch, HBASE-6200-90.patch, HBASE-6200-92-v2.patch, HBASE-6200-92.patch, HBASE-6200-94-v2.patch, HBASE-6200-94.patch, HBASE-6200-trunk-v2.patch, HBASE-6200-trunk.patch, PerformanceTestCase-6200-94.patch As reported by Desert Rose on IRC and on the ML, {{Result}} has a weird behavior when some families share the same prefix. He posted a link to his code to show how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. Then what happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} uses {{KeyComparator.compareWithoutRow}}, which has a different sorting, so the end result is undetermined. I added some debug and I can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier, which fails, so null is returned.
I don't know how frequent it is for users to have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker.
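The misordering described above can be reproduced with plain byte comparison. Below is a minimal standalone sketch, not HBase code; the `compare` helper mimics an unsigned lexicographic comparison like `Bytes.compareTo`:

```java
/**
 * Sketch of why comparing the column (family + qualifier) as one
 * concatenated byte run misorders families sharing a prefix:
 * "f:a" concatenates to "fa", "f1:" concatenates to "f1",
 * and 'a' (0x61) > '1' (0x31), even though family "f" sorts
 * before family "f1".
 */
public class ColumnOrderSketch {

    // Unsigned lexicographic byte comparison, similar to Bytes.compareTo.
    static int compare(byte[] left, byte[] right) {
        int n = Math.min(left.length, right.length);
        for (int i = 0; i < n; i++) {
            int diff = (left[i] & 0xff) - (right[i] & 0xff);
            if (diff != 0) return diff;
        }
        return left.length - right.length;
    }

    public static void main(String[] args) {
        byte[] concatFA = "fa".getBytes(); // family "f" + qualifier "a"
        byte[] concatF1 = "f1".getBytes(); // family "f1" + empty qualifier
        // Concatenated comparison claims f:a > f1: -- the reported bug.
        System.out.println(compare(concatFA, concatF1) > 0); // true
        // Comparing families first gives the correct order: f < f1.
        System.out.println(compare("f".getBytes(), "f1".getBytes()) < 0); // true
    }
}
```

This also shows why Jieshan's optimization is safe: when family lengths differ, the families alone already decide the order, so only one comparison is needed either way.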
[jira] [Commented] (HBASE-6233) [brainstorm] snapshots: hardlink alternatives
[ https://issues.apache.org/jira/browse/HBASE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401198#comment-13401198 ] Matteo Bertozzi commented on HBASE-6233: @Jon Yes, on taking a snapshot you rename the hfile into the .snapshot/files directory and replace it with a symlink. You also need to create a symlink in the .snapshot/name/ folder (the one that describes the snapshot). When you want to restore, you just have to create a symlink to the file. I see two advantages to this approach. One is that the code remains unchanged: fs.delete() stays fs.delete() (all the symlink code is done in takeSnapshot() and nothing changes from the hbase point of view). The other one is: * hbase 0.96 ships with snapshots (hardlink alternative) * hbase 0.98 ships with snapshots + hdfs hardlinks. If you use the approach I've described, a user who has taken snapshots using 0.96 doesn't have to do anything special to migrate to 0.98: symlinks to .snapshot/files/ keep working, the future 'take snapshot' just creates a hardlink in .snapshot/name/, and restore is another hardlink against .snapshot/name. In the other case (catch the exception and retry) you need to keep the logic in 0.98 or write some fancy script that searches for the Reference files and replaces them with hardlinks. [brainstorm] snapshots: hardlink alternatives - Key: HBASE-6233 URL: https://issues.apache.org/jira/browse/HBASE-6233 Project: HBase Issue Type: Brainstorming Reporter: Matteo Bertozzi Assignee: Matteo Bertozzi Discussion ticket around snapshots and hardlink alternatives.
(See the HDFS-3370 discussion about hardlinks and implementation problems.) (Taking WAL out of the discussion for a moment and focusing on hfiles.) With hardlinks available, taking a snapshot would be fairly easy: * (hfiles are immutable) * hardlink to .snapshot/name to take a snapshot * hardlink from .snapshot/name to restore the snapshot * No code change needed (on fs.delete() only one reference is deleted) But we don't have hardlinks, so what are the alternatives?
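The rename-plus-symlink scheme Matteo describes can be illustrated on a local filesystem. The sketch below is a hypothetical analogue using java.nio with made-up paths (HDFS symlinks would go through different APIs, and this is not the actual HBase layout): taking a snapshot moves the immutable hfile under .snapshot/files/ and leaves symlinks behind, so a later delete of the table's link does not destroy the snapshotted data.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class SnapshotLinkSketch {

    static void takeSnapshot(Path hfile, Path snapshotRoot, String name) throws IOException {
        Path files = Files.createDirectories(snapshotRoot.resolve("files"));
        Path snapDir = Files.createDirectories(snapshotRoot.resolve(name));
        Path stored = files.resolve(hfile.getFileName().toString());
        Files.move(hfile, stored);               // real bytes now live in .snapshot/files/
        Files.createSymbolicLink(hfile, stored); // table keeps working through a link
        // snapshot "manifest" is just another link against the stored file
        Files.createSymbolicLink(snapDir.resolve(hfile.getFileName().toString()), stored);
    }

    static void restore(Path snapshotRoot, String name, Path dest) throws IOException {
        // Restoring is just creating another symlink per snapshotted file.
        try (DirectoryStream<Path> links = Files.newDirectoryStream(snapshotRoot.resolve(name))) {
            for (Path link : links) {
                Path stored = Files.readSymbolicLink(link);
                Path target = dest.resolve(link.getFileName().toString());
                Files.deleteIfExists(target);
                Files.createSymbolicLink(target, stored);
            }
        }
    }

    public static void main(String[] args) throws IOException {
        Path root = Files.createTempDirectory("snapsketch");
        Path table = Files.createDirectories(root.resolve("table"));
        Path hfile = Files.write(table.resolve("hfile1"), new byte[]{1, 2, 3});
        takeSnapshot(hfile, Files.createDirectories(root.resolve(".snapshot")), "snap1");
        Files.delete(hfile); // "drop" deletes only the table's link, not the data
        restore(root.resolve(".snapshot"), "snap1", table);
        System.out.println(Files.readAllBytes(table.resolve("hfile1")).length); // 3
    }
}
```

The key property this illustrates is the one Matteo points out: fs.delete() on the table's copy stays a plain delete, because the link, not the data, is removed.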
[jira] [Commented] (HBASE-6233) [brainstorm] snapshots: hardlink alternatives
[ https://issues.apache.org/jira/browse/HBASE-6233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401204#comment-13401204 ] Zhihong Ted Yu commented on HBASE-6233: --- From the discussion of HDFS-3370, it is unknown when hdfs hardlinks would get accepted.
[jira] [Commented] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401205#comment-13401205 ] Zhihong Ted Yu commented on HBASE-6200: --- The above approach should work.
[jira] [Commented] (HBASE-6220) PersistentMetricsTimeVaryingRate gets used for non-time-based metrics
[ https://issues.apache.org/jira/browse/HBASE-6220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401212#comment-13401212 ] Hadoop QA commented on HBASE-6220: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533426/ServerMetrics_HBASE_6220.patch against trunk revision . +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 6 new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.hbase.replication.TestReplication Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2254//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2254//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2254//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2254//console This message is automatically generated. PersistentMetricsTimeVaryingRate gets used for non-time-based metrics - Key: HBASE-6220 URL: https://issues.apache.org/jira/browse/HBASE-6220 Project: HBase Issue Type: Bug Components: metrics Affects Versions: 0.96.0 Reporter: David S. 
Wang Priority: Minor Labels: noob Attachments: ServerMetrics_HBASE_6220.patch PersistentMetricsTimeVaryingRate gets used for metrics that are not time-based, leading to confusing names such as avg_time for compaction size, etc. You have to read the code in order to understand that this is actually referring to bytes, not seconds.
[jira] [Commented] (HBASE-6170) Timeouts for row lock and scan should be separate
[ https://issues.apache.org/jira/browse/HBASE-6170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401219#comment-13401219 ] Hadoop QA commented on HBASE-6170: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533425/HBASE-6170v1.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 6 new or modified tests. +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 6 new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2255//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2255//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2255//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2255//console This message is automatically generated. Timeouts for row lock and scan should be separate - Key: HBASE-6170 URL: https://issues.apache.org/jira/browse/HBASE-6170 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.94.0 Reporter: Otis Gospodnetic Assignee: Chris Trezzo Priority: Minor Fix For: 0.96.0 Attachments: HBASE-6170v1.patch Apparently the timeout used for row locking and for scanning is global. It would be better to have two separate timeouts. 
(opening the issue to make Lars George happy)
[jira] [Commented] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401220#comment-13401220 ] Hadoop QA commented on HBASE-4379: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533442/HBASE-4379_94_V2.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. -1 patch. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2256//console This message is automatically generated.
[jira] [Updated] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-4379: -- Attachment: HBASE-4379_Trunk.patch
[jira] [Updated] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-4379: -- Status: Open (was: Patch Available)
[jira] [Updated] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-4379: -- Attachment: (was: HBASE-4379_Trunk.patch)
[jira] [Updated] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-4379: -- Status: Patch Available (was: Open)
[jira] [Commented] (HBASE-6205) Support an option to keep data of dropped table for some time
[ https://issues.apache.org/jira/browse/HBASE-6205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401228#comment-13401228 ] Anoop Sam John commented on HBASE-6205: --- How about trying Devaraj's idea? If we do it this way, we need to see whether it will have some impact on tools like HBCK. Support an option to keep data of dropped table for some time - Key: HBASE-6205 URL: https://issues.apache.org/jira/browse/HBASE-6205 Project: HBase Issue Type: New Feature Affects Versions: 0.94.0, 0.96.0 Reporter: chunhui shen Assignee: chunhui shen Fix For: 0.96.0 Attachments: HBASE-6205.patch, HBASE-6205v2.patch, HBASE-6205v3.patch, HBASE-6205v4.patch, HBASE-6205v5.patch A user may drop a table accidentally because of erroneous code or other uncertain reasons. Unfortunately, it happened in our environment because one user made a mistake between the production cluster and the testing cluster. So I just give a suggestion: do we need to support an option to keep the data of a dropped table for some time, e.g. 1 day? In the patch: We make a new dir named .trashtables in the root dir. In DeleteTableHandler, we move files in the dropped table's dir to the trash table dir instead of deleting them directly. And we create a new class TrashCleaner which, with a periodic check, cleans dropped tables whose keep time has expired. The default keep time for dropped tables is 1 day, and the check period is 1 hour.
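The .trashtables flow described in the issue can be sketched roughly as follows. Class and method names here are illustrative only, not the patch's actual code, and this uses the local filesystem rather than HDFS:

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;

public class TrashTablesSketch {
    static final long KEEP_MS = 24L * 60 * 60 * 1000; // default: keep dropped tables 1 day

    static Path moveToTrash(Path tableDir, Path trashRoot) throws IOException {
        Files.createDirectories(trashRoot);
        // Encode the drop time in the entry name so the cleaner needs no extra state.
        Path entry = trashRoot.resolve(tableDir.getFileName() + "." + System.currentTimeMillis());
        return Files.move(tableDir, entry);
    }

    // Called periodically (e.g. every hour): remove entries past their keep time.
    static void cleanExpired(Path trashRoot, long now) throws IOException {
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(trashRoot)) {
            for (Path entry : entries) {
                String name = entry.getFileName().toString();
                long droppedAt = Long.parseLong(name.substring(name.lastIndexOf('.') + 1));
                if (now - droppedAt > KEEP_MS) {
                    deleteRecursively(entry);
                }
            }
        }
    }

    static void deleteRecursively(Path p) throws IOException {
        if (Files.isDirectory(p)) {
            try (DirectoryStream<Path> children = Files.newDirectoryStream(p)) {
                for (Path c : children) deleteRecursively(c);
            }
        }
        Files.delete(p);
    }

    public static void main(String[] args) throws IOException {
        Path root = Files.createTempDirectory("trashsketch");
        Path table = Files.createDirectories(root.resolve("mytable"));
        Path trash = root.resolve(".trashtables");
        moveToTrash(table, trash);
        cleanExpired(trash, System.currentTimeMillis());                   // within keep time
        System.out.println(Files.list(trash).count());                     // 1
        cleanExpired(trash, System.currentTimeMillis() + KEEP_MS + 1000);  // past keep time
        System.out.println(Files.list(trash).count());                     // 0
    }
}
```

Timestamping the trash entry at drop time is what lets a stateless periodic cleaner decide expiry; it also sidesteps name collisions if the same table is dropped and recreated.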
[jira] [Commented] (HBASE-6205) Support an option to keep data of dropped table for some time
[ https://issues.apache.org/jira/browse/HBASE-6205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401239#comment-13401239 ] chunhui shen commented on HBASE-6205: - bq. How about trying Devaraj's idea? If a user dropped the table and then creates a table with the same name, will something go wrong? Another problem: if we set the table disable_delete, can the user see this table?
[jira] [Commented] (HBASE-6228) Fixup daughters twice cause daughter region assigned twice
[ https://issues.apache.org/jira/browse/HBASE-6228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401243#comment-13401243 ] ramkrishna.s.vasudevan commented on HBASE-6228: --- +1 Chunhui. Your explanation is right. Sorry for making noise here. :) Thanks. Fixup daughters twice cause daughter region assigned twice --- Key: HBASE-6228 URL: https://issues.apache.org/jira/browse/HBASE-6228 Project: HBase Issue Type: Bug Components: master Reporter: chunhui shen Assignee: chunhui shen Fix For: 0.96.0 Attachments: HBASE-6228.patch, HBASE-6228v2.patch, HBASE-6228v2.patch First, how does fixing up daughters twice happen? 1. We fixupDaughters at the end of HMaster#finishInitialization. 2. ServerShutdownHandler will fixupDaughters when reassigning a region through ServerShutdownHandler#processDeadRegion. When fixing up daughters, we add the daughters to .META., but that couldn't prevent the above case, because of FindDaughterVisitor. The detail is as follows: Suppose region A is a split parent region, and its daughter region B is missing. 1. First, the ServerShutdownHandler thread fixes up the daughter, so it adds daughter region B to .META. with serverName=null, and assigns the daughter. 2. Then, the Master's initialization thread will also find that daughter region B is missing and assign it. This is because FindDaughterVisitor considers a daughter missing if its serverName=null.
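The double-assignment sequence in the description can be shown with a toy model (hypothetical names, not HBase code): both fixup paths see serverName == null and trigger an assignment, unless an in-flight guard makes the fixup idempotent.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;
import java.util.concurrent.atomic.AtomicInteger;

public class DoubleFixupSketch {
    static final AtomicInteger assignments = new AtomicInteger();
    static final Set<String> inFlight = Collections.synchronizedSet(new HashSet<>());

    // Naive check: assigns whenever .META. shows no server for the daughter.
    static void naiveFixup(String region, String serverName) {
        if (serverName == null) assignments.incrementAndGet();
    }

    // Guarded check: the second caller sees the region is already being assigned.
    static void guardedFixup(String region, String serverName) {
        if (serverName == null && inFlight.add(region)) assignments.incrementAndGet();
    }

    public static void main(String[] args) {
        naiveFixup("daughterB", null);          // ServerShutdownHandler path
        naiveFixup("daughterB", null);          // master initialization path
        System.out.println(assignments.get());  // 2 -- the region gets assigned twice

        assignments.set(0);
        guardedFixup("daughterB", null);
        guardedFixup("daughterB", null);
        System.out.println(assignments.get());  // 1
    }
}
```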
[jira] [Commented] (HBASE-6205) Support an option to keep data of dropped table for some time
[ https://issues.apache.org/jira/browse/HBASE-6205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401245#comment-13401245 ] ramkrishna.s.vasudevan commented on HBASE-6205: --- bq. if we set the table disable_delete, can the user see this table? I think here the user should be able to see it until the table is really deleted. bq. creates a table with the same name, will something go wrong? If the drop is done completely then we should allow table creation with the same name. Maybe until then we should not allow it. Just my thoughts on this.
[jira] [Created] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
ShiXing created HBASE-6269: -- Summary: Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue Key: HBASE-6269 URL: https://issues.apache.org/jira/browse/HBASE-6269 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.94.0 Reporter: ShiXing Assignee: ShiXing Attachments: HBASE-6269-v1.patch While fixing HBASE-6195, I found that the test case sometimes fails: https://builds.apache.org/job/HBase-0.94/259/. If there are two Puts/Increments with the same row, family, qualifier, and timestamp but different memstoreTS values, and we do a memstore flush after each Put/Increment, there will be two StoreFiles containing the same KeyValue (differing only in memstoreTS and SequenceId). When I then get the row, I always get the old record. The test case is as follows:
{code}
public void testPutWithMemStoreFlush() throws Exception {
  Configuration conf = HBaseConfiguration.create();
  String method = "testPutWithMemStoreFlush";
  byte[] tableName = Bytes.toBytes(method);
  byte[] family = Bytes.toBytes("family");
  byte[] qualifier = Bytes.toBytes("qualifier");
  byte[] row = Bytes.toBytes("putRow");
  byte[] value = null;
  this.region = initHRegion(tableName, method, conf, family);
  Put put = null;
  Get get = null;
  List<KeyValue> kvs = null;
  Result res = null;

  put = new Put(row);
  value = Bytes.toBytes("value0");
  put.add(family, qualifier, 1234567L, value);
  region.put(put);
  System.out.print("get value before flush after put value0 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value0 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }

  put = new Put(row);
  value = Bytes.toBytes("value1");
  put.add(family, qualifier, 1234567L, value);
  region.put(put);
  System.out.print("get value before flush after put value1 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value1 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }

  put = new Put(row);
  value = Bytes.toBytes("value2");
  put.add(family, qualifier, 1234567L, value);
  region.put(put);
  System.out.print("get value before flush after put value2 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value2 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
}
{code}
and the result prints as follows:
{noformat}
get value before flush after put value0 : value0
get value after flush after put value0 : value0
get value before flush after put value1 : value1
get value after flush after put value1 : value0
get value before flush after put value2 : value2
get value after flush after put value2 : value0
{noformat}
Analyzing the StoreFileScanner code with lazy seek: the StoreFileScanners are sorted by SequenceId, so the latest StoreFile is on top of the KeyValueHeap, and the KeyValue from the latest StoreFile is compared against the second-latest StoreFile. But the second-latest StoreFile generates a fake KeyValue for the same row, family, and qualifier, except with the maximum timestamp and memstoreTS 0. So the KeyValue from the latest StoreFile is incorrectly judged as not newer than the one from the second-latest. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
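The heap-ordering problem described above can be sketched with a toy comparator. This is a hypothetical model, not the real HBase 0.94 API: an unseeked scanner sits in the heap behind a "fake" key carrying the maximum timestamp and memstoreTS 0, which makes it sort ahead of the genuinely newer KeyValue.

```java
import java.util.Comparator;

public class LazySeekSketch {
    // Toy stand-in for a KeyValue whose row/family/qualifier already match.
    static final class Key {
        final long timestamp;
        final long memstoreTS;
        Key(long ts, long mvcc) { timestamp = ts; memstoreTS = mvcc; }
    }

    // Newest first: larger timestamp wins, then larger memstoreTS.
    static final Comparator<Key> NEWEST_FIRST = (a, b) -> {
        int c = Long.compare(b.timestamp, a.timestamp);
        return c != 0 ? c : Long.compare(b.memstoreTS, a.memstoreTS);
    };

    public static void main(String[] args) {
        Key realLatest = new Key(1234567L, 8L);           // from the newest StoreFile
        Key fakeFromOlder = new Key(Long.MAX_VALUE, 0L);  // fake key of an unseeked older file

        // The fake key's MAX_VALUE timestamp makes it sort ahead of the
        // genuinely newer KeyValue, so the heap pops the older StoreFile first.
        System.out.println(NEWEST_FIRST.compare(fakeFromOlder, realLatest) < 0); // true
    }
}
```

Once the older scanner is popped first and really seeks, its actual KeyValue wins the tie on timestamp, which matches the "always get the old record" symptom.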
[jira] [Updated] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ShiXing updated HBASE-6269: --- Attachment: HBASE-6269-v1.patch
[jira] [Commented] (HBASE-6195) Increment data will be lost when the memstore is flushed
[ https://issues.apache.org/jira/browse/HBASE-6195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401267#comment-13401267 ] ShiXing commented on HBASE-6195: I found that the problem is introduced by the lazy seek. I have opened a JIRA for this problem: HBASE-6269. Increment data will be lost when the memstore is flushed Key: HBASE-6195 URL: https://issues.apache.org/jira/browse/HBASE-6195 Project: HBase Issue Type: Bug Components: regionserver Reporter: ShiXing Assignee: ShiXing Fix For: 0.96.0, 0.94.1 Attachments: 6195-trunk-V7.patch, 6195.addendum, HBASE-6195-trunk-V2.patch, HBASE-6195-trunk-V3.patch, HBASE-6195-trunk-V4.patch, HBASE-6195-trunk-V5.patch, HBASE-6195-trunk-V6.patch, HBASE-6195-trunk.patch There are two problems in increment() now. First: the timestamp (the variable {{now}}) in HRegion's increment() is generated before the rowLock is acquired, so when multiple threads increment the same row, a thread that generated its timestamp earlier may acquire the lock later. Because increment stores only one version, the result has been correct so far. But when the region is flushing, the increment reads the kv from whichever of the snapshot and the memstore has the larger timestamp and writes it back to the memstore. If the snapshot's timestamp is larger than the memstore's, the increment reads the old data and then increments it, which is wrong. Second: there is another risk in increment. Because it writes to the memstore first and then to the HLog, if the HLog write fails, the client can still read the incremented value. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
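The first problem above, the timestamp being chosen before the row lock is taken, can be sketched with a minimal single-cell model (hypothetical code, not HRegion itself): a thread that read the clock earlier can apply its increment later, leaving the single stored version stamped with an older timestamp.

```java
public class IncrementRaceSketch {
    // A single-version cell: only the last written value/timestamp survive.
    static long cellValue = 0;
    static long cellTs = 0;

    // Applies an increment whose timestamp was chosen BEFORE locking.
    static synchronized void applyIncrement(long tsChosenBeforeLock, long delta) {
        cellValue += delta;
        cellTs = tsChosenBeforeLock; // last writer's timestamp wins, even if older
    }

    public static void main(String[] args) {
        long tsA = 1; // thread A reads the clock first...
        long tsB = 2; // ...thread B reads it later,
        applyIncrement(tsB, 1); // but B acquires the row lock first
        applyIncrement(tsA, 1); // A applies afterwards with the OLDER timestamp
        // The cell now holds value 2 stamped with timestamp 1: on a flush,
        // comparing snapshot vs. memstore by timestamp can pick the wrong copy.
        System.out.println(cellValue + "@" + cellTs);
    }
}
```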
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: HBASE-6200-90.patch) KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, HBASE-6200-90-v2.patch, HBASE-6200-92-v2.patch, HBASE-6200-94-v2.patch, PerformanceTestCase-6200-94.patch As reported by Desert Rose on IRC and on the ML, {{Result}} has a weird behavior when some families share the same prefix. He posted a link to his code to show how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. What happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} uses {{KeyComparator.compareWithoutRow}}, which has a different sorting, so the end result is undetermined. I added some debug output and I can see that the data is returned in the right order but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequent it is for users to have families with the same prefix, but those who do, and who use those families at the same time, will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
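The f:a vs. f1: mis-ordering can be demonstrated without HBase. In this self-contained sketch a plain byte-wise compare stands in for the real comparator: flattening family+qualifier into one buffer orders "f:a" after "f1:", while comparing the family first gives the correct order.

```java
import java.util.Arrays;

public class PrefixFamilySketch {
    // Lexicographic unsigned byte comparison, shorter array first on ties.
    static int compareBytes(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int d = (a[i] & 0xff) - (b[i] & 0xff);
            if (d != 0) return d;
        }
        return a.length - b.length;
    }

    static byte[] concat(byte[] x, byte[] y) {
        byte[] r = Arrays.copyOf(x, x.length + y.length);
        System.arraycopy(y, 0, r, x.length, y.length);
        return r;
    }

    public static void main(String[] args) {
        byte[] famF = "f".getBytes(), famF1 = "f1".getBytes();
        byte[] qualA = "a".getBytes(), qualEmpty = "".getBytes();

        // Flattened comparison: "fa" vs "f1" -> 'a' (0x61) > '1' (0x31),
        // so f:a wrongly sorts AFTER f1: .
        System.out.println(compareBytes(concat(famF, qualA), concat(famF1, qualEmpty)) > 0);

        // Comparing families first: "f" < "f1", so f:a correctly comes first.
        System.out.println(compareBytes(famF, famF1) < 0);
    }
}
```

Because the region server sorts one way and Result.binarySearch the other, the binary search can land on the wrong KeyValue, which explains the null results seen in the report.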
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: HBASE-6200-94.patch)
[jira] [Commented] (HBASE-4379) [hbck] Does not complain about tables with no end region [Z,]
[ https://issues.apache.org/jira/browse/HBASE-4379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401286#comment-13401286 ] Hadoop QA commented on HBASE-4379: --
-1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533445/HBASE-4379_Trunk.patch against trunk revision .
+1 @author. The patch does not contain any @author tags.
+1 tests included. The patch appears to include 3 new or modified tests.
+1 hadoop2.0. The patch compiles against the hadoop 2.0 profile.
+1 javadoc. The javadoc tool did not generate any warning messages.
+1 javac. The applied patch does not increase the total number of javac compiler warnings.
-1 findbugs. The patch appears to introduce 7 new Findbugs (version 1.3.9) warnings.
+1 release audit. The applied patch does not increase the total number of release audit warnings.
+1 core tests. The patch passed unit tests in .
Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2257//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2257//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2257//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2257//console
This message is automatically generated.
[hbck] Does not complain about tables with no end region [Z,] - Key: HBASE-4379 URL: https://issues.apache.org/jira/browse/HBASE-4379 Project: HBase Issue Type: Bug Components: hbck Affects Versions: 0.90.5, 0.92.0, 0.94.0, 0.96.0 Reporter: Jonathan Hsieh Assignee: Anoop Sam John Fix For: 0.96.0, 0.94.1 Attachments: 0001-HBASE-4379-hbck-does-not-complain-about-tables-with-.patch, HBASE-4379_94.patch, HBASE-4379_94_V2.patch, HBASE-4379_Trunk.patch, hbase-4379.v2.patch hbck does not detect or have an error condition when the last region of a table is missing (end key != ''). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-5631) hbck should handle case where .tableinfo file is missing.
[ https://issues.apache.org/jira/browse/HBASE-5631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401288#comment-13401288 ] Anoop Sam John commented on HBASE-5631: --- One point: without the .tableinfo file in HDFS, HBCK cannot fix HDFS integrity issues. To recreate the .tableinfo file in HDFS we need the table's HTD instance. We can try getting this from the Master or the RSs: on the RS side, the HRegion instances hold HTDs, and on the Master it may already be cached in FSTableDescriptors before the file actually went missing. We can try every possible source (hopefully one of them will still have the HTD) and recreate the .tableinfo file. Note that this can work only in online mode; and if the .tableinfo file is missing, the offline-mode fixes won't work either. Please validate my analysis. hbck should handle case where .tableinfo file is missing. - Key: HBASE-5631 URL: https://issues.apache.org/jira/browse/HBASE-5631 Project: HBase Issue Type: Improvement Components: hbck Affects Versions: 0.92.2, 0.94.0, 0.96.0 Reporter: Jonathan Hsieh 0.92+ branches have a .tableinfo file which could be missing from hdfs. hbck should be able to detect and repair this properly. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
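The recovery idea above, try each online source of the HTD in turn and rebuild .tableinfo from the first hit, can be sketched as follows. Everything here is hypothetical (the TableDescriptor record, the supplier lambdas, the table name "usertable"); the real Master/RS lookups and file write are stubbed out.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;
import java.util.function.Supplier;

public class TableInfoRecoverySketch {
    // Hypothetical stand-in for HBase's HTableDescriptor.
    record TableDescriptor(String name) {}

    // Try each online source in order; empty means offline / nothing to rebuild from.
    static Optional<TableDescriptor> recover(List<Supplier<TableDescriptor>> sources) {
        for (Supplier<TableDescriptor> s : sources) {
            TableDescriptor htd = s.get(); // e.g. Master cache, then each RS
            if (htd != null) return Optional.of(htd);
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // The Master's FSTableDescriptors cache misses, but one RS still
        // holds the HTD in an HRegion, so recovery succeeds online.
        Optional<TableDescriptor> htd = recover(Arrays.asList(
            () -> null,                               // Master cache: already lost
            () -> new TableDescriptor("usertable"))); // a region server's copy
        System.out.println(htd.map(TableDescriptor::name).orElse("unrecoverable"));
    }
}
```

With no live Master or RS to query, every supplier returns null, which matches the comment's point that offline-mode fixes cannot work without the .tableinfo file.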
[jira] [Commented] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401294#comment-13401294 ] Ted Yu commented on HBASE-6269: --- Can you generate patch for trunk for Hadoop aa ?
[jira] [Comment Edited] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401294#comment-13401294 ] Ted Yu edited comment on HBASE-6269 at 6/26/12 11:00 AM: - Can you generate patch for trunk for Hadoop QA ? was (Author: yuzhih...@gmail.com): Can you generate patch for trunk for Hadoop aa ?
[jira] [Commented] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401296#comment-13401296 ] ramkrishna.s.vasudevan commented on HBASE-6269: --- @ShiXing If the data is not flushed, we are able to get value1, which is the latest. But when we flush, we have this behavioral change where the StoreFileScanner (KeyValueHeap) gives us the older value?
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: 6200-trunk-v3.patch

I tried my best to reduce the amount of computation in this version of the patch. I also changed the test case to benchmark the comparison directly (1,000,000 compares each time), with the same steps:
1. Compare 'famia:qualia' with 'famib:qualia'. Run 100 times and measure the time consumed (using System.currentTimeMillis() to get the current time).
2. Compare 'fami:qualia' with 'fami:qualib'. Run 1,000,000 times and measure the time consumed.
3. Repeat steps 1~2 for 20 times, accumulate the total time consumed at step 1 and step 2, then calculate the average.

Test code:
{noformat}
for (int loop = 0; loop < 20; loop++) {
  long start = System.currentTimeMillis();
  for (int i = 0; i < 100; i++) {
    compareIgnoringPrefix(c, 0, kvf_a, kvf_b);
  }
  long end = System.currentTimeMillis();
  long useTimeA = end - start;
  start = end;
  for (int i = 0; i < 100; i++) {
    compareIgnoringPrefix(c, 0, kvq_a, kvq_b);
  }
  end = System.currentTimeMillis();
  long useTimeB = end - start;
  totalTimeA += useTimeA;
  totalTimeB += useTimeB;
}

private void compareIgnoringPrefix(KeyValue.KeyComparator c, int common,
    KeyValue less, KeyValue greater) {
  int cmp = c.compareIgnoringPrefix(common, less.getBuffer(),
      less.getOffset() + KeyValue.ROW_OFFSET, less.getKeyLength(),
      greater.getBuffer(),
      greater.getOffset() + KeyValue.ROW_OFFSET, greater.getKeyLength());
}
{noformat}

And this is the new result:

[without patch 6200]
{noformat}
Compare {famia:qualia} with {famib:qualia}, run for 1,000,000 times. used time - 50
Compare {fami:qualia} with {fami:qualib}, run for 1,000,000 times. used time - 58
{noformat}

[with patch 6200]
{noformat}
Compare {famia:qualia} with {famib:qualia}, run for 1,000,000 times. used time - 56
Compare {fami:qualia} with {fami:qualib}, run for 1,000,000 times. used time - 64
{noformat}

KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch, HBASE-6200-90-v2.patch, HBASE-6200-92-v2.patch, HBASE-6200-94-v2.patch

As reported by Desert Rose on IRC and on the ML, {{Result}} has a weird behavior when some families share the same prefix. He posted a link to his code to show how it fails: http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. Then what happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} uses {{KeyComparator.compareWithoutRow}}, which has a different sort order, so the end result is undetermined. I added some debug output and I can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequent it is for users to have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker.

-- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
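The prefix bug is easier to see with a toy comparator. The sketch below is hypothetical, self-contained code (not HBase's actual KeyComparator): comparing family and qualifier as one concatenated byte blob disagrees with comparing the family first whenever one family name is a prefix of another, which is exactly the f:a vs f1: case above.

```java
public class PrefixCompare {
    // Plain lexicographic compare over raw bytes, as a flawed
    // compareWithoutRow effectively does over family+qualifier.
    static int flatCompare(byte[] a, byte[] b) {
        int len = Math.min(a.length, b.length);
        for (int i = 0; i < len; i++) {
            int d = (a[i] & 0xff) - (b[i] & 0xff);
            if (d != 0) return d;
        }
        return a.length - b.length;
    }

    // Correct ordering: compare the family first, then the qualifier.
    static int familyAwareCompare(byte[] fam1, byte[] qual1,
                                  byte[] fam2, byte[] qual2) {
        int d = flatCompare(fam1, fam2);
        return d != 0 ? d : flatCompare(qual1, qual2);
    }

    public static void main(String[] args) {
        // KV from family "f", qualifier "a" vs KV from family "f1", empty qualifier.
        byte[] famA = "f".getBytes();  byte[] qualA = "a".getBytes();
        byte[] famB = "f1".getBytes(); byte[] qualB = "".getBytes();
        // Flat compare of the concatenated bytes says "fa" > "f1" ('a' > '1')...
        System.out.println(flatCompare("fa".getBytes(), "f1".getBytes()) > 0); // true
        // ...but family-first compare says f:a sorts BEFORE f1: ("f" < "f1").
        System.out.println(familyAwareCompare(famA, qualA, famB, qualB) < 0);  // true
    }
}
```

The two comparators disagree, so a binary search using one order over data sorted in the other can land on the wrong KeyValue, which matches the null-return behavior described in the report.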
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: PerformanceTestCase-6200-94.patch)
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: PerformanceTest-trunk.patch
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: PerformanceTest-trunk.patch)
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: HBASE-6200-92-v2.patch)
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: HBASE-6200-90-v2.patch)
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jieshan Bean updated HBASE-6200: Attachment: (was: HBASE-6200-94-v2.patch)
[jira] [Commented] (HBASE-6267) hbase.store.delete.expired.storefile should be true by default
[ https://issues.apache.org/jira/browse/HBASE-6267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401322#comment-13401322 ] Hudson commented on HBASE-6267: --- Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #69 (See [https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/69/]) HBASE-6267. hbase.store.delete.expired.storefile should be true by default (Revision 1353812) Result = FAILURE apurtell : Files : * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/Store.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/io/hfile/TestScannerSelectionUsingTTL.java

hbase.store.delete.expired.storefile should be true by default -- Key: HBASE-6267 URL: https://issues.apache.org/jira/browse/HBASE-6267 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.96.0, 0.94.1 Attachments: HBASE-6267-0.94.patch, HBASE-6267.patch

HBASE-5199 introduces this logic into Store:
{code}
+    // Delete the expired store files before the compaction selection.
+    if (conf.getBoolean("hbase.store.delete.expired.storefile", false)
+        && (ttl != Long.MAX_VALUE) && (this.scanInfo.minVersions == 0)) {
+      CompactSelection expiredSelection = compactSelection
+          .selectExpiredStoreFilesToCompact(
+              EnvironmentEdgeManager.currentTimeMillis() - this.ttl);
+
+      // If there are any expired store files, delete them by compaction.
+      if (expiredSelection != null) {
+        return expiredSelection;
+      }
+    }
{code}
Is there any reason why that should not default to {{true}}?
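To illustrate the idea behind the quoted patch, here is a hedged, self-contained sketch (the `FileInfo` class and method names are illustrative, not HBase's actual API): a store file whose newest cell is already older than the TTL cutoff can be dropped wholesale instead of being rewritten by compaction.

```java
import java.util.ArrayList;
import java.util.List;

public class ExpiredSelection {
    // Illustrative stand-in for a store file's metadata.
    static class FileInfo {
        final String name;
        final long maxTimestamp; // timestamp of the newest cell in the file
        FileInfo(String name, long maxTimestamp) {
            this.name = name;
            this.maxTimestamp = maxTimestamp;
        }
    }

    // Return the files in which every cell is older than the expiry point
    // (now - ttl); those files can be deleted without a real compaction.
    static List<FileInfo> selectExpired(List<FileInfo> files, long expiredBefore) {
        List<FileInfo> expired = new ArrayList<>();
        for (FileInfo f : files) {
            if (f.maxTimestamp < expiredBefore) {
                expired.add(f);
            }
        }
        return expired;
    }

    public static void main(String[] args) {
        long now = 1_000_000L, ttl = 100L;
        List<FileInfo> files = new ArrayList<>();
        files.add(new FileInfo("old.hfile", now - 500));  // fully expired
        files.add(new FileInfo("fresh.hfile", now - 10)); // still within TTL
        // Only "old.hfile" qualifies for wholesale deletion.
        System.out.println(selectExpired(files, now - ttl).size()); // prints 1
    }
}
```

Because the selection only ever removes data that a scan would filter out anyway, enabling it by default changes no query results, which is the argument for flipping the default to true.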
[jira] [Commented] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401325#comment-13401325 ] ramkrishna.s.vasudevan commented on HBASE-6269: --- @ShiXing Yes, I agree that instead of getting the highest store file's scanner we get the second highest. And since in this case comparing for '0' should be fine, I feel. It's better we fix this, though it may be very rare to hit this problem.

Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue - Key: HBASE-6269 URL: https://issues.apache.org/jira/browse/HBASE-6269 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.94.0 Reporter: ShiXing Assignee: ShiXing Attachments: HBASE-6269-v1.patch

While fixing HBASE-6195, I happened to find that the test case sometimes fails: https://builds.apache.org/job/HBase-0.94/259/. If there are two Put/Increment operations with the same row, family, qualifier, and timestamp but different memstoreTS, and we flush the memstore after each Put/Increment, there will be two StoreFiles with the same KeyValue (except for memstoreTS and SequenceId).
When I get the row, I always get the old records. The test case looks like this:
{code}
public void testPutWithMemStoreFlush() throws Exception {
  Configuration conf = HBaseConfiguration.create();
  String method = "testPutWithMemStoreFlush";
  byte[] tableName = Bytes.toBytes(method);
  byte[] family = Bytes.toBytes("family");
  byte[] qualifier = Bytes.toBytes("qualifier");
  byte[] row = Bytes.toBytes("putRow");
  byte[] value = null;
  this.region = initHRegion(tableName, method, conf, family);
  Put put = null;
  Get get = null;
  List<KeyValue> kvs = null;
  Result res = null;

  put = new Put(row);
  value = Bytes.toBytes("value0");
  put.add(family, qualifier, 1234567l, value);
  region.put(put);
  System.out.print("get value before flush after put value0 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value0 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }

  put = new Put(row);
  value = Bytes.toBytes("value1");
  put.add(family, qualifier, 1234567l, value);
  region.put(put);
  System.out.print("get value before flush after put value1 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value1 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }

  put = new Put(row);
  value = Bytes.toBytes("value2");
  put.add(family, qualifier, 1234567l, value);
  region.put(put);
  System.out.print("get value before flush after put value2 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value2 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
}
{code}
and the result prints as follows:
{code}
get value before flush after put value0 : value0
get value after flush after put value0 : value0
get value before flush after put value1 : value1
get value after flush after put value1 : value0
get value before flush after put value2 : value2
get value after flush after put value2 : value0
{code}
I analyzed the code for StoreFileScanner with lazy seek: the StoreFileScanners are sorted by SequenceId, so the latest StoreFile is on top of the KeyValueHeap, and the KeyValue from the latest StoreFile is compared to the second latest
[jira] [Commented] (HBASE-6228) Fixup daughters twice cause daughter region assigned twice
[ https://issues.apache.org/jira/browse/HBASE-6228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401326#comment-13401326 ] Jonathan Hsieh commented on HBASE-6228: --- Is there any way we can add tests for these subtle recovery fixes? Part of me says we should just take something like a lock on the region (in zk, possibly moving it into RIT) before we start fixing them up like this, to make this obvious and to eliminate these classes of races.

Fixup daughters twice cause daughter region assigned twice --- Key: HBASE-6228 URL: https://issues.apache.org/jira/browse/HBASE-6228 Project: HBase Issue Type: Bug Components: master Reporter: chunhui shen Assignee: chunhui shen Fix For: 0.96.0 Attachments: HBASE-6228.patch, HBASE-6228v2.patch, HBASE-6228v2.patch

First, how does fixing up daughters twice happen?
1. We fix up daughters at the end of HMaster#finishInitialization.
2. ServerShutdownHandler fixes up daughters when reassigning a region through ServerShutdownHandler#processDeadRegion.
When fixing up daughters, we add the daughters to .META., but that cannot prevent the above case, because of FindDaughterVisitor. The details are as follows. Suppose region A is a split parent region, and its daughter region B is missing:
1. First, the ServerShutdownHandler thread fixes up the daughter, so it adds daughter region B to .META. with serverName=null and assigns the daughter.
2. Then, the master's initialization thread will also find that daughter region B is missing and assign it. This is because FindDaughterVisitor considers a daughter missing if its serverName=null.
[jira] [Commented] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401383#comment-13401383 ] ShiXing commented on HBASE-6269: @anoop There are 2 StoreFiles after flushing twice, and sf2's sequenceId > sf1's sequenceId. When we get:

Step 1. sf2 is the highest StoreFileScanner, and it calls enforceSeek() in KeyValueHeap.pollRealKV(), so KeyValue2 is read out from the StoreFile by a real seek. It is then compared to the fake KeyValue (call it FakeKeyValue) generated by KeyValue.createFirstOnRow() in StoreScanner.next(). The FakeKeyValue's row, family, qualifier, timestamp, and memstoreTS (always 0 for a StoreFileScanner) are the same as KeyValue2's, except that its key type is Maximum, while the key type in KeyValue2 is Put. So
{code}
comparator.compare(curKV=KeyValue2, nextKV=FakeKeyValue) = 251 > 0
{code}
This means the highest StoreFileScanner's top KeyValue is not considered higher than the second's. The values, for example, are:
{code}
KeyValue2    : putRow/family:qualifier/1234567/Put/vlen=6/ts=0
FakeKeyValue : putRow/family:qualifier/1234567/Maximum/vlen=0/ts=0
{code}
And then the second highest StoreFileScanner becomes the highest, and the previous highest is added back to the heap.

Step 2. sf1's top KeyValue is read out; call it KeyValue1. Its value is the same as KeyValue2, fetched again by heap.peek():
{code}
KeyValue1 : putRow/family:qualifier/1234567/Put/vlen=6/ts=0
{code}

Step 3. KeyValue1 is compared to KeyValue2, and
{code}
comparator.compare(curKV=KeyValue1, nextKV=KeyValue2) = 0
{code}
so sf1's scanner is returned as the highest StoreFileScanner.

My solution is: if all the top KeyValues read out from the StoreFileScanners are the same (compare returns 0), then we should keep the scanners' original order by sequenceId.
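The proposed tie-break can be sketched with a toy heap. This is hypothetical, self-contained code (FakeScanner, peekKey, and seqId are illustrative names, not HBase's actual scanner classes): when two scanners' top keys compare equal, the scanner backed by the store file with the larger sequence id must win, so the newest value is read first.

```java
import java.util.Comparator;
import java.util.PriorityQueue;

public class ScannerOrder {
    // Illustrative stand-in for a StoreFileScanner.
    static class FakeScanner {
        final String peekKey; // top key the scanner currently points at
        final long seqId;     // larger = newer store file
        FakeScanner(String peekKey, long seqId) {
            this.peekKey = peekKey;
            this.seqId = seqId;
        }
    }

    // Compare by key; on an exact key tie, the HIGHER sequence id sorts
    // first, so the newest store file's value wins the heap.
    static final Comparator<FakeScanner> CMP = (a, b) -> {
        int d = a.peekKey.compareTo(b.peekKey);
        return d != 0 ? d : Long.compare(b.seqId, a.seqId);
    };

    public static void main(String[] args) {
        PriorityQueue<FakeScanner> heap = new PriorityQueue<>(CMP);
        // Two flushes produced identical keys (same row/family/qualifier/ts).
        heap.add(new FakeScanner("putRow/family:qualifier/1234567/Put", 1)); // holds "value0"
        heap.add(new FakeScanner("putRow/family:qualifier/1234567/Put", 2)); // holds "value1"
        System.out.println(heap.poll().seqId); // prints 2: the newer file is read first
    }
}
```

Without the sequence-id fallback the two equal keys tie arbitrarily, which is how the test above ends up returning value0 after the second flush.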
[jira] [Updated] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ShiXing updated HBASE-6269: --- Attachment: HBASE-6269-trunk-V1.patch
[jira] [Updated] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan updated HBASE-6269: -- Status: Patch Available (was: Open) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue - Key: HBASE-6269 URL: https://issues.apache.org/jira/browse/HBASE-6269 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.94.0 Reporter: ShiXing Assignee: ShiXing Attachments: HBASE-6269-trunk-V1.patch, HBASE-6269-v1.patch While fixing HBASE-6195, I happened to find that the test case sometimes fails, https://builds.apache.org/job/HBase-0.94/259/. If there are two Puts/Increments with the same row, family, qualifier and timestamp but different memstoreTS, and we flush the memstore after each Put/Increment, there will be two StoreFiles with the same KeyValue (differing only in memstoreTS and sequenceId). When I get the row, I always get the old record; the test case is like this:
{code}
public void testPutWithMemStoreFlush() throws Exception {
  Configuration conf = HBaseConfiguration.create();
  String method = "testPutWithMemStoreFlush";
  byte[] tableName = Bytes.toBytes(method);
  byte[] family = Bytes.toBytes("family");
  byte[] qualifier = Bytes.toBytes("qualifier");
  byte[] row = Bytes.toBytes("putRow");
  byte[] value = null;
  this.region = initHRegion(tableName, method, conf, family);
  Put put = null;
  Get get = null;
  List<KeyValue> kvs = null;
  Result res = null;

  put = new Put(row);
  value = Bytes.toBytes("value0");
  put.add(family, qualifier, 1234567L, value);
  region.put(put);
  System.out.print("get value before flush after put value0 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value0 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }

  put = new Put(row);
  value = Bytes.toBytes("value1");
  put.add(family, qualifier, 1234567L, value);
  region.put(put);
  System.out.print("get value before flush after put value1 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value1 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }

  put = new Put(row);
  value = Bytes.toBytes("value2");
  put.add(family, qualifier, 1234567L, value);
  region.put(put);
  System.out.print("get value before flush after put value2 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
  region.flushcache();
  System.out.print("get value after flush after put value2 : ");
  get = new Get(row);
  get.addColumn(family, qualifier);
  get.setMaxVersions();
  res = this.region.get(get, null);
  kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) {
    System.out.println(Bytes.toString(kvs.get(i).getValue()));
  }
}
{code}
and the result prints as follows:
{code}
get value before flush after put value0 : value0
get value after flush after put value0 : value0
get value before flush after put value1 : value1
get value after flush after put value1 : value0
get value before flush after put value2 : value2
get value after flush after put value2 : value0
{code}
I analyzed the code for StoreFileScanner with lazy seek: the StoreFileScanners are sorted by sequenceId, so the latest StoreFile is at the top of the KeyValueHeap, and the KeyValue for the latest StoreFile will
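The invariant the issue asks for can be sketched outside HBase. This is an illustration with hypothetical names, not the StoreFileScanner code: a heap that breaks key ties by descending sequence id hands back the duplicate from the newest store file first, which is exactly what the buggy lazy-seek path fails to guarantee.

```java
import java.util.Comparator;
import java.util.PriorityQueue;

// Minimal sketch (not the HBase API): a "scanner" is reduced to the key it is
// positioned on plus the sequence id of its backing store file.
public class LazySeekOrder {
    static final class Scanner {
        final String key;      // row/family/qualifier/timestamp collapsed into one string
        final long sequenceId; // higher = written later
        final String value;
        Scanner(String key, long sequenceId, String value) {
            this.key = key; this.sequenceId = sequenceId; this.value = value;
        }
    }

    // Compare by key first; on equal keys the scanner from the newer store file
    // (larger sequence id) must come out of the heap first, per HBASE-6269.
    static final Comparator<Scanner> HEAP_ORDER = (a, b) -> {
        int cmp = a.key.compareTo(b.key);
        if (cmp != 0) return cmp;
        return Long.compare(b.sequenceId, a.sequenceId); // newer file wins key ties
    };

    public static String newestValue(Scanner... scanners) {
        PriorityQueue<Scanner> heap = new PriorityQueue<>(HEAP_ORDER);
        for (Scanner s : scanners) heap.add(s);
        // Top of the heap: the newest version of the smallest key.
        return heap.peek().value;
    }

    public static void main(String[] args) {
        // Two flushes produced two files with an identical KeyValue; only
        // memstoreTS/sequenceId differ. The later file (seqId 2) must win.
        String v = newestValue(new Scanner("row/f/q/1234567", 1L, "value0"),
                               new Scanner("row/f/q/1234567", 2L, "value1"));
        if (!v.equals("value1")) throw new AssertionError("expected value1, got " + v);
        System.out.println("newest = " + v);
    }
}
```

Without the sequence-id tie-break, whichever scanner happens to sit on top of the heap wins, which is how the test above keeps reading value0 after a flush.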
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-6200: -- Status: Patch Available (was: Open) KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.94.0, 0.92.1, 0.90.6 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch As reported by Desert Rose on IRC and on the ML, {{Result}} behaves oddly when some families share the same prefix. He posted a link to code that shows how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. What happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} then uses {{KeyComparator.compareWithoutRow}}, which sorts differently, so the end result is undetermined. I added some debug output and can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequently users have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
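The misordering is easy to reproduce with plain strings. A minimal sketch with hypothetical method names (not the actual KeyComparator code): comparing the concatenated family+qualifier bytes puts f:a after f1:, while comparing the family on its own first yields the order the region server actually returns.

```java
public class ColumnOrder {
    // Buggy ordering: treat family+qualifier as one byte sequence, the way
    // KeyComparator.compareWithoutRow effectively does per this issue.
    static int concatCompare(String fam1, String qual1, String fam2, String qual2) {
        return (fam1 + qual1).compareTo(fam2 + qual2);
    }

    // Correct ordering: the family is compared on its own before the qualifier,
    // so a shorter family always sorts before a longer one sharing its prefix.
    static int familyFirstCompare(String fam1, String qual1, String fam2, String qual2) {
        int cmp = fam1.compareTo(fam2);
        return cmp != 0 ? cmp : qual1.compareTo(qual2);
    }

    public static void main(String[] args) {
        // f:a vs f1: — concatenation compares "fa" with "f1", and since
        // 'a' (0x61) > '1' (0x31), f:a is declared the bigger column...
        if (concatCompare("f", "a", "f1", "") <= 0) throw new AssertionError();
        // ...but with the family compared first, "f" < "f1", so f:a
        // correctly sorts before f1:.
        if (familyFirstCompare("f", "a", "f1", "") >= 0) throw new AssertionError();
        System.out.println("concat: f:a > f1:  family-first: f:a < f1:");
    }
}
```

Because {{Arrays.binarySearch}} assumes the array is sorted by the very comparator it is given, feeding it data sorted the other way makes the result of the search undefined, matching the symptom described above.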
[jira] [Created] (HBASE-6270) If all data is locally cached, undo locking and context switching so we are cpu bound
stack created HBASE-6270: Summary: If all data is locally cached, undo locking and context switching so we are cpu bound Key: HBASE-6270 URL: https://issues.apache.org/jira/browse/HBASE-6270 Project: HBase Issue Type: Bug Components: performance Reporter: stack See Dhruba's blog here, towards the end, where he talks about HBase: http://hadoopblog.blogspot.com.es/2012/05/hadoop-and-solid-state-drives.html He says that when all data is local and cached, we bind ourselves up with locks and context switching, so much so that we are unable to use all of the CPU. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-6200: -- Attachment: 6200-trunk-v4.txt Patch v4 reorders some assignments so that variables are calculated immediately before they are used. KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch, 6200-trunk-v4.txt As reported by Desert Rose on IRC and on the ML, {{Result}} behaves oddly when some families share the same prefix. He posted a link to code that shows how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. What happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} then uses {{KeyComparator.compareWithoutRow}}, which sorts differently, so the end result is undetermined. I added some debug output and can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequently users have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-5967) OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type
[ https://issues.apache.org/jira/browse/HBASE-5967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-5967: -- Attachment: 5967-v2.patch Re-attaching patch v2. OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type - Key: HBASE-5967 URL: https://issues.apache.org/jira/browse/HBASE-5967 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Jimmy Xiang Assignee: Gregory Chanan Priority: Minor Fix For: 0.96.0 Attachments: 5967-v2.patch, HBASE-5967-v2.patch, HBASE-5967.patch, master.log I saw this error in the master log: Caused by: java.lang.IllegalArgumentException: Method org.apache.hadoop.hbase.master.MXBean.getRegionServers has parameter or return type that cannot be translated into an open type at com.sun.jmx.mbeanserver.ConvertingMethod.from(ConvertingMethod.java:32) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:63) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:33) at com.sun.jmx.mbeanserver.MBeanAnalyzer.initMaps(MBeanAnalyzer.java:118) at com.sun.jmx.mbeanserver.MBeanAnalyzer.init(MBeanAnalyzer.java:99) ... 14 more Caused by: javax.management.openmbean.OpenDataException: Cannot convert type: java.util.Map<java.lang.String, org.apache.hadoop.hbase.ServerLoad> at com.sun.jmx.mbeanserver.OpenConverter.openDataException(OpenConverter.jav -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401454#comment-13401454 ] stack commented on HBASE-6200: -- Thanks for taking time on perf Jieshan. +1 on commit (It looks like you have enough tests). KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch, 6200-trunk-v4.txt As reported by Desert Rose on IRC and on the ML, {{Result}} behaves oddly when some families share the same prefix. He posted a link to code that shows how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. What happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} then uses {{KeyComparator.compareWithoutRow}}, which sorts differently, so the end result is undetermined. I added some debug output and can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequently users have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. 
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-5967) OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type
[ https://issues.apache.org/jira/browse/HBASE-5967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401455#comment-13401455 ] stack commented on HBASE-5967: -- Is this the same as HBASE-5971? (If so, let's close HBASE-5971.) +1 on patch (I would have gone another route, banging my head trying to make ServerLoad resolve as an OpenData type... That would have taken 100x longer and in the end might not have worked... This is the better way to go). OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type - Key: HBASE-5967 URL: https://issues.apache.org/jira/browse/HBASE-5967 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Jimmy Xiang Assignee: Gregory Chanan Priority: Minor Fix For: 0.96.0 Attachments: 5967-v2.patch, HBASE-5967-v2.patch, HBASE-5967.patch, master.log I saw this error in the master log: Caused by: java.lang.IllegalArgumentException: Method org.apache.hadoop.hbase.master.MXBean.getRegionServers has parameter or return type that cannot be translated into an open type at com.sun.jmx.mbeanserver.ConvertingMethod.from(ConvertingMethod.java:32) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:63) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:33) at com.sun.jmx.mbeanserver.MBeanAnalyzer.initMaps(MBeanAnalyzer.java:118) at com.sun.jmx.mbeanserver.MBeanAnalyzer.init(MBeanAnalyzer.java:99) ... 14 more Caused by: javax.management.openmbean.OpenDataException: Cannot convert type: java.util.Map<java.lang.String, org.apache.hadoop.hbase.ServerLoad> at com.sun.jmx.mbeanserver.OpenConverter.openDataException(OpenConverter.jav -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
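The route preferred here — not forcing ServerLoad through the JMX open-type conversion — can be sketched as follows. All names are hypothetical and this is not the committed patch: instead of exposing a Map of a non-convertible type from the MXBean getter, expose a flattened representation built only from types JMX already maps to open types (String, arrays, primitives).

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class MXBeanFlatten {
    // Stand-in for the non-convertible type; the real ServerLoad wraps a protobuf
    // message, which the MXBean introspector cannot turn into a CompositeType.
    static final class ServerLoad {
        final int regions;
        ServerLoad(int regions) { this.regions = regions; }
    }

    // Returning Map<String, ServerLoad> from an MXBean getter fails with
    // OpenDataException at registration time. Flattening to String[] (already
    // an open type) side-steps the conversion entirely.
    static String[] flatten(Map<String, ServerLoad> servers) {
        return servers.entrySet().stream()
            .map(e -> e.getKey() + "=" + e.getValue().regions + " regions")
            .toArray(String[]::new);
    }

    public static void main(String[] args) {
        Map<String, ServerLoad> servers = new LinkedHashMap<>();
        servers.put("rs1.example.com", new ServerLoad(42));
        String[] flat = flatten(servers);
        if (!flat[0].equals("rs1.example.com=42 regions")) throw new AssertionError(flat[0]);
        System.out.println(flat[0]);
    }
}
```

The trade-off is that JMX clients get strings rather than structured CompositeData, but nothing in the bean can fail open-type translation at registration.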
[jira] [Commented] (HBASE-6261) Better approximate high-percentile percentile latency metrics
[ https://issues.apache.org/jira/browse/HBASE-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401463#comment-13401463 ] Otis Gospodnetic commented on HBASE-6261: - @Andrew See https://twitter.com/otisg/status/217487624804376576 Better approximate high-percentile percentile latency metrics - Key: HBASE-6261 URL: https://issues.apache.org/jira/browse/HBASE-6261 Project: HBase Issue Type: New Feature Reporter: Andrew Wang Labels: metrics The existing reservoir-sampling based latency metrics in HBase are not well-suited for providing accurate estimates of high-percentile (e.g. 90th, 95th, or 99th) latency. This is a well-studied problem in the literature (see [1] and [2]), the question is determining which methods best suit our needs and then implementing it. Ideally, we should be able to estimate these high percentiles with minimal memory and CPU usage as well as minimal error (e.g. 1% error on 90th, or .1% on 99th). It's also desirable to provide this over different time-based sliding windows, e.g. last 1 min, 5 mins, 15 mins, and 1 hour. I'll note that this would also be useful in HDFS, or really anywhere latency metrics are kept. [1] http://www.cs.rutgers.edu/~muthu/bquant.pdf [2] http://infolab.stanford.edu/~manku/papers/04pods-sliding.pdf -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
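For context on why reservoir sampling struggles here, a minimal reservoir-based percentile estimator (hypothetical code, not HBase's metrics implementation) looks like this. With only a small bounded sample, few of the top-1% observations survive, which is the error source the issue describes.

```java
import java.util.Arrays;
import java.util.Random;

public class ReservoirPercentile {
    private final long[] sample;
    private long seen = 0;
    private final Random rng = new Random(42); // fixed seed keeps the sketch deterministic

    ReservoirPercentile(int capacity) { this.sample = new long[capacity]; }

    // Classic reservoir sampling (Algorithm R): after n observations, each one
    // remains in the sample with equal probability capacity/n.
    void update(long latencyMicros) {
        if (seen < sample.length) {
            sample[(int) seen] = latencyMicros;
        } else {
            long j = (long) (rng.nextDouble() * (seen + 1));
            if (j < sample.length) sample[(int) j] = latencyMicros;
        }
        seen++;
    }

    // Read the percentile off the sorted sample; accuracy at high percentiles
    // is limited by how few samples land in the tail.
    long percentile(double p) {
        int n = (int) Math.min(seen, sample.length);
        long[] sorted = Arrays.copyOf(sample, n);
        Arrays.sort(sorted);
        return sorted[Math.min(n - 1, (int) (p * n))];
    }

    public static void main(String[] args) {
        ReservoirPercentile r = new ReservoirPercentile(1024);
        for (long i = 1; i <= 100_000; i++) r.update(i); // uniform latencies 1..100000
        // A 1024-entry reservoir keeps only ~10 of the top-1% observations,
        // so the p99 estimate is noisy — the motivation for the stream-summary
        // algorithms cited in [1] and [2].
        System.out.println("estimated p99 = " + r.percentile(0.99) + " (true p99 = 99000)");
    }
}
```

The algorithms in the cited papers trade this uniform sample for quantile-aware summaries with bounded rank error, and add the time-windowing this issue asks for.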
[jira] [Commented] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401483#comment-13401483 ] Hadoop QA commented on HBASE-6200: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533487/6200-trunk-v4.txt against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 6 new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.hbase.regionserver.TestServerCustomProtocol org.apache.hadoop.hbase.security.access.TestAccessController Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2260//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2260//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2260//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2260//console This message is automatically generated. 
KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch, 6200-trunk-v4.txt As reported by Desert Rose on IRC and on the ML, {{Result}} behaves oddly when some families share the same prefix. He posted a link to code that shows how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. What happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} then uses {{KeyComparator.compareWithoutRow}}, which sorts differently, so the end result is undetermined. I added some debug output and can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequently users have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-5967) OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type
[ https://issues.apache.org/jira/browse/HBASE-5967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401486#comment-13401486 ] Hadoop QA commented on HBASE-5967: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533490/5967-v2.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 3 new or modified tests. +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 6 new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2261//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2261//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2261//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2261//console This message is automatically generated. 
OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type - Key: HBASE-5967 URL: https://issues.apache.org/jira/browse/HBASE-5967 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Jimmy Xiang Assignee: Gregory Chanan Priority: Minor Fix For: 0.96.0 Attachments: 5967-v2.patch, HBASE-5967-v2.patch, HBASE-5967.patch, master.log I saw this error in the master log: Caused by: java.lang.IllegalArgumentException: Method org.apache.hadoop.hbase.master.MXBean.getRegionServers has parameter or return type that cannot be translated into an open type at com.sun.jmx.mbeanserver.ConvertingMethod.from(ConvertingMethod.java:32) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:63) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:33) at com.sun.jmx.mbeanserver.MBeanAnalyzer.initMaps(MBeanAnalyzer.java:118) at com.sun.jmx.mbeanserver.MBeanAnalyzer.init(MBeanAnalyzer.java:99) ... 14 more Caused by: javax.management.openmbean.OpenDataException: Cannot convert type: java.util.Map<java.lang.String, org.apache.hadoop.hbase.ServerLoad> at com.sun.jmx.mbeanserver.OpenConverter.openDataException(OpenConverter.jav -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-5967) OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type
[ https://issues.apache.org/jira/browse/HBASE-5967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401497#comment-13401497 ] Zhihong Ted Yu commented on HBASE-5967: --- I ran the 4 failed tests manually and they passed. Integrated to trunk. Thanks for the patch, Gregory. Thanks for the review, Stack. OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type - Key: HBASE-5967 URL: https://issues.apache.org/jira/browse/HBASE-5967 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Jimmy Xiang Assignee: Gregory Chanan Priority: Minor Fix For: 0.96.0 Attachments: 5967-v2.patch, HBASE-5967-v2.patch, HBASE-5967.patch, master.log I saw this error in the master log: Caused by: java.lang.IllegalArgumentException: Method org.apache.hadoop.hbase.master.MXBean.getRegionServers has parameter or return type that cannot be translated into an open type at com.sun.jmx.mbeanserver.ConvertingMethod.from(ConvertingMethod.java:32) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:63) at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:33) at com.sun.jmx.mbeanserver.MBeanAnalyzer.initMaps(MBeanAnalyzer.java:118) at com.sun.jmx.mbeanserver.MBeanAnalyzer.init(MBeanAnalyzer.java:99) ... 14 more Caused by: javax.management.openmbean.OpenDataException: Cannot convert type: java.util.Map<java.lang.String, org.apache.hadoop.hbase.ServerLoad> at com.sun.jmx.mbeanserver.OpenConverter.openDataException(OpenConverter.jav -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401508#comment-13401508 ] Zhihong Ted Yu commented on HBASE-6200: --- I ran the two tests listed above and they passed. @Jieshan: Please prepare patches for the 0.94, 0.92 and 0.90 branches based on patch v4. KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch, 6200-trunk-v4.txt As reported by Desert Rose on IRC and on the ML, {{Result}} behaves oddly when some families share the same prefix. He posted a link to code that shows how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. What happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} then uses {{KeyComparator.compareWithoutRow}}, which sorts differently, so the end result is undetermined. I added some debug output and can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequently users have families with the same prefix, but those that do, and that use those families at the same time, will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. 
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-6261) Better approximate high-percentile percentile latency metrics
[ https://issues.apache.org/jira/browse/HBASE-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401512#comment-13401512 ] Andrew Purtell commented on HBASE-6261: --- Which provoked this response: https://twitter.com/ted_dunning/status/217488314297626625 {quote} The basic techniques from the Mahout OnlineSummarizer will work for this. {quote} Would be great if any subsequent conversation happens on this JIRA instead of in twitterspace. Better approximate high-percentile percentile latency metrics - Key: HBASE-6261 URL: https://issues.apache.org/jira/browse/HBASE-6261 Project: HBase Issue Type: New Feature Reporter: Andrew Wang Labels: metrics The existing reservoir-sampling based latency metrics in HBase are not well-suited for providing accurate estimates of high-percentile (e.g. 90th, 95th, or 99th) latency. This is a well-studied problem in the literature (see [1] and [2]), the question is determining which methods best suit our needs and then implementing it. Ideally, we should be able to estimate these high percentiles with minimal memory and CPU usage as well as minimal error (e.g. 1% error on 90th, or .1% on 99th). It's also desirable to provide this over different time-based sliding windows, e.g. last 1 min, 5 mins, 15 mins, and 1 hour. I'll note that this would also be useful in HDFS, or really anywhere latency metrics are kept. [1] http://www.cs.rutgers.edu/~muthu/bquant.pdf [2] http://infolab.stanford.edu/~manku/papers/04pods-sliding.pdf -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HBASE-6271) In-memory region state is inconsistent
Jimmy Xiang created HBASE-6271: -- Summary: In-memory region state is inconsistent Key: HBASE-6271 URL: https://issues.apache.org/jira/browse/HBASE-6271 Project: HBase Issue Type: Bug Reporter: Jimmy Xiang Assignee: Jimmy Xiang AssignmentManager stores region state related information in several places: regionsInTransition, regions (region info to server name map), and servers (server name to region info set map). However, access to these places is not properly coordinated, which leads to inconsistent in-memory region state. Sometimes a region can even be offline and not in transition. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
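The coordination problem described above — several structures that must change together — can be sketched with a single lock guarding all of them. These are hypothetical names, not the AssignmentManager code: the point is that when every mutation of the three structures happens under one monitor, a reader can never observe a region that is offline yet absent from regionsInTransition.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

public class RegionStateBook {
    private final Set<String> regionsInTransition = new HashSet<>();
    private final Map<String, String> regionToServer = new HashMap<>();       // region -> server
    private final Map<String, Set<String>> serverToRegions = new HashMap<>(); // server -> regions

    // All three structures mutate under one monitor, so no reader interleaves
    // between the individual updates and sees a half-applied assignment.
    public synchronized void assign(String region, String server) {
        regionsInTransition.remove(region);
        String old = regionToServer.put(region, server);
        if (old != null && serverToRegions.containsKey(old)) {
            serverToRegions.get(old).remove(region);
        }
        serverToRegions.computeIfAbsent(server, s -> new HashSet<>()).add(region);
    }

    public synchronized void offline(String region) {
        // An offline region must stay visible in regionsInTransition until it
        // is reassigned — the invariant the issue says is currently violated.
        String old = regionToServer.remove(region);
        if (old != null) serverToRegions.get(old).remove(region);
        regionsInTransition.add(region);
    }

    // A region is in a consistent state if it is either assigned somewhere
    // or known to be in transition; "offline and not in transition" is the bug.
    public synchronized boolean consistent(String region) {
        return regionToServer.containsKey(region) || regionsInTransition.contains(region);
    }

    public static void main(String[] args) {
        RegionStateBook book = new RegionStateBook();
        book.assign("r1", "rs1");
        book.offline("r1");
        if (!book.consistent("r1")) throw new AssertionError("r1 offline and not in transition");
    }
}
```

A coarse single lock is the simplest fix; finer-grained schemes need the same invariant enforced some other way (e.g. a single state machine per region, which is roughly where the follow-up work went).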
[jira] [Created] (HBASE-6272) In-memory region state is inconsistent
Jimmy Xiang created HBASE-6272: -- Summary: In-memory region state is inconsistent Key: HBASE-6272 URL: https://issues.apache.org/jira/browse/HBASE-6272 Project: HBase Issue Type: Bug Reporter: Jimmy Xiang Assignee: Jimmy Xiang AssignmentManager stores region state related information in several places: regionsInTransition, regions (region info to server name map), and servers (server name to region info set map). However, access to these places is not properly coordinated, which leads to inconsistent in-memory region state. Sometimes a region can even be offline and not in transition. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HBASE-6271) In-memory region state is inconsistent
[ https://issues.apache.org/jira/browse/HBASE-6271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jimmy Xiang resolved HBASE-6271. Resolution: Duplicate Duplicate of HBASE-6272. Clicked twice? Close this one. In-memory region state is inconsistent -- Key: HBASE-6271 URL: https://issues.apache.org/jira/browse/HBASE-6271 Project: HBase Issue Type: Bug Reporter: Jimmy Xiang Assignee: Jimmy Xiang AssignmentManager stores region state related information in several places: regionsInTransition, regions (region info to server name map), and servers (server name to region info set map). However, access to these places is not properly coordinated, which leads to inconsistent in-memory region state. Sometimes a region can even be offline and not in transition. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-6269: -- Status: Open (was: Patch Available) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue - Key: HBASE-6269 URL: https://issues.apache.org/jira/browse/HBASE-6269 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.94.0 Reporter: ShiXing Assignee: ShiXing Attachments: HBASE-6269-trunk-V1.patch, HBASE-6269-v1.patch While fixing HBASE-6195 I happened to find that the test case would sometimes fail, see https://builds.apache.org/job/HBase-0.94/259/. If there are two Puts/Increments with the same row, family, qualifier and timestamp but different memstoreTS, and we do a memstore flush after each Put/Increment, there will be two StoreFiles with the same KeyValue (except for memstoreTS and SequenceId). When I then get the row, I always get the old record. The test case is like this:
{code}
public void testPutWithMemStoreFlush() throws Exception {
  Configuration conf = HBaseConfiguration.create();
  String method = "testPutWithMemStoreFlush";
  byte[] tableName = Bytes.toBytes(method);
  byte[] family = Bytes.toBytes("family");
  byte[] qualifier = Bytes.toBytes("qualifier");
  byte[] row = Bytes.toBytes("putRow");
  byte[] value = null;
  this.region = initHRegion(tableName, method, conf, family);
  Put put = null; Get get = null;
  List<KeyValue> kvs = null; Result res = null;

  put = new Put(row); value = Bytes.toBytes("value0");
  put.add(family, qualifier, 1234567L, value); region.put(put);
  System.out.print("get value before flush after put value0 : ");
  get = new Get(row); get.addColumn(family, qualifier); get.setMaxVersions();
  res = this.region.get(get, null); kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) { System.out.println(Bytes.toString(kvs.get(i).getValue())); }
  region.flushcache();
  System.out.print("get value after flush after put value0 : ");
  get = new Get(row); get.addColumn(family, qualifier); get.setMaxVersions();
  res = this.region.get(get, null); kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) { System.out.println(Bytes.toString(kvs.get(i).getValue())); }

  put = new Put(row); value = Bytes.toBytes("value1");
  put.add(family, qualifier, 1234567L, value); region.put(put);
  System.out.print("get value before flush after put value1 : ");
  get = new Get(row); get.addColumn(family, qualifier); get.setMaxVersions();
  res = this.region.get(get, null); kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) { System.out.println(Bytes.toString(kvs.get(i).getValue())); }
  region.flushcache();
  System.out.print("get value after flush after put value1 : ");
  get = new Get(row); get.addColumn(family, qualifier); get.setMaxVersions();
  res = this.region.get(get, null); kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) { System.out.println(Bytes.toString(kvs.get(i).getValue())); }

  put = new Put(row); value = Bytes.toBytes("value2");
  put.add(family, qualifier, 1234567L, value); region.put(put);
  System.out.print("get value before flush after put value2 : ");
  get = new Get(row); get.addColumn(family, qualifier); get.setMaxVersions();
  res = this.region.get(get, null); kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) { System.out.println(Bytes.toString(kvs.get(i).getValue())); }
  region.flushcache();
  System.out.print("get value after flush after put value2 : ");
  get = new Get(row); get.addColumn(family, qualifier); get.setMaxVersions();
  res = this.region.get(get, null); kvs = res.getColumn(family, qualifier);
  for (int i = 0; i < kvs.size(); i++) { System.out.println(Bytes.toString(kvs.get(i).getValue())); }
}
{code}
and the result prints as follows:
{code}
get value before flush after put value0 : value0
get value after flush after put value0 : value0
get value before flush after put value1 : value1
get value after flush after put value1 : value0
get value before flush after put value2 : value2
get value after flush after put value2 : value0
{code}
I analyzed the code for StoreFileScanner with lazy seek: the StoreFileScanners are sorted by SequenceId, so the latest StoreFile is on the top of the KeyValueHeap, and the KeyValue for the latest StoreFile will compare to the
[jira] [Updated] (HBASE-6269) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue
[ https://issues.apache.org/jira/browse/HBASE-6269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-6269: -- Status: Patch Available (was: Open) Lazyseek should use the maxSequenseId StoreFile's KeyValue as the latest KeyValue - Key: HBASE-6269 URL: https://issues.apache.org/jira/browse/HBASE-6269 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.94.0 Reporter: ShiXing Assignee: ShiXing Attachments: HBASE-6269-trunk-V1.patch, HBASE-6269-v1.patch
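The tie-break ShiXing is after can be pictured with a plain-Java sketch (class and field names here are hypothetical, not the actual HBase patch): when two scanners sit on KeyValues that compare equal on row/family/qualifier/timestamp, the heap should break the tie by the backing file's max sequence id, descending, so the newest flush wins.

```java
import java.util.Comparator;
import java.util.PriorityQueue;

// Hypothetical model: each "scanner" exposes its current key and the
// sequence id of the store file it reads from.
class LazySeekSketch {
    static final class Scanner {
        final String key;      // stands in for row/family:qualifier/timestamp
        final long sequenceId; // max sequence id of the backing store file
        final String value;
        Scanner(String key, long sequenceId, String value) {
            this.key = key; this.sequenceId = sequenceId; this.value = value;
        }
    }

    // Equal keys: the higher sequence id (newer file) sorts first in the heap.
    static final Comparator<Scanner> HEAP_ORDER = (a, b) -> {
        int c = a.key.compareTo(b.key);
        if (c != 0) return c;
        return Long.compare(b.sequenceId, a.sequenceId); // newer file first
    };

    static String top(Scanner... scanners) {
        PriorityQueue<Scanner> heap = new PriorityQueue<>(HEAP_ORDER);
        for (Scanner s : scanners) heap.add(s);
        return heap.peek().value;
    }

    public static void main(String[] args) {
        // Two flushes wrote the same key; the file with sequence id 2 is newer.
        Scanner older = new Scanner("putRow/family:qualifier/1234567", 1L, "value0");
        Scanner newer = new Scanner("putRow/family:qualifier/1234567", 2L, "value1");
        System.out.println(top(older, newer)); // prints "value1"
    }
}
```

Without the sequence-id tie-break, which scanner ends up on top of the heap is arbitrary, which matches the flaky "always got the old record" behaviour in the test above.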
[jira] [Resolved] (HBASE-6240) Race in HCM.getMaster stalls clients
[ https://issues.apache.org/jira/browse/HBASE-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan resolved HBASE-6240. --- Resolution: Fixed Assignee: ramkrishna.s.vasudevan Hadoop Flags: Reviewed Committed to 0.94. Thanks JD for the patch and review. Thanks to Ted for the review. Will open a follow-up JIRA to address JD's comments over here. Race in HCM.getMaster stalls clients Key: HBASE-6240 URL: https://issues.apache.org/jira/browse/HBASE-6240 Project: HBase Issue Type: Bug Affects Versions: 0.94.0 Reporter: Jean-Daniel Cryans Assignee: ramkrishna.s.vasudevan Priority: Critical Fix For: 0.94.1 Attachments: HBASE-6240.patch, HBASE-6240_1_0.94.patch I found this issue trying to run YCSB on 0.94; I don't think it exists on any other branch. I believe it was introduced in HBASE-5058 (Allow HBaseAdmin to use an existing connection). The issue is that HCM.getMaster follows this recipe:
# Check if the master is non-null and running (if so, return)
# Grab a lock on masterLock
# Nullify this.master
# Try to get a new master
The issue happens at step 3: it should re-run step 1, since while you were waiting on the lock someone else could have already fixed the master for you. What happens right now is that the threads are all able to set the master to null before the others get out of getMaster, and it's a complete mess. Figuring it out took me some time because it doesn't manifest itself right away; silent retries are done in the background.
Basically the first clue was this:
{noformat}
Error doing get: org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after attempts=10, exceptions:
Tue Jun 19 23:40:46 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:40:47 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:40:48 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:40:49 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:40:51 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:40:53 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:40:57 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:41:01 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:41:09 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
Tue Jun 19 23:41:25 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed
{noformat}
This was caused by the little dance up in HBaseAdmin where it deletes stale connections... which are not stale at all. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
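JD's recipe above is the classic broken double-checked pattern; the fix is to re-run the null check after acquiring the lock. A minimal stand-alone sketch of that shape (names and the String stand-in for the master proxy are hypothetical, not the actual HConnectionManager code):

```java
// Sketch: re-check under the lock, so a thread that waited on masterLock
// does not throw away a master that another thread just restored.
class MasterHolder {
    private final Object masterLock = new Object();
    private volatile String master;  // stands in for the HMaster proxy
    private int reconnects = 0;      // how many times we "re-made" the master

    String getMaster() {
        String m = master;
        if (m != null) return m;               // step 1: fast path, no lock
        synchronized (masterLock) {            // step 2: grab the lock
            if (master != null) return master; // re-run step 1: someone fixed it
            master = "master@" + (++reconnects); // steps 3-4: make a new master
            return master;
        }
    }

    int reconnectCount() { return reconnects; }
}
```

With the re-check, any number of callers racing through getMaster results in a single reconnect; the rest return the value the winner installed, instead of each nullifying and rebuilding the master in turn.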
[jira] [Created] (HBASE-6273) HMasterInterface.isMasterRunning() requires clean up
ramkrishna.s.vasudevan created HBASE-6273: - Summary: HMasterInterface.isMasterRunning() requires clean up Key: HBASE-6273 URL: https://issues.apache.org/jira/browse/HBASE-6273 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.94.0 Reporter: ramkrishna.s.vasudevan Fix For: 0.94.1 This JIRA is in reference to JD's comments regarding the clean up needed in isMasterRunning(). Refer to https://issues.apache.org/jira/browse/HBASE-6240?focusedCommentId=13400772page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13400772
[jira] [Commented] (HBASE-6240) Race in HCM.getMaster stalls clients
[ https://issues.apache.org/jira/browse/HBASE-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401534#comment-13401534 ] ramkrishna.s.vasudevan commented on HBASE-6240: --- HBASE-6273 raised. Race in HCM.getMaster stalls clients Key: HBASE-6240 URL: https://issues.apache.org/jira/browse/HBASE-6240 Project: HBase Issue Type: Bug Affects Versions: 0.94.0 Reporter: Jean-Daniel Cryans Assignee: ramkrishna.s.vasudevan Priority: Critical Fix For: 0.94.1 Attachments: HBASE-6240.patch, HBASE-6240_1_0.94.patch
[jira] [Commented] (HBASE-6228) Fixup daughters twice cause daughter region assigned twice
[ https://issues.apache.org/jira/browse/HBASE-6228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401539#comment-13401539 ] ramkrishna.s.vasudevan commented on HBASE-6228: --- @Jon So you are not OK with this fix, Jon? Fixup daughters twice cause daughter region assigned twice --- Key: HBASE-6228 URL: https://issues.apache.org/jira/browse/HBASE-6228 Project: HBase Issue Type: Bug Components: master Reporter: chunhui shen Assignee: chunhui shen Fix For: 0.96.0 Attachments: HBASE-6228.patch, HBASE-6228v2.patch, HBASE-6228v2.patch First, how does fixing up daughters twice happen?
1. We fix up daughters at the end of HMaster#finishInitialization.
2. ServerShutdownHandler also fixes up daughters when reassigning regions through ServerShutdownHandler#processDeadRegion.
When fixing up daughters we add the daughters to .META., but that cannot prevent the case above, because of FindDaughterVisitor. The detail is as follows. Suppose region A is a split parent region and its daughter region B is missing:
1. First, the ServerShutdownHandler thread fixes up the daughter, so it adds daughter region B to .META. with serverName=null and assigns the daughter.
2. Then, the Master's initialization thread will also find that daughter region B is missing and assign it, because FindDaughterVisitor considers a daughter missing if its serverName=null.
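The double assignment chunhui describes is a check-then-act race between two fixup paths. One hedged way to picture a fix (an illustration only, not the actual patch, which adjusts FindDaughterVisitor) is to make the "add daughter to .META. and assign" step atomic, so the second fixer observes the first one's work:

```java
import java.util.concurrent.ConcurrentHashMap;

// Illustration: two threads both notice daughter "B" is missing.
// putIfAbsent makes the meta insert atomic, so only the thread that actually
// inserted the row goes on to assign the region.
class DaughterFixup {
    private final ConcurrentHashMap<String, String> meta = new ConcurrentHashMap<>();
    private int assignments = 0;

    void fixupDaughter(String region) {
        // A null return means we were first to add the row, so we assign it;
        // a non-null return means some other fixer already handled it.
        if (meta.putIfAbsent(region, "PENDING_OPEN") == null) {
            assign(region);
        }
    }

    private synchronized void assign(String region) { assignments++; }

    synchronized int assignments() { return assignments; }
}
```

The key point is that "is the daughter missing?" and "record that we are handling it" happen as one atomic operation, not as a read followed by a separate write.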
[jira] [Commented] (HBASE-6060) Regions's in OPENING state from failed regionservers takes a long time to recover
[ https://issues.apache.org/jira/browse/HBASE-6060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401542#comment-13401542 ] ramkrishna.s.vasudevan commented on HBASE-6060: --- @Stack So for the latest version of the patch, can we move the step where the node is changed from OFFLINE to OPENING and let the remaining part be in the OpenRegionHandler? Regions's in OPENING state from failed regionservers takes a long time to recover - Key: HBASE-6060 URL: https://issues.apache.org/jira/browse/HBASE-6060 Project: HBase Issue Type: Bug Components: master, regionserver Reporter: Enis Soztutar Assignee: rajeshbabu Fix For: 0.96.0, 0.94.1, 0.92.3 Attachments: 6060-94-v3.patch, 6060-94-v4.patch, 6060-94-v4_1.patch, 6060-94-v4_1.patch, 6060-trunk.patch, 6060-trunk.patch, 6060-trunk_2.patch, 6060-trunk_3.patch, 6060_alternative_suggestion.txt, 6060_suggestion2_based_off_v3.patch, 6060_suggestion_based_off_v3.patch, 6060_suggestion_toassign_rs_wentdown_beforerequest.patch, HBASE-6060-92.patch, HBASE-6060-94.patch, HBASE-6060-trunk_4.patch, HBASE-6060_trunk_5.patch We have seen a pattern in tests: regions are stuck in OPENING state for a very long time when the region server that is opening the region fails. My understanding of the process:
- The master calls the rs to open the region. If the rs is offline, a new plan is generated (a new rs is chosen). RegionState is set to PENDING_OPEN (only in master memory; zk still shows OFFLINE). See HRegionServer.openRegion(), HMaster.assign().
- The RegionServer starts opening the region and changes the state in the znode. But that znode is not ephemeral. (See ZkAssign.)
- The rs transitions the zk node from OFFLINE to OPENING. See OpenRegionHandler.process().
- The rs then opens the region and changes the znode from OPENING to OPENED.
- When the rs is killed between the OPENING and OPENED states, zk shows the OPENING state and the master just waits for the rs to change the region state; but since the rs is down, that won't happen.
- There is an AssignmentManager.TimeoutMonitor, which guards exactly against these kinds of conditions. It periodically checks (every 10 sec by default) the regions in transition to see whether they timed out (hbase.master.assignment.timeoutmonitor.timeout). The default timeout is 30 min, which explains what you and I are seeing.
- ServerShutdownHandler in the Master does not reassign regions in the OPENING state, although it handles other states.
Lowering that threshold in the configuration is one option, but I still think we can do better. Will investigate more.
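The TimeoutMonitor behaviour Enis describes, periodically flagging regions whose transition has outlived hbase.master.assignment.timeoutmonitor.timeout (30 minutes by default), can be modelled in a few lines. This is a sketch of the check, not the HBase implementation:

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Model: region name -> time (ms) the region entered its current transition
// state. The monitor flags regions that have been in transition too long.
class TimeoutMonitorSketch {
    static final long DEFAULT_TIMEOUT_MS = 30L * 60 * 1000; // 30 min default

    static List<String> timedOut(Map<String, Long> inTransition, long now, long timeoutMs) {
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, Long> e : inTransition.entrySet()) {
            if (now - e.getValue() > timeoutMs) out.add(e.getKey()); // stuck too long
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, Long> rit = new HashMap<>();
        rit.put("regionA", 0L);           // entered OPENING at t=0
        rit.put("regionB", 29L * 60_000); // entered recently
        // At t=31min only regionA has exceeded the 30-minute default.
        System.out.println(timedOut(rit, 31L * 60_000, DEFAULT_TIMEOUT_MS));
    }
}
```

This also makes the complaint concrete: with a 10-second check period but a 30-minute threshold, a region orphaned in OPENING is scanned often but only acted on half an hour later.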
[jira] [Commented] (HBASE-6205) Support an option to keep data of dropped table for some time
[ https://issues.apache.org/jira/browse/HBASE-6205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401544#comment-13401544 ] Devaraj Das commented on HBASE-6205: Yeah, agree with Ramkrishna on the points. I'd expect little impact on the hbck tool but haven't thought about it deeply enough (ideally, the hbck tool shouldn't need to change - it should treat disable_delete tables the same way as it treats disabled tables today). Support an option to keep data of dropped table for some time - Key: HBASE-6205 URL: https://issues.apache.org/jira/browse/HBASE-6205 Project: HBase Issue Type: New Feature Affects Versions: 0.94.0, 0.96.0 Reporter: chunhui shen Assignee: chunhui shen Fix For: 0.96.0 Attachments: HBASE-6205.patch, HBASE-6205v2.patch, HBASE-6205v3.patch, HBASE-6205v4.patch, HBASE-6205v5.patch A user may drop a table accidentally because of erroneous code or other uncertain reasons. Unfortunately, it happened in our environment because one user made a mistake between the production cluster and the testing cluster. So I just give a suggestion: do we need to support an option to keep the data of a dropped table for some time, e.g. 1 day? In the patch: We make a new dir named .trashtables in the root dir. In the DeleteTableHandler, we move files in the dropped table's dir to the trash table dir instead of deleting them directly. And we create a new class TrashCleaner, which periodically cleans dropped tables once they time out. The default keep time for dropped tables is 1 day, and the check period is 1 hour.
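The mechanism in the description, rename into .trashtables and then have a periodic cleaner delete entries older than the keep time, can be sketched with java.nio.file. Paths, the timestamp-suffix naming scheme, and the helper names are all assumptions for illustration; the real patch works against HDFS through the Hadoop FileSystem API:

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Sketch: "drop" moves the table dir into .trashtables instead of deleting it;
// the cleaner removes trashed tables older than the keep time (default 1 day).
class TrashTables {
    static final long KEEP_MS = 24L * 60 * 60 * 1000; // default keep time: 1 day

    static Path moveToTrash(Path rootDir, Path tableDir) throws IOException {
        Path trash = rootDir.resolve(".trashtables");
        Files.createDirectories(trash);
        // Stamp the trash entry with the drop time so the cleaner can age it out.
        Path target = trash.resolve(tableDir.getFileName() + "." + System.currentTimeMillis());
        return Files.move(tableDir, target);
    }

    // The periodic cleaner (check period: 1 hour in the patch) would delete
    // every trash entry for which this returns true.
    static boolean expired(Path trashed, long now) {
        String name = trashed.getFileName().toString();
        long droppedAt = Long.parseLong(name.substring(name.lastIndexOf('.') + 1));
        return now - droppedAt > KEEP_MS;
    }
}
```

The design point is that drop becomes a cheap rename, so an accidental drop is recoverable for the keep window at the cost of the data's disk space.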
[jira] [Created] (HBASE-6274) Proto files should be in the same place
Jimmy Xiang created HBASE-6274: -- Summary: Proto files should be in the same place Key: HBASE-6274 URL: https://issues.apache.org/jira/browse/HBASE-6274 Project: HBase Issue Type: Improvement Affects Versions: 0.96.0 Reporter: Jimmy Xiang Priority: Trivial Fix For: 0.96.0 Currently, proto files are under hbase-server/src/main/protobuf and hbase-server/src/protobuf. It's better to put them together.
[jira] [Commented] (HBASE-6273) HMasterInterface.isMasterRunning() requires clean up
[ https://issues.apache.org/jira/browse/HBASE-6273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401552#comment-13401552 ] Jean-Daniel Cryans commented on HBASE-6273: --- I think we should have two exceptions:
- MasterNotRunning (you can contact the machine, but you get a connection refused; maybe I'd also include PleaseHoldException)
- MasterUnreachable (unknown host, EOF, and probably other IOEs)
This will really help operability; it has happened a couple of times on the mailing list that someone would say "I got MasterNotRunning but it's running, I can use it" when all they had was a connectivity issue. I'd prefer we don't do this for a point release. HMasterInterface.isMasterRunning() requires clean up Key: HBASE-6273 URL: https://issues.apache.org/jira/browse/HBASE-6273 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.94.0 Reporter: ramkrishna.s.vasudevan Fix For: 0.94.1 This JIRA is in reference to JD's comments regarding the clean up needed in isMasterRunning(). Refer to https://issues.apache.org/jira/browse/HBASE-6240?focusedCommentId=13400772page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13400772
[jira] [Commented] (HBASE-5967) OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type
[ https://issues.apache.org/jira/browse/HBASE-5967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401556#comment-13401556 ] Hudson commented on HBASE-5967: --- Integrated in HBase-TRUNK #3074 (See [https://builds.apache.org/job/HBase-TRUNK/3074/]) HBASE-5967 OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type (Gregory) (Revision 1354098) Result = SUCCESS tedyu : Files : * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/ClusterStatus.java * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/ServerLoad.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/master/TestMasterNoCluster.java OpenDataException because HBaseProtos.ServerLoad cannot be converted to an open data type - Key: HBASE-5967 URL: https://issues.apache.org/jira/browse/HBASE-5967 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Jimmy Xiang Assignee: Gregory Chanan Priority: Minor Fix For: 0.96.0 Attachments: 5967-v2.patch, HBASE-5967-v2.patch, HBASE-5967.patch, master.log I saw this error in the master log:
Caused by: java.lang.IllegalArgumentException: Method org.apache.hadoop.hbase.master.MXBean.getRegionServers has parameter or return type that cannot be translated into an open type
at com.sun.jmx.mbeanserver.ConvertingMethod.from(ConvertingMethod.java:32)
at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:63)
at com.sun.jmx.mbeanserver.MXBeanIntrospector.mFrom(MXBeanIntrospector.java:33)
at com.sun.jmx.mbeanserver.MBeanAnalyzer.initMaps(MBeanAnalyzer.java:118)
at com.sun.jmx.mbeanserver.MBeanAnalyzer.init(MBeanAnalyzer.java:99)
... 14 more
Caused by: javax.management.openmbean.OpenDataException: Cannot convert type: java.util.Map<java.lang.String, org.apache.hadoop.hbase.ServerLoad>
at com.sun.jmx.mbeanserver.OpenConverter.openDataException(OpenConverter.jav
[jira] [Commented] (HBASE-6272) In-memory region state is inconsistent
[ https://issues.apache.org/jira/browse/HBASE-6272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401566#comment-13401566 ] Jimmy Xiang commented on HBASE-6272: One example is:
{code}
void regionOnline(HRegionInfo regionInfo, ServerName sn) {
  // no lock, concurrency ok.
  this.regionsInTransition.remove(regionInfo.getEncodedName());
  synchronized (this.regions) {
    // Add check
    ServerName oldSn = this.regions.get(regionInfo);
    if (oldSn != null) {
      LOG.warn("Overwriting " + regionInfo.getEncodedName() + " on " + oldSn + " with " + sn);
    }
    if (isServerOnline(sn)) {
      this.regions.put(regionInfo, sn);
      addToServers(sn, regionInfo);
      this.regions.notifyAll();
    } else {
      LOG.info("The server is not in online servers, ServerName=" + sn.getServerName() + ", region=" + regionInfo.getEncodedName());
    }
  }
}
{code}
If the server is not online any more, the region ends up neither in transition nor online. In-memory region state is inconsistent -- Key: HBASE-6272 URL: https://issues.apache.org/jira/browse/HBASE-6272 Project: HBase Issue Type: Bug Reporter: Jimmy Xiang Assignee: Jimmy Xiang AssignmentManager stores region-state-related information in several places: regionsInTransition, regions (region info to server name map), and servers (server name to region info set map). However, access to these places is not coordinated properly. This leads to inconsistent in-memory region state information. Sometimes a region could even be offline and not in transition.
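The lost-region case Jimmy points out comes from removing the region from regionsInTransition outside the lock, then declining to add it to the online map. A hedged sketch of the invariant-preserving shape (an illustration, not the actual AssignmentManager): one lock guards both structures, and a region always lands in exactly one of them.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Sketch: every state change moves a region between the two collections under
// one lock, so "neither online nor in transition" can never be observed.
class RegionStates {
    private final Map<String, String> online = new HashMap<>(); // region -> server
    private final Set<String> inTransition = new HashSet<>();
    private final Set<String> offlineServers = new HashSet<>();

    synchronized void startTransition(String region) {
        online.remove(region);
        inTransition.add(region);
    }

    synchronized void regionOnline(String region, String server) {
        if (offlineServers.contains(server)) {
            // Target server died meanwhile: keep the region in transition so it
            // can be re-assigned, instead of dropping it from both collections.
            return;
        }
        inTransition.remove(region);
        online.put(region, server);
    }

    synchronized void serverOffline(String server) { offlineServers.add(server); }

    synchronized boolean lost(String region) {
        return !online.containsKey(region) && !inTransition.contains(region);
    }
}
```

The contrast with the quoted snippet: there, the remove from regionsInTransition has already happened by the time the "server not online" branch runs, so the region vanishes from both views; here the remove and the add are a single atomic decision.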
[jira] [Updated] (HBASE-5061) StoreFileLocalityChecker
[ https://issues.apache.org/jira/browse/HBASE-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-5061: -- Attachment: (was: StoreFileLocalityChecker.java) StoreFileLocalityChecker Key: HBASE-5061 URL: https://issues.apache.org/jira/browse/HBASE-5061 Project: HBase Issue Type: New Feature Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor org.apache.hadoop.hbase.HFileLocalityChecker [options] A tool to report the number of local and nonlocal HFile blocks, and the ratio of the two as a percentage. Where options are:
|-f file|Analyze a store file|
|-r region|Analyze all store files for the region|
|-t table|Analyze all store files for regions of the table served by the local regionserver|
|-h host|Consider host local, defaults to the local host|
|-v|Verbose operation|
[jira] [Updated] (HBASE-5061) StoreFileLocalityChecker
[ https://issues.apache.org/jira/browse/HBASE-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-5061: -- Attachment: HBASE-5061-0.94.patch HBASE-5061.patch
[jira] [Updated] (HBASE-5061) StoreFileLocalityChecker
[ https://issues.apache.org/jira/browse/HBASE-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-5061: -- Affects Version/s: 0.94.1 0.96.0 Status: Patch Available (was: Open)
[jira] [Commented] (HBASE-5061) StoreFileLocalityChecker
[ https://issues.apache.org/jira/browse/HBASE-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401571#comment-13401571 ] Andrew Purtell commented on HBASE-5061: --- Updated patch adds '-j' option to produce JSON output. StoreFileLocalityChecker Key: HBASE-5061 URL: https://issues.apache.org/jira/browse/HBASE-5061 Project: HBase Issue Type: New Feature Affects Versions: 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: HBASE-5061-0.94.patch, HBASE-5061.patch org.apache.hadoop.hbase.HFileLocalityChecker [options] A tool to report the number of local and nonlocal HFile blocks, and the ratio of as a percentage. Where options are: |-f file|Analyze a store file| |-r region|Analyze all store files for the region| |-t table|Analyze all store files for regions of the table served by the local regionserver| |-h host|Consider host local, defaults to the local host| |-v|Verbose operation| -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6263: -- Component/s: (was: scripts) Assignee: Andrew Purtell Summary: Use default mode for HBase Thrift gateway if not specified (was: Binscript should pick a default mode for HBase Thrift gateway) Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
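The AssertionError above is thrown because exactly one of the mode flags must currently be passed on the command line. A minimal sketch of the defaulting behavior the issue asks for — this is a hypothetical standalone helper, not the actual HBASE-6263 patch, and the choice of {{-threadpool}} as the fallback is an assumption:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ThriftModeDefault {
    static final List<String> MODES =
        Arrays.asList("-hsha", "-nonblocking", "-threadpool", "-threadedselector");
    // Assumed default; the real patch may pick a different ImplType.
    static final String DEFAULT_MODE = "-threadpool";

    // Returns the selected server mode, falling back to a default instead of
    // asserting when no mode flag is present on the command line.
    public static String selectMode(String[] args) {
        List<String> given = new ArrayList<>();
        for (String a : args) {
            if (MODES.contains(a)) given.add(a);
        }
        if (given.isEmpty()) return DEFAULT_MODE; // was: AssertionError
        if (given.size() > 1) {
            throw new IllegalArgumentException("At most one mode may be given: " + given);
        }
        return given.get(0);
    }

    public static void main(String[] args) {
        // No mode flag given: the helper falls back to the default.
        System.out.println(selectMode(new String[] {"--port", "9090"}));
    }
}
```

Resolving the mode before {{ImplType.setServerImpl}} runs would also let a value from hbase-site.xml take precedence over the hardcoded default, which is what HBASE-6166 (resolved below as a duplicate) asked for.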
[jira] [Updated] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6263: -- Attachment: HBASE-6263-0.94.patch HBASE-6263.patch Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6263: -- Affects Version/s: (was: 0.92.1) Status: Patch Available (was: Open) Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6263: -- Attachment: (was: HBASE-6263-0.94.patch) Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6263: -- Attachment: (was: HBASE-6263.patch) Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6263: -- Attachment: HBASE-6263-0.94.patch HBASE-6263.patch Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HBASE-6275) Add conditional Hadoop properties assigment
Lars George created HBASE-6275: -- Summary: Add conditional Hadoop properties assigment Key: HBASE-6275 URL: https://issues.apache.org/jira/browse/HBASE-6275 Project: HBase Issue Type: Improvement Components: client, master, regionserver Affects Versions: 0.96.0 Reporter: Lars George Priority: Minor Fix For: 0.96.0 See https://issues.apache.org/jira/browse/HBASE-3639, we should use VersionInfo to put the proper one in, yet only one of them. Currently we always get this message when you start a daemon or the shell: {noformat}2012-06-25 16:13:44,819 WARN org.apache.hadoop.conf.Configuration: fs.default.name is deprecated. Instead, use fs.defaultFS{noformat} As well as this subsequently sporting the same issue: {noformat}2012-06-25 16:13:44,819 WARN org.apache.hadoop.conf.Configuration: mapred.task.id is deprecated. Instead, use mapreduce.task.attempt.id{noformat} And the shell does: {noformat}12/06/25 16:05:26 WARN conf.Configuration: hadoop.native.lib is deprecated. Instead, use io.native.lib.available{noformat} Talking to Stack he suggest: {quote}We should make a little function under util to do it because it will be reused in a bunch of places (in daemons, shell, out in scripts, etc).{quote} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
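The "little function under util" Stack suggests might look like the sketch below: pick the property name appropriate for the detected Hadoop version so the deprecation warning is never triggered. This is a hypothetical illustration — the real helper would consult {{org.apache.hadoop.util.VersionInfo.getVersion()}}, which is stubbed out here as a plain parameter to keep the sketch self-contained:

```java
public class ConditionalProperty {
    // Returns the filesystem property key to use for the given Hadoop version
    // string. Hadoop 2.x deprecated fs.default.name in favor of fs.defaultFS.
    public static String fsDefaultKey(String hadoopVersion) {
        return hadoopVersion.startsWith("2.") ? "fs.defaultFS" : "fs.default.name";
    }

    public static void main(String[] args) {
        System.out.println(fsDefaultKey("2.0.0")); // fs.defaultFS
        System.out.println(fsDefaultKey("1.0.3")); // fs.default.name
    }
}
```

Keeping the version check in one utility keeps the daemons, shell, and scripts from each reimplementing the conditional assignment.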
[jira] [Commented] (HBASE-6263) Use default mode for HBase Thrift gateway if not specified
[ https://issues.apache.org/jira/browse/HBASE-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401589#comment-13401589 ] Hadoop QA commented on HBASE-6263: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533517/HBASE-6263-0.94.patch against trunk revision . +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. -1 patch. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2263//console This message is automatically generated. Use default mode for HBase Thrift gateway if not specified -- Key: HBASE-6263 URL: https://issues.apache.org/jira/browse/HBASE-6263 Project: HBase Issue Type: Bug Components: thrift Affects Versions: 0.94.0, 0.96.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Labels: noob Attachments: HBASE-6263-0.94.patch, HBASE-6263.patch The Thrift gateway should start with a default mode if one is not selected. Currently, instead we see: {noformat} Exception in thread main java.lang.AssertionError: Exactly one option out of [-hsha, -nonblocking, -threadpool, -threadedselector] has to be specified at org.apache.hadoop.hbase.thrift.ThriftServerRunner$ImplType.setServerImpl(ThriftServerRunner.java:201) at org.apache.hadoop.hbase.thrift.ThriftServer.processOptions(ThriftServer.java:169) at org.apache.hadoop.hbase.thrift.ThriftServer.doMain(ThriftServer.java:85) at org.apache.hadoop.hbase.thrift.ThriftServer.main(ThriftServer.java:192) {noformat} See also BIGTOP-648. -- This message is automatically generated by JIRA. 
[jira] [Resolved] (HBASE-6166) Allow thrift to start with the server type specified in config
[ https://issues.apache.org/jira/browse/HBASE-6166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Elliott Clark resolved HBASE-6166. -- Resolution: Duplicate Dupe HBASE-6263 Allow thrift to start with the server type specified in config - Key: HBASE-6166 URL: https://issues.apache.org/jira/browse/HBASE-6166 Project: HBase Issue Type: Improvement Reporter: Elliott Clark Currently the thrift server type must be specified on the command line. If it's already in config it shouldn't be needed. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6170) Timeouts for row lock and scan should be separate
[ https://issues.apache.org/jira/browse/HBASE-6170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Trezzo updated HBASE-6170: Attachment: HBASE-6170v1.patch Last run failed due to OOM exception. Resubmitting the patch to get another run. Chris Timeouts for row lock and scan should be separate - Key: HBASE-6170 URL: https://issues.apache.org/jira/browse/HBASE-6170 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.94.0 Reporter: Otis Gospodnetic Assignee: Chris Trezzo Priority: Minor Fix For: 0.96.0 Attachments: HBASE-6170v1.patch, HBASE-6170v1.patch Apparently the timeout used for row locking and for scanning is global. It would be better to have two separate timeouts. (opening the issue to make Lars George happy) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6275) Add conditional Hadoop properties assignment
[ https://issues.apache.org/jira/browse/HBASE-6275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars George updated HBASE-6275: --- Summary: Add conditional Hadoop properties assignment (was: Add conditional Hadoop properties assigment) Add conditional Hadoop properties assignment Key: HBASE-6275 URL: https://issues.apache.org/jira/browse/HBASE-6275 Project: HBase Issue Type: Improvement Components: client, master, regionserver Affects Versions: 0.96.0 Reporter: Lars George Priority: Minor Fix For: 0.96.0 See https://issues.apache.org/jira/browse/HBASE-3639, we should use VersionInfo to put the proper one in, yet only one of them. Currently we always get this message when you start a daemon or the shell: {noformat}2012-06-25 16:13:44,819 WARN org.apache.hadoop.conf.Configuration: fs.default.name is deprecated. Instead, use fs.defaultFS{noformat} As well as this subsequently sporting the same issue: {noformat}2012-06-25 16:13:44,819 WARN org.apache.hadoop.conf.Configuration: mapred.task.id is deprecated. Instead, use mapreduce.task.attempt.id{noformat} And the shell does: {noformat}12/06/25 16:05:26 WARN conf.Configuration: hadoop.native.lib is deprecated. Instead, use io.native.lib.available{noformat} Talking to Stack he suggest: {quote}We should make a little function under util to do it because it will be reused in a bunch of places (in daemons, shell, out in scripts, etc).{quote} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-5061) StoreFileLocalityChecker
[ https://issues.apache.org/jira/browse/HBASE-5061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401612#comment-13401612 ] Hadoop QA commented on HBASE-5061: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533510/HBASE-5061-0.94.patch against trunk revision . +1 @author. The patch does not contain any @author tags. -1 tests included. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 6 new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.hbase.io.hfile.TestForceCacheImportantBlocks Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2262//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2262//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2262//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2262//console This message is automatically generated. 
StoreFileLocalityChecker Key: HBASE-5061 URL: https://issues.apache.org/jira/browse/HBASE-5061 Project: HBase Issue Type: New Feature Affects Versions: 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: HBASE-5061-0.94.patch, HBASE-5061.patch org.apache.hadoop.hbase.HFileLocalityChecker [options] A tool to report the number of local and nonlocal HFile blocks, and the ratio of as a percentage. Where options are: |-f file|Analyze a store file| |-r region|Analyze all store files for the region| |-t table|Analyze all store files for regions of the table served by the local regionserver| |-h host|Consider host local, defaults to the local host| |-v|Verbose operation| -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-6240) Race in HCM.getMaster stalls clients
[ https://issues.apache.org/jira/browse/HBASE-6240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401614#comment-13401614 ] Hudson commented on HBASE-6240: --- Integrated in HBase-0.94 #282 (See [https://builds.apache.org/job/HBase-0.94/282/]) HBASE-6240 Race in HCM.getMaster stalls clients Submitted by:J-D, Ram Reviewed by:J-D, Ted (Revision 1354116) Result = FAILURE ramkrishna : Files : * /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java Race in HCM.getMaster stalls clients Key: HBASE-6240 URL: https://issues.apache.org/jira/browse/HBASE-6240 Project: HBase Issue Type: Bug Affects Versions: 0.94.0 Reporter: Jean-Daniel Cryans Assignee: ramkrishna.s.vasudevan Priority: Critical Fix For: 0.94.1 Attachments: HBASE-6240.patch, HBASE-6240_1_0.94.patch I found this issue trying to run YCSB on 0.94, I don't think it exists on any other branch. I believe that this was introduced in HBASE-5058 Allow HBaseAdmin to use an existing connection. The issue is that in HCM.getMaster it does this recipe: # Check if the master is null and runs (if so, return) # Grab a lock on masterLock # nullify this.master # try to get a new master The issue happens at 3, it should re-run 1 since while you're waiting on the lock someone else could have already fixed it for you. What happens right now is that the threads are all able to set the master to null before others are able to get out of getMaster and it's a complete mess. Figuring it out took me some time because it doesn't manifest itself right away, silent retries are done in the background. 
Basically the first clue was this: {noformat} Error doing get: org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after attempts=10, exceptions: Tue Jun 19 23:40:46 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:40:47 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:40:48 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:40:49 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:40:51 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:40:53 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:40:57 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:41:01 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:41:09 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed Tue Jun 19 23:41:25 UTC 2012, org.apache.hadoop.hbase.client.HTable$3@571a4bd4, java.io.IOException: 
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation@2eb0a3f5 closed {noformat} This was caused by the little dance up in HBaseAdmin where it deletes stale connections... which are not stale at all. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-6170) Timeouts for row lock and scan should be separate
[ https://issues.apache.org/jira/browse/HBASE-6170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401638#comment-13401638 ] Hadoop QA commented on HBASE-6170: -- -1 overall. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12533519/HBASE-6170v1.patch against trunk revision . +1 @author. The patch does not contain any @author tags. +1 tests included. The patch appears to include 6 new or modified tests. +1 hadoop2.0. The patch compiles against the hadoop 2.0 profile. +1 javadoc. The javadoc tool did not generate any warning messages. +1 javac. The applied patch does not increase the total number of javac compiler warnings. -1 findbugs. The patch appears to introduce 6 new Findbugs (version 1.3.9) warnings. +1 release audit. The applied patch does not increase the total number of release audit warnings. -1 core tests. The patch failed these unit tests: org.apache.hadoop.hbase.security.access.TestZKPermissionsWatcher org.apache.hadoop.hbase.client.TestAdmin Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/2264//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2264//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/2264//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/2264//console This message is automatically generated. Timeouts for row lock and scan should be separate - Key: HBASE-6170 URL: https://issues.apache.org/jira/browse/HBASE-6170 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.94.0 Reporter: Otis Gospodnetic Assignee: Chris Trezzo Priority: Minor Fix For: 0.96.0 Attachments: HBASE-6170v1.patch, HBASE-6170v1.patch Apparently the timeout used for row locking and for scanning is global. 
It would be better to have two separate timeouts. (opening the issue to make Lars George happy) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-6200: -- Status: Open (was: Patch Available) KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.94.0, 0.92.1, 0.90.6 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-trunk-v2.patch, 6200-trunk-v3.patch, 6200-trunk-v4.txt As reported by Desert Rose on IRC and on the ML, {{Result}} has a weird behavior when some families share the same prefix. He posted a link to his code to show how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers so f:a is said to be bigger than f1:, which is false. Then what happens is that the KVs are returned in the right order from the RS but then doing {{Result.binarySearch}} it uses {{KeyComparator.compareWithoutRow}} which has a different sorting so the end result is undetermined. I added some debug and I can see that the data is returned in the right order but {{Arrays.binarySearch}} returned the wrong KV, which is then verified agains the passed family and qualifier which fails so null is returned. I don't know how frequent it is for users to have families with the same prefix, but those that do have that and that use those families at the same time will have big correctness issues. This is why I mark this as a blocker. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
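The ordering bug above can be reproduced in miniature: if family and qualifier are compared as one concatenated run of bytes, f:a sorts after f1:, even though family "f" precedes family "f1". Comparing the family first restores the order. This uses String stand-ins for the byte[] arrays the real {{KeyComparator}} works on, purely for illustration:

```java
public class ColumnCompare {
    // Buggy shape: treats family+qualifier as one opaque byte run, so the
    // comparison of 'a' (0x61) against '1' (0x31) decides the order.
    static int buggyCompare(String fam1, String qual1, String fam2, String qual2) {
        return (fam1 + qual1).compareTo(fam2 + qual2);
    }

    // Fixed shape: order by family first, then by qualifier within a family.
    static int fixedCompare(String fam1, String qual1, String fam2, String qual2) {
        int cmp = fam1.compareTo(fam2);
        return cmp != 0 ? cmp : qual1.compareTo(qual2);
    }

    public static void main(String[] args) {
        System.out.println(buggyCompare("f", "a", "f1", "") > 0); // true: wrong order
        System.out.println(fixedCompare("f", "a", "f1", "") < 0); // true: f: before f1:
    }
}
```

This mismatch is exactly why {{Result.binarySearch}} lands on the wrong KV: the region server returns KVs in the correct order, but the binary search assumes the buggy order.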
[jira] [Commented] (HBASE-6226) move DataBlockEncoding and related classes to hbase-common module
[ https://issues.apache.org/jira/browse/HBASE-6226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401665#comment-13401665 ] Zhihong Ted Yu commented on HBASE-6226: --- @Matt: Uploading patch onto https://reviews.apache.org is another option. move DataBlockEncoding and related classes to hbase-common module - Key: HBASE-6226 URL: https://issues.apache.org/jira/browse/HBASE-6226 Project: HBase Issue Type: Improvement Components: io, regionserver Affects Versions: 0.96.0 Reporter: Matt Corgan Assignee: Matt Corgan Attachments: HBASE-6226-v1.patch In order to isolate the implementation details of HBASE-4676 (PrefixTrie encoding) and other DataBlockEncoders by putting them in modules, this pulls up the DataBlockEncoding related interfaces into hbase-common. No tests are moved in this patch. The only notable change was trimming a few dependencies on HFileBlock which adds dependencies to much of the regionserver. The test suite passes locally for me. I tried to keep it as simple as possible... let me know if there are any concerns. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4145) Provide metrics for hbase client
[ https://issues.apache.org/jira/browse/HBASE-4145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401683#comment-13401683 ] Jean-Daniel Cryans commented on HBASE-4145: --- I just stumbled upon this code, it seems there's an issue in {{TableRecordReaderImpl}}. Calling restart() does this: {code} public void restart(byte[] firstRow) throws IOException { currentScan = new Scan(scan); {code} Which by itself is fine since the metrics will be copied from *scan* to *currentScan*, except that it's *currentScan* that has the updated metrics not *scan*. In other words, *currentScan* is the object that is used for scanning so it contains the metrics. If restart() is called, that object is overwritten by the original definition of the {{Scan}}. I think to fix this we could grab the metrics from *currentScan* first then set them back on the new object. Provide metrics for hbase client Key: HBASE-4145 URL: https://issues.apache.org/jira/browse/HBASE-4145 Project: HBase Issue Type: Improvement Reporter: Ming Ma Assignee: Ming Ma Fix For: 0.94.0 Attachments: HBaseClientSideMetrics.jpg Sometimes it is useful to get some metrics from hbase client point of view. This will help understand the metrics for scan/TableInputFormat map job scenario. What to capture, for example, for each ResultScanner object, 1. The number of RPC calls to RSs. 2. The delta time between consecutive RPC calls in the current serialized scan implementation. 3. The number of RPC retry to RSs. 4. The number of NotServingRegionException got. 5. The number of remote RPC calls. This excludes those call that hbase client calls the RS on the same machine. 6. The number of regions accessed. How to capture 1. Metrics framework works for a fixed number of metrics. It doesn't fit this scenario. 2. Use some TBD solution in HBase to capture such dynamic metrics. 
If we assume there is a solution in HBase that the HBase client can use to log this kind of metrics, TableInputFormat can pass the MapReduce task ID to the HBase client as an application scan ID, a small addition to the existing scan API, and the HBase client can log metrics tagged with that ID. That would allow later querying and analysis of the metrics data for a specific MapReduce job. 3. Expose via MapReduce counters. This lacks certain features; for example, there is no good way to access the metrics on a per-map-instance basis, and the MapReduce framework only sums the counter values, so it is tricky to find the max of certain metrics across all mapper instances. However, it might be good enough for now. With this approach, the metrics values will be available via MapReduce counters. a) Have ResultScanner return a new ResultScannerMetrics interface. b) TableInputFormat will access data from ResultScannerMetrics and populate MapReduce counters accordingly.
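J-D's restart() observation above can be modeled without an HBase cluster. In the sketch below, ScanModel is a hypothetical stand-in for HBase's Scan (the names and the plain `metrics` field are invented for illustration; the real metrics plumbing differs). It only shows why rebuilding *currentScan* from the pristine template drops the accumulated metrics, and how the suggested fix, grabbing the metrics from *currentScan* first and setting them back on the new object, preserves them.

```java
// Hypothetical model of TableRecordReaderImpl's restart() metrics bug.
// ScanModel stands in for Scan; "metrics" stands in for the metrics that
// accumulate on the scan object actually used for scanning.
class RestartMetricsSketch {
    static class ScanModel {
        long metrics; // accumulated scan metrics

        ScanModel() {}

        // Copy constructor, like new Scan(scan): copies state from the source.
        ScanModel(ScanModel other) {
            this.metrics = other.metrics;
        }
    }

    // Buggy restart: rebuilding currentScan from the pristine template
    // overwrites the metrics accumulated on the active scan.
    static ScanModel restartBuggy(ScanModel template, ScanModel currentScan) {
        return new ScanModel(template);
    }

    // Suggested fix: save the metrics from currentScan first, then restore
    // them on the freshly built scan object.
    static ScanModel restartFixed(ScanModel template, ScanModel currentScan) {
        long saved = currentScan.metrics;
        ScanModel fresh = new ScanModel(template);
        fresh.metrics = saved;
        return fresh;
    }

    public static void main(String[] args) {
        ScanModel template = new ScanModel();
        ScanModel current = new ScanModel(template);
        current.metrics = 42; // pretend some scanning accumulated metrics

        System.out.println(restartBuggy(template, current).metrics); // 0: lost
        System.out.println(restartFixed(template, current).metrics); // 42: kept
    }
}
```

The copy constructor copies from the template in both cases; the only difference is whether the caller carries the live metrics forward across the rebuild.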
[jira] [Commented] (HBASE-6228) Fixup daughters twice cause daughter region assigned twice
[ https://issues.apache.org/jira/browse/HBASE-6228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401690#comment-13401690 ] Jonathan Hsieh commented on HBASE-6228: --- I'm -0. (Not going to block if others are ok with it, but am just uncomfortable since there are no tests). I've only recently started spending time looking at the recovery/bugs/races, but my general impression is that I cannot easily tell if this patch (and several similar to it) is just pushing a race from one place to another. If we add tests, we can verify that this change doesn't re-introduce a previously solved problem. I haven't thought out the locking idea yet, but it seems that if we have state races, locks could simplify the reasoning and may eliminate classes of subtle bugs. Fixup daughters twice cause daughter region assigned twice --- Key: HBASE-6228 URL: https://issues.apache.org/jira/browse/HBASE-6228 Project: HBase Issue Type: Bug Components: master Reporter: chunhui shen Assignee: chunhui shen Fix For: 0.96.0 Attachments: HBASE-6228.patch, HBASE-6228v2.patch, HBASE-6228v2.patch First, how does fixing up daughters twice happen? 1. We fixupDaughters at the end of HMaster#finishInitialization. 2. ServerShutdownHandler will fixupDaughters when reassigning regions through ServerShutdownHandler#processDeadRegion. When fixing up daughters, we add the daughters to .META., but that couldn't prevent the above case, because of FindDaughterVisitor. The details are as follows: Suppose region A is a split parent region, and its daughter region B is missing. 1. First, the ServerShutdownHandler thread fixes up the daughter, so it adds daughter region B to .META. with serverName=null and assigns the daughter. 2. Then, the Master's initialization thread will also find that daughter region B is missing and assign it. This is because FindDaughterVisitor considers a daughter missing if its serverName=null.
[jira] [Updated] (HBASE-6200) KeyComparator.compareWithoutRow can be wrong when families have the same prefix
[ https://issues.apache.org/jira/browse/HBASE-6200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhihong Ted Yu updated HBASE-6200: -- Attachment: 6200-0.94.txt Patch for 0.94 KeyComparator.compareWithoutRow can be wrong when families have the same prefix --- Key: HBASE-6200 URL: https://issues.apache.org/jira/browse/HBASE-6200 Project: HBase Issue Type: Bug Affects Versions: 0.90.6, 0.92.1, 0.94.0 Reporter: Jean-Daniel Cryans Assignee: Jieshan Bean Priority: Blocker Fix For: 0.90.7, 0.92.2, 0.96.0, 0.94.1 Attachments: 6200-0.92.txt, 6200-0.94.txt, 6200-trunk-v2.patch, 6200-trunk-v3.patch, 6200-trunk-v4.txt As reported by Desert Rose on IRC and on the ML, {{Result}} has a weird behavior when some families share the same prefix. He posted a link to his code to show how it fails, http://pastebin.com/7TBA1XGh Basically {{KeyComparator.compareWithoutRow}} doesn't differentiate families and qualifiers, so f:a is said to be bigger than f1:, which is false. Then what happens is that the KVs are returned in the right order from the RS, but {{Result.binarySearch}} uses {{KeyComparator.compareWithoutRow}}, which sorts differently, so the end result is undetermined. I added some debugging and I can see that the data is returned in the right order, but {{Arrays.binarySearch}} returned the wrong KV, which is then verified against the passed family and qualifier; that check fails, so null is returned. I don't know how frequent it is for users to have families with the same prefix, but those that do have that and use those families at the same time will have big correctness issues. This is why I mark this as a blocker.
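The f:a vs f1: mix-up can be reproduced with plain byte comparison. The sketch below is illustrative, not the actual KeyComparator code: it compares the family alone (the correct KeyValue order) against the family and qualifier concatenated with no length boundary, which is effectively what the reported bug amounts to.

```java
// Illustration of the family/qualifier ordering bug from HBASE-6200.
// compareBytes is a minimal lexicographic unsigned-byte comparison in the
// spirit of HBase's Bytes.compareTo (not the actual implementation).
class CompareWithoutRowSketch {
    static int compareBytes(byte[] a, byte[] b) {
        int len = Math.min(a.length, b.length);
        for (int i = 0; i < len; i++) {
            int d = (a[i] & 0xff) - (b[i] & 0xff);
            if (d != 0) return d;
        }
        return a.length - b.length; // on a common prefix, the shorter sorts first
    }

    public static void main(String[] args) {
        // Correct order compares the family first: "f" < "f1",
        // so f:a must sort before f1:.
        int correct = compareBytes("f".getBytes(), "f1".getBytes());
        System.out.println(correct < 0); // true

        // Without a family/qualifier boundary the comparison sees
        // "fa" vs "f1": 'a' (0x61) > '1' (0x31), so f:a wrongly sorts
        // after f1: -- the opposite of the correct order.
        int buggy = compareBytes("fa".getBytes(), "f1".getBytes());
        System.out.println(buggy > 0); // true
    }
}
```

Since the RS returns KVs in the correct order but {{Result.binarySearch}} compares with the buggy order, the two disagree exactly on pairs like this, which is why the lookup can land on the wrong KV.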
[jira] [Updated] (HBASE-6170) Timeouts for row lock and scan should be separate
[ https://issues.apache.org/jira/browse/HBASE-6170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Trezzo updated HBASE-6170: Attachment: HBASE-6170v1.patch One more run. Chris Timeouts for row lock and scan should be separate - Key: HBASE-6170 URL: https://issues.apache.org/jira/browse/HBASE-6170 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.94.0 Reporter: Otis Gospodnetic Assignee: Chris Trezzo Priority: Minor Fix For: 0.96.0 Attachments: HBASE-6170v1.patch, HBASE-6170v1.patch, HBASE-6170v1.patch Apparently the timeout used for row locking and for scanning is global. It would be better to have two separate timeouts. (opening the issue to make Lars George happy)
[jira] [Created] (HBASE-6276) TestClassLoading is racy
Andrew Purtell created HBASE-6276: - Summary: TestClassLoading is racy Key: HBASE-6276 URL: https://issues.apache.org/jira/browse/HBASE-6276 Project: HBase Issue Type: Bug Components: coprocessors, test Affects Versions: 0.92.2, 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell
[jira] [Updated] (HBASE-6276) TestClassLoading is racy
[ https://issues.apache.org/jira/browse/HBASE-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6276: -- Attachment: HBASE-6276-0.94.patch HBASE-6276.patch TestClassLoading is racy Key: HBASE-6276 URL: https://issues.apache.org/jira/browse/HBASE-6276 Project: HBase Issue Type: Bug Components: coprocessors, test Affects Versions: 0.92.2, 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Attachments: HBASE-6276-0.94.patch, HBASE-6276.patch
[jira] [Resolved] (HBASE-6276) TestClassLoading is racy
[ https://issues.apache.org/jira/browse/HBASE-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell resolved HBASE-6276. --- Resolution: Fixed Committed trivial patch to trunk, 0.94, and 0.92 branches. TestClassLoading passes locally on all. TestClassLoading is racy Key: HBASE-6276 URL: https://issues.apache.org/jira/browse/HBASE-6276 Project: HBase Issue Type: Bug Components: coprocessors, test Affects Versions: 0.92.2, 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: HBASE-6276-0.94.patch, HBASE-6276.patch
[jira] [Updated] (HBASE-6276) TestClassLoading is racy
[ https://issues.apache.org/jira/browse/HBASE-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6276: -- Priority: Minor (was: Major) TestClassLoading is racy Key: HBASE-6276 URL: https://issues.apache.org/jira/browse/HBASE-6276 Project: HBase Issue Type: Bug Components: coprocessors, test Affects Versions: 0.92.2, 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: HBASE-6276-0.94.patch, HBASE-6276.patch
[jira] [Commented] (HBASE-6170) Timeouts for row lock and scan should be separate
[ https://issues.apache.org/jira/browse/HBASE-6170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401721#comment-13401721 ] Zhihong Ted Yu commented on HBASE-6170: --- Minor comment: {code} + * The lease timeout period for client scans (milliseconds). + */ + private final int scannerLeaseTimeoutPeriod; {code} 'client scans' - 'client scanners' {code} -this.leases = new Leases((int) conf.getLong( +this.leases = new Leases(conf.getInt( {code} I would suggest changing the getInt() calls back to getLong(). Timeouts for row lock and scan should be separate - Key: HBASE-6170 URL: https://issues.apache.org/jira/browse/HBASE-6170 Project: HBase Issue Type: Improvement Components: regionserver Affects Versions: 0.94.0 Reporter: Otis Gospodnetic Assignee: Chris Trezzo Priority: Minor Fix For: 0.96.0 Attachments: HBASE-6170v1.patch, HBASE-6170v1.patch, HBASE-6170v1.patch Apparently the timeout used for row locking and for scanning is global. It would be better to have two separate timeouts. (opening the issue to make Lars George happy)
[jira] [Commented] (HBASE-6276) TestClassLoading is racy
[ https://issues.apache.org/jira/browse/HBASE-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13401732#comment-13401732 ] Zhihong Ted Yu commented on HBASE-6276: --- I ran TestClassLoading twice in trunk. I got the following failure twice: {code} testClassLoadingFromLocalFS(org.apache.hadoop.hbase.coprocessor.TestClassLoading) Time elapsed: 0.126 sec ERROR! org.apache.hadoop.hbase.TableExistsException: org.apache.hadoop.hbase.TableExistsException: TestClassLoading at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27) at java.lang.reflect.Constructor.newInstance(Constructor.java:513) at org.apache.hadoop.ipc.RemoteException.instantiateException(RemoteException.java:95) at org.apache.hadoop.ipc.RemoteException.unwrapRemoteException(RemoteException.java:79) at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:165) at $Proxy21.createTable(Unknown Source) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$MasterHandler.invoke(HConnectionManager.java:1565) at org.apache.hadoop.hbase.client.$Proxy22.createTable(Unknown Source) at org.apache.hadoop.hbase.client.HBaseAdmin$2.call(HBaseAdmin.java:512) at org.apache.hadoop.hbase.client.HBaseAdmin$2.call(HBaseAdmin.java:508) at org.apache.hadoop.hbase.client.HBaseAdmin.execute(HBaseAdmin.java:1983) at org.apache.hadoop.hbase.client.HBaseAdmin.createTableAsync(HBaseAdmin.java:508) at 
org.apache.hadoop.hbase.client.HBaseAdmin.createTable(HBaseAdmin.java:411) at org.apache.hadoop.hbase.client.HBaseAdmin.createTable(HBaseAdmin.java:347) at org.apache.hadoop.hbase.coprocessor.TestClassLoading.testClassLoadingFromLocalFS(TestClassLoading.java:284) {code} TestClassLoading is racy Key: HBASE-6276 URL: https://issues.apache.org/jira/browse/HBASE-6276 Project: HBase Issue Type: Bug Components: coprocessors, test Affects Versions: 0.92.2, 0.96.0, 0.94.1 Reporter: Andrew Purtell Assignee: Andrew Purtell Priority: Minor Attachments: HBASE-6276-0.94.patch, HBASE-6276.patch
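The stack trace shows the test failing because a table named after the test class ("TestClassLoading") already exists. The committed patch is described only as trivial, so the sketch below is not the actual HBASE-6276 change; it is just one common pattern for removing this kind of TableExistsException race in tests: derive a unique table name per test invocation instead of reusing a shared name. The helper and naming scheme are hypothetical.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical helper: unique per-invocation table names, so a leftover
// table from a previous test or run cannot trigger TableExistsException
// on the next createTable() call.
class UniqueTableNameSketch {
    private static final AtomicInteger COUNTER = new AtomicInteger();

    // e.g. "TestClassLoading_testClassLoadingFromLocalFS_0"
    static String tableNameFor(String testClass, String testMethod) {
        return testClass + "_" + testMethod + "_" + COUNTER.getAndIncrement();
    }

    public static void main(String[] args) {
        String first = tableNameFor("TestClassLoading", "testClassLoadingFromLocalFS");
        String second = tableNameFor("TestClassLoading", "testClassLoadingFromLocalFS");
        // Repeated runs of the same test now target distinct tables.
        System.out.println(first.equals(second)); // false
    }
}
```

The alternative, dropping the table in teardown, still races if a previous JVM crashed before cleanup ran; unique names sidestep that entirely.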
[jira] [Created] (HBASE-6277) Metrics for scan object are overwritten when restart() is called
Zhihong Ted Yu created HBASE-6277: - Summary: Metrics for scan object are overwritten when restart() is called Key: HBASE-6277 URL: https://issues.apache.org/jira/browse/HBASE-6277 Project: HBase Issue Type: Bug Reporter: Zhihong Ted Yu From HBASE-4145: There's an issue in {{TableRecordReaderImpl}}. Calling restart() does this: {code} public void restart(byte[] firstRow) throws IOException { currentScan = new Scan(scan); {code} That by itself is fine, since the metrics will be copied from *scan* to *currentScan*, except that it's *currentScan* that has the updated metrics, not *scan*. In other words, *currentScan* is the object that is used for scanning, so it contains the metrics. If restart() is called, that object is overwritten by the original definition of the {{Scan}}. I think to fix this we could grab the metrics from *currentScan* first and then set them back on the new object.
[jira] [Commented] (HBASE-4145) Provide metrics for hbase client
[ https://issues.apache.org/jira/browse/HBASE-4145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401734#comment-13401734 ] Zhihong Ted Yu commented on HBASE-4145: --- @J-D: HBASE-6277 has been created to address your finding. Provide metrics for hbase client Key: HBASE-4145 URL: https://issues.apache.org/jira/browse/HBASE-4145 Project: HBase Issue Type: Improvement Reporter: Ming Ma Assignee: Ming Ma Fix For: 0.94.0 Attachments: HBaseClientSideMetrics.jpg Sometimes it is useful to get some metrics from the HBase client's point of view. This will help understand the metrics for the scan/TableInputFormat map job scenario. What to capture, for example, for each ResultScanner object: 1. The number of RPC calls to RSs. 2. The delta time between consecutive RPC calls in the current serialized scan implementation. 3. The number of RPC retries to RSs. 4. The number of NotServingRegionExceptions received. 5. The number of remote RPC calls. This excludes calls where the HBase client calls the RS on the same machine. 6. The number of regions accessed. How to capture: 1. The metrics framework works for a fixed number of metrics. It doesn't fit this scenario. 2. Use some TBD solution in HBase to capture such dynamic metrics. If we assume there is a solution in HBase that the HBase client can use to log this kind of metrics, TableInputFormat can pass the MapReduce task ID to the HBase client as an application scan ID, a small addition to the existing scan API, and the HBase client can log metrics tagged with that ID. That would allow later querying and analysis of the metrics data for a specific MapReduce job. 3. Expose via MapReduce counters. This lacks certain features; for example, there is no good way to access the metrics on a per-map-instance basis, and the MapReduce framework only sums the counter values, so it is tricky to find the max of certain metrics across all mapper instances. However, it might be good enough for now. 
With this approach, the metrics values will be available via MapReduce counters. a) Have ResultScanner return a new ResultScannerMetrics interface. b) TableInputFormat will access data from ResultScannerMetrics and populate MapReduce counters accordingly.