[
https://issues.apache.org/jira/browse/HBASE-14054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14694051#comment-14694051
]
Hadoop QA commented on HBASE-14054:
-----------------------------------
{color:red}-1 overall{color}. Here are the results of testing the latest
attachment
http://issues.apache.org/jira/secure/attachment/12750112/hbase-14054_v1.patch
against master branch at commit 72a48a1333f6c01c46cd244439198ccce3f941ac.
ATTACHMENT ID: 12750112
{color:green}+1 @author{color}. The patch does not contain any @author
tags.
{color:green}+1 tests included{color}. The patch appears to include 3 new
or modified tests.
{color:green}+1 hadoop versions{color}. The patch compiles with all
supported hadoop versions (2.4.0 2.4.1 2.5.0 2.5.1 2.5.2 2.6.0 2.7.0)
{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.
{color:green}+1 protoc{color}. The applied patch does not increase the
total number of protoc compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any
warning messages.
{color:green}+1 checkstyle{color}. The applied patch does not increase the
total number of checkstyle errors
{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 2.0.3) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.
{color:green}+1 lineLengths{color}. The patch does not introduce lines
longer than 100
{color:red}-1 site{color}. The patch appears to cause mvn post-site goal
to fail.
{color:red}-1 core tests{color}. The patch failed these unit tests:
org.apache.hadoop.hbase.util.TestProcessBasedCluster
org.apache.hadoop.hbase.mapreduce.TestImportExport
Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/15070//testReport/
Release Findbugs (version 2.0.3) warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/15070//artifact/patchprocess/newFindbugsWarnings.html
Checkstyle Errors:
https://builds.apache.org/job/PreCommit-HBASE-Build/15070//artifact/patchprocess/checkstyle-aggregate.html
Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/15070//console
This message is automatically generated.
> Acknowledged writes may get lost if regionserver clock is set backwards
> -----------------------------------------------------------------------
>
> Key: HBASE-14054
> URL: https://issues.apache.org/jira/browse/HBASE-14054
> Project: HBase
> Issue Type: Bug
> Components: regionserver
> Affects Versions: 0.98.6
> Environment: Linux
> Reporter: Tobi Vollebregt
> Assignee: Enis Soztutar
> Fix For: 2.0.0, 0.98.14, 1.0.2, 1.2.0, 1.1.2, 1.3.0
>
> Attachments: hbase-14054_v1.patch
>
>
> We experience a small amount of lost acknowledged writes in production on
> July 1st (~700 identified so far).
> What happened was that we had NTP turned off since June 29th to prevent
> issues due to the leap second on June 30th. NTP was turned back on July 1st.
> The next day, we noticed we were missing writes to a few of our higher
> throughput aggregation tables.
> We found that this is caused by HBase taking the current time using
> System.currentTimeMillis, which may be set backwards by NTP, and using this
> without any checks to populate the timestamp of rows for which the client
> didn't supply a timestamp.
> Our application uses a read-modify-write pattern using get+checkAndPut to
> perform aggregation as follows:
> 1. read version 1
> 2. mutate
> 3. write version 2
> 4. read version 2
> 5. mutate
> 6. write version 3
> The application retries the full read-modify-write if the checkAndPut fails.
> What must have happened on July 1st, after we started NTP back up, was this
> (timestamps added):
> 1. read version 1 (timestamp 10)
> 2. mutate
> 3. write version 2 (HBase-assigned timestamp 11)
> 4. read version 2 (timestamp 11)
> 5. mutate
> 6. write version 3 (HBase-assigned timestamp 10)
> Hence, the last write was eclipsed by the first write, and hence, an
> acknowledged write was lost.
> While this seems to match documented behavior (paraphrasing: "if timestamp is
> not specified HBase will assign a timestamp using System.currentTimeMillis"
> "the row with the highest timestamp will be returned by get"), I think it is
> very unintuitive and needs at least a big warning in the documentation, along
> the lines of "Acknowledged writes may not be visible unless the timestamp is
> explicitly specified and equal to or larger than the highest timestamp for
> that row".
> I would also like to use this ticket to start a discussion on if we can make
> the behavior better:
> Could HBase assign a timestamp of {{max(max timestamp for the row,
> System.currentTimeMillis())}} in the checkAndPut write path, instead of
> blindly taking {{System.currentTimeMillis()}}, similar to what has been done
> in HBASE-12449 for increment and append?
> Thoughts?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)