[ 
https://issues.apache.org/jira/browse/ACCUMULO-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16280853#comment-16280853
 ] 

Adam J Shook edited comment on ACCUMULO-4751 at 12/6/17 8:34 PM:
-----------------------------------------------------------------

I have attached some logs tracking a particular WAL file.  You can see that it 
has a {{createdTime}} but at some point a deleting entry must be written (note 
the timestamp change but the {{createdTime}} is gone) and then other entries 
added.

And some other interesting messages back-to-back:
{code}
2017-12-06 19:55:37,712 [replication.StatusCombiner] TRACE: Returned single 
value: 
~replhdfs://namenode:9000/accumulo/wal/tserver+31761/140223d6-30bd-41ae-a96d-d8af9884f85c
 stat:114 [] 14898338 false [begin: 0 end: 0 infiniteEnd: true closed: true 
createdTime: 1512589530002]
2017-12-06 19:55:37,712 [replication.StatusCombiner] TRACE: Returned single 
value: 
~replhdfs://namenode:9000/accumulo/wal/tserver+31761/140223d6-30bd-41ae-a96d-d8af9884f85c
 stat:12l [] 14898372 false [begin: 0 end: 0 infiniteEnd: true closed: false]
{code}


was (Author: adamjshook):
I have attached some logs tracking a particular WAL file.  You can see that it 
has a {{createdTime}} but at some point a deleting entry must be written (note 
the timestamp change but the {{createdTime}} is gone) and then other entries 
added.

And some other interesting messages back-to-back:
{code}
2017-12-06 19:55:37,712 [replication.StatusCombiner] TRACE: Returned single 
value: 
~replhdfs://dev-ob-Cluster/accumulo/wal/dob1-bvlt-r2n05.bloomberg.com+31761/140223d6-30bd-41ae-a96d-d8af9884f85c
 stat:114 [] 14898338 false [begin: 0 end: 0 infiniteEnd: true closed: true 
createdTime: 1512589530002]
2017-12-06 19:55:37,712 [replication.StatusCombiner] TRACE: Returned single 
value: 
~replhdfs://dev-ob-Cluster/accumulo/wal/dob1-bvlt-r2n05.bloomberg.com+31761/140223d6-30bd-41ae-a96d-d8af9884f85c
 stat:12l [] 14898372 false [begin: 0 end: 0 infiniteEnd: true closed: false]
{code}

> Some WALs don't replicate due to lacking a createdTime entry
> ------------------------------------------------------------
>
>                 Key: ACCUMULO-4751
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4751
>             Project: Accumulo
>          Issue Type: Bug
>    Affects Versions: 1.7.3, 1.8.1
>            Reporter: Adam J Shook
>            Assignee: Adam J Shook
>         Attachments: repl_logs.txt
>
>
> From what I can tell, the below error is thrown when no data for a particular 
> table is written to a WAL, but the file is closed.  This would be because the 
> {{Status}} entry from the {{StatusUtil}} for {{fileClosed}} is pre-built and 
> therefore does not have a {{createdTime}}.  This prevents a WAL from being 
> replicated until a {{createdTime}} entry is added manually.
> From the Accumulo master:
> {code}
> Status record ([begin: 0 end: 0 infiniteEnd: true closed: true]) for 
> hdfs://namenode:9000/accumulo/wal/tserver.example.com+31732/f922df9c-3ffc-49ee-8d0c-261c7a05fea2
>  in table 7l was written to metadata table which lacked createdTime
> {code}
> There are two solutions I have in mind:
> 1. Update the {{StatusUtil}} such that every returned {{Status}} object sets 
> the {{createdTime}} to {{System.currentTimeMillis}} if not explicitly given.
> 2. Update the Accumulo Master to set the {{createdTime}} to the WAL's 
> modification time in HDFS if the WAL is closed but there is no 
> {{createdTime}}.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to