date:20161025

[jira] [Commented] (HBASE-16436) Add the CellChunkMap variant

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607608#comment-15607608
 ] 

ramkrishna.s.vasudevan commented on HBASE-16436:


That second way of doing things will be much simpler I believe. We can have 
control on the whole segment with one boolean and decide. 
So suppose we have 3 segments and one of it has a bigger cell then I think it 
still makes sense to index the other two segments in the chunk way and leave 
the one with bigger cell to cellArray map way of indexing and things should 
work for scans? I need to verify this part first.

> Add the CellChunkMap variant
> 
>
> Key: HBASE-16436
> URL: https://issues.apache.org/jira/browse/HBASE-16436
> Project: HBase
>  Issue Type: Sub-task
>Reporter: ramkrishna.s.vasudevan
>
> This sub-task is specifically to add the CellChunkMap created by [~anastas] 
> and [~eshcar] with specific tests and integrate it with the in memory 
> flush/compaction flow. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16436) Add the CellChunkMap variant

2016-10-25 Thread Anoop Sam John (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607579#comment-15607579
 ] 

Anoop Sam John commented on HBASE-16436:


Till it reaches the memstore add point, the big cell data is in the BB from the 
Rpc pool. But when we add it to memstore, we will do an on heap copy if we can 
not copy to MSLAB. So we are good here.

Ya because of 1 big cell in 1M cells not allowing us go off heap fully is not 
so good :-(  My feeling is that the latter will be much more simple than 1st 
way.  Ya let us try out boths. [~ram_krish]  U are ok to do this poc?  No hurry.

> Add the CellChunkMap variant
> 
>
> Key: HBASE-16436
> URL: https://issues.apache.org/jira/browse/HBASE-16436
> Project: HBase
>  Issue Type: Sub-task
>Reporter: ramkrishna.s.vasudevan
>
> This sub-task is specifically to add the CellChunkMap created by [~anastas] 
> and [~eshcar] with specific tests and integrate it with the in memory 
> flush/compaction flow. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16438) Create a cell type so that chunk id is embedded in it

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607575#comment-15607575
 ] 

stack commented on HBASE-16438:
---

Thanks Ram. So why we write the chunkid back into the Cell? How would that 
help? Thanks.

> Create a cell type so that chunk id is embedded in it
> -
>
> Key: HBASE-16438
> URL: https://issues.apache.org/jira/browse/HBASE-16438
> Project: HBase
>  Issue Type: Sub-task
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>
> For CellChunkMap we may need a cell such that the chunk out of which it was 
> created, the id of the chunk be embedded in it so that when doing flattening 
> we can use the chunk id as a meta data. More details will follow once the 
> initial tasks are completed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16436) Add the CellChunkMap variant

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607562#comment-15607562
 ] 

stack commented on HBASE-16436:
---

[~anoop.hbase] Thanks for the better explanation. Will the too-big Cell be 
occupying a BB from the RPC Pool preventing our releasing it? Or will it have 
already have been allocated on the heap? If the latter good. We just need a 
hash of fake chunkid to this Cell on-heap. 

Ok if the fake chunk id solves two problems.

Yes to both ways. On latter, would be a shame if we couldn't go offheap because 
1 in 10M Cells is bigger than a SLAB.

> Add the CellChunkMap variant
> 
>
> Key: HBASE-16436
> URL: https://issues.apache.org/jira/browse/HBASE-16436
> Project: HBase
>  Issue Type: Sub-task
>Reporter: ramkrishna.s.vasudevan
>
> This sub-task is specifically to add the CellChunkMap created by [~anastas] 
> and [~eshcar] with specific tests and integrate it with the in memory 
> flush/compaction flow. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607532#comment-15607532
 ] 

stack commented on HBASE-16890:
---

Let me try the sloppy way first. Maybe the synchronizes are not the problem 
[~ram_krish]

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607512#comment-15607512
 ] 

ramkrishna.s.vasudevan commented on HBASE-16890:


[~saint@gmail.com]
Can I check the disruptor way to reduce synchronization overhead? If am not 
able to get to it I shall tell you.
Just telling before you put in your time in that?

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16438) Create a cell type so that chunk id is embedded in it

2016-10-25 Thread Anoop Sam John (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607511#comment-15607511
 ] 

Anoop Sam John commented on HBASE-16438:


In CellChunkMap, we will get rid of Cell objects and will keep only some index 
data.  So this index data has to tell us in which byte[]/BB/Chunk the cell data 
resides and in which offset and what is its total length. The latter 2 are 
ints. The 1st one is a pointer. Again we will need to keep it on heap. Where 
what the solution proposes is to keep an id of the chunk where this cell 
resides instead of keeping a java ref to this chunk object.  That is why the 
chunk id is imp now.  Now normal cells will have only ref to chunks. So when an 
in memory flush happens and we make the chunk map, we dont know the index/id of 
this chunk.  That is why we were saying that we can keep the chunk id int 
within the cell object.

> Create a cell type so that chunk id is embedded in it
> -
>
> Key: HBASE-16438
> URL: https://issues.apache.org/jira/browse/HBASE-16438
> Project: HBase
>  Issue Type: Sub-task
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>
> For CellChunkMap we may need a cell such that the chunk out of which it was 
> created, the id of the chunk be embedded in it so that when doing flattening 
> we can use the chunk id as a meta data. More details will follow once the 
> initial tasks are completed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607505#comment-15607505
 ] 

stack commented on HBASE-16890:
---

Let me try a [~Apache9] suggestion of a version that is sloppy synchronizing 
and see if we get the expected speedup. I'll be back.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16436) Add the CellChunkMap variant

2016-10-25 Thread Anoop Sam John (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607503#comment-15607503
 ] 

Anoop Sam John commented on HBASE-16436:


You mean to say when the cell size does not allow it to be copied to an MSLAB 
chunk (from pool) let us have the cell be there as such (we will copy the data 
then as till it reaches memstore level, the cells data is backed by the same 
byte[] where the RPCServer copied the req bytes into) and that cell's backing 
buffer we will make as a fake chunk?  We will keep it temp with a temp chunk id 
and once the life of the cell is over, flushed, we will throw away that chunk?  
Hmm seems some thing we can try out.
MSLAB pool default ON am not sure whether that jira got committed already. Ya 
we can do.
There is one more issue.  [~ram_krish] reminded me some days back. When a cell 
is upserted (append/increment op), we will not copy that cell to MSLAB.  So in 
such a case also we may need to go with this way of temp chunks with temp ids.

Another idea I can think of is as follows
Let us keep the info that whether a Segment contains a cell which is not copied 
to MSLAB.  A boolean like 'tagsPresent'.  When we do in memory flush we will do 
CellArrayMap if any of the  cell is NOT in MSLAB.  If all in MSLAB, we will do 
chunk map.  Also when an in memory merge or data compaction happens, then also 
similar way.  When any of the segment undergoing merge is having a cell not in 
MSLAB, lets go with cell array map instead of cell chunkmap. May be we need 
some logic there where we can avoid segments with cells not in MSLAB from 
getting merged with segmnets whose all cells in MSALB only.  Chance is that 
when a segment is having cells not in MSLAB, other segments also will follow 
same pattern (99% I can say)..  Need evaluation of both ways anyway.  Just some 
thoughts

> Add the CellChunkMap variant
> 
>
> Key: HBASE-16436
> URL: https://issues.apache.org/jira/browse/HBASE-16436
> Project: HBase
>  Issue Type: Sub-task
>Reporter: ramkrishna.s.vasudevan
>
> This sub-task is specifically to add the CellChunkMap created by [~anastas] 
> and [~eshcar] with specific tests and integrate it with the in memory 
> flush/compaction flow. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16438) Create a cell type so that chunk id is embedded in it

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607499#comment-15607499
 ] 

ramkrishna.s.vasudevan commented on HBASE-16438:


Thanks for your comments. The case I think of is this.
We do in memory flushes now. So every in memory flush will create a segment.
The cells are located in this segment. When MSLAB is ON and chunk pool is ON we 
create a chunk from the pool and that is where the cells are added. 
Now this segment is of type CSLM each entry pointing to the chunk created from 
the chunk pool. 
When I try to create an index out of this I don't add the chunk where the cell 
is added. Instead I just add the chunkid (assume when we ask for a chunk from 
the pool it gives out an id) and the cell's offset and length.
Why chunk id is important is because again say there is one more in memory 
flush and we create one more segment and the old one is pushed to the pipeline. 
So we convert this segment from CSLM to chunk based. Now in my index array 
since I have another segment added I  need to clearly know for which segment 
was this chunk created because the cell's offset and length could be same and 
there is no unique way to distinguish.
In the CellArrayMap impl there is no such need because just that they create an 
array of cells and every index in the array points to the cell referenced 
inside the chunks. We don't need any chunkid there. Because it points to actual 
cells. But in CellChunkIndex we don't point to actual cells.
Let me know if I am writing it clearly if not can rephrase again. Thanks.

> Create a cell type so that chunk id is embedded in it
> -
>
> Key: HBASE-16438
> URL: https://issues.apache.org/jira/browse/HBASE-16438
> Project: HBase
>  Issue Type: Sub-task
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>
> For CellChunkMap we may need a cell such that the chunk out of which it was 
> created, the id of the chunk be embedded in it so that when doing flattening 
> we can use the chunk id as a meta data. More details will follow once the 
> initial tasks are completed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16932) Implement small scan

2016-10-25 Thread Duo Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang updated HBASE-16932:
--
Attachment: HBASE-16932-v1.patch

Missed the new UT.

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932-v1.patch, HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16783) Use ByteBufferPool for the header and message during Rpc response

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-16783:
---
Attachment: HBASE-16783_7.patch

REtry QA.

> Use ByteBufferPool for the header and message during Rpc response
> -
>
> Key: HBASE-16783
> URL: https://issues.apache.org/jira/browse/HBASE-16783
> Project: HBase
>  Issue Type: Improvement
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>Priority: Minor
> Attachments: HBASE-16783.patch, HBASE-16783_1.patch, 
> HBASE-16783_2.patch, HBASE-16783_3.patch, HBASE-16783_4.patch, 
> HBASE-16783_5.patch, HBASE-16783_6.patch, HBASE-16783_7.patch, 
> HBASE-16783_7.patch
>
>
> With ByteBufferPool in place we could avoid the byte[] creation in 
> RpcServer#createHeaderAndMessageBytes and try using the Buffer from the pool 
> rather than creating byte[] every time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16783) Use ByteBufferPool for the header and message during Rpc response

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-16783:
---
Status: Patch Available  (was: Open)

> Use ByteBufferPool for the header and message during Rpc response
> -
>
> Key: HBASE-16783
> URL: https://issues.apache.org/jira/browse/HBASE-16783
> Project: HBase
>  Issue Type: Improvement
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>Priority: Minor
> Attachments: HBASE-16783.patch, HBASE-16783_1.patch, 
> HBASE-16783_2.patch, HBASE-16783_3.patch, HBASE-16783_4.patch, 
> HBASE-16783_5.patch, HBASE-16783_6.patch, HBASE-16783_7.patch, 
> HBASE-16783_7.patch
>
>
> With ByteBufferPool in place we could avoid the byte[] creation in 
> RpcServer#createHeaderAndMessageBytes and try using the Buffer from the pool 
> rather than creating byte[] every time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16783) Use ByteBufferPool for the header and message during Rpc response

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ramkrishna.s.vasudevan updated HBASE-16783:
---
Status: Open  (was: Patch Available)

> Use ByteBufferPool for the header and message during Rpc response
> -
>
> Key: HBASE-16783
> URL: https://issues.apache.org/jira/browse/HBASE-16783
> Project: HBase
>  Issue Type: Improvement
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>Priority: Minor
> Attachments: HBASE-16783.patch, HBASE-16783_1.patch, 
> HBASE-16783_2.patch, HBASE-16783_3.patch, HBASE-16783_4.patch, 
> HBASE-16783_5.patch, HBASE-16783_6.patch, HBASE-16783_7.patch, 
> HBASE-16783_7.patch
>
>
> With ByteBufferPool in place we could avoid the byte[] creation in 
> RpcServer#createHeaderAndMessageBytes and try using the Buffer from the pool 
> rather than creating byte[] every time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16932) Implement small scan

2016-10-25 Thread Duo Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang updated HBASE-16932:
--
Attachment: (was: HBASE-16932-v1.patch)

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607473#comment-15607473
 ] 

ramkrishna.s.vasudevan commented on HBASE-16890:


bq.The disruptor consumer thread would feed the eventloop? Could. Another 
thought was just collect the appends and not assign the trans id until we are 
going to sync since that is when we add the edit to the WAL and given that the 
write path is different in master – we don't update memstore until after sync 
has come home – then this could work.
For doing the second one then it means we can't create a FSWALEntry till we do 
the sync and we need to carry the edits upto the sync part. 
The first one I think is doable. As I said since we don't go with disruptor we 
need a sequence generator and since 'waitingConsumePayloads' acts as the 
critical section for the consumer - we need to do a sync for getting unique 
txids. Now may be we can add a disruptor and onEvent of the disruptor we can 
hand over to the eventLoop?
But there is context switch here and one thread handing over to the other. I 
can work on these two. 
So we can first check the disruptor way or the append way?
Thanks Stack for your tests. Even myy JFR report had the contention issue but I 
felt that sync and CRC update was *also* taking more time than the FSHLog 
(classic) case. 

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16932) Implement small scan

2016-10-25 Thread Duo Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang updated HBASE-16932:
--
Attachment: HBASE-16932-v1.patch

Fix the javadoc and findbugs warnings.

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932-v1.patch, HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Resolved] (HBASE-13272) Get.setClosestRowBefore() breaks specific column Get

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-13272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-13272.
---
Resolution: Won't Fix

getClosestRowOrBefore got removed by HBASE-13954 after @deprecation cycle.

> Get.setClosestRowBefore() breaks specific column Get
> 
>
> Key: HBASE-13272
> URL: https://issues.apache.org/jira/browse/HBASE-13272
> Project: HBase
>  Issue Type: Bug
>Reporter: stack
>Priority: Trivial
>
> Via [~larsgeorge]
> Get.setClosestRowBefore() is breaking a specific Get that specifies a column. 
> If you set the latter to "true" it will return the _entire_ row!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16436) Add the CellChunkMap variant

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607391#comment-15607391
 ] 

stack commented on HBASE-16436:
---

Hard question by [~anoop.hbase] ported from HBASE-16421

bq. The CellChunkMap variant needs cell data to be available in MSLAB chunks. 
So when MSLAB is OFF, we will not be using this. What abt the size restriction? 
In MSLAB there is a max size for Cells and above which cells will not be 
cloned. We might nor be able to handle such a case. Thoughts?

MSLAB is on always in master branch, right? If not, lets make it so by fiat in 
2.0.

On size restrtiction, current default for mslab is 2MB. Seems small. For 
smallest possible Cell (32bytes?), thats 1M Cells. Not that much, especially 
when we are not looking them up via CSLS but via a nice flat index.  Go to 8MB 
slabs?  16M?

If someone wants to write a 64M Cell into HBase, should we even let them? It is 
going to mess us (GC, etc.) up. We have a bound on max size currently. 
Reexamine?

What is the lifecycle of a SLAB? Can we set aside negative chunkids to mean 
onheap and these chunks would have one object in them only? (Might have to have 
a globally accessible means of mapping negative chunkid to an object).

> Add the CellChunkMap variant
> 
>
> Key: HBASE-16436
> URL: https://issues.apache.org/jira/browse/HBASE-16436
> Project: HBase
>  Issue Type: Sub-task
>Reporter: ramkrishna.s.vasudevan
>
> This sub-task is specifically to add the CellChunkMap created by [~anastas] 
> and [~eshcar] with specific tests and integrate it with the in memory 
> flush/compaction flow. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16438) Create a cell type so that chunk id is embedded in it

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607377#comment-15607377
 ] 

stack commented on HBASE-16438:
---

I don't see where you say above why we need to write into the cell the chunkid 
it is associate with? Why do we have to? The index will have the chunk to find 
a Cell in. The Cell will not move, not unless we do a data merge. Then, a new 
index will be made to go with the new merged chunk. This index will point to 
the new location. At what point do we ever have to write the chunk id into the 
Cell? Thanks Ram.

> Create a cell type so that chunk id is embedded in it
> -
>
> Key: HBASE-16438
> URL: https://issues.apache.org/jira/browse/HBASE-16438
> Project: HBase
>  Issue Type: Sub-task
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>
> For CellChunkMap we may need a cell such that the chunk out of which it was 
> created, the id of the chunk be embedded in it so that when doing flattening 
> we can use the chunk id as a meta data. More details will follow once the 
> initial tasks are completed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16944) Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607352#comment-15607352
 ] 

Hudson commented on HBASE-16944:


SUCCESS: Integrated in Jenkins build HBase-Trunk_matrix #1855 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/1855/])
HBASE-16944 Fix findbugs warning in hbase-server HRegion (stack: rev 
bbe88d942ec4ac673cdaf266d4bf7e15d2b334d8)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java


> Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK
> ---
>
> Key: HBASE-16944
> URL: https://issues.apache.org/jira/browse/HBASE-16944
> Project: HBase
>  Issue Type: Bug
>  Components: findbugs
>Reporter: stack
>
> Fix the annoying long-lived UL_UNRELEASED_LOCK warning out of line 3251 in 
> HRegion.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16421) Introducing the CellChunkMap as a new additional index variant in the MemStore

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607350#comment-15607350
 ] 

stack commented on HBASE-16421:
---

Copied this, the question we need to answer, over to HBASE-16436

> Introducing the CellChunkMap as a new additional index variant in the MemStore
> --
>
> Key: HBASE-16421
> URL: https://issues.apache.org/jira/browse/HBASE-16421
> Project: HBase
>  Issue Type: Umbrella
>Reporter: Anastasia Braginsky
>
> Follow up for HBASE-14921. This is going to be the umbrella JIRA to include 
> all the parts of integration of the CellChunkMap to the MemStore.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16000) Table#checkAndPut() docs are too vague

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16000?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607348#comment-15607348
 ] 

Hudson commented on HBASE-16000:


SUCCESS: Integrated in Jenkins build HBase-Trunk_matrix #1855 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/1855/])
HBASE-16000 Table#checkAndPut() docs are too vague (Csaba Skrabak) This (stack: 
rev 8e2d410195b1797e2d7e647db431e4fde98aeaba)
* (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/Table.java


> Table#checkAndPut() docs are too vague
> --
>
> Key: HBASE-16000
> URL: https://issues.apache.org/jira/browse/HBASE-16000
> Project: HBase
>  Issue Type: Bug
>  Components: documentation
>Reporter: Nick Dimiduk
>Assignee: Csaba Skrabak
>Priority: Minor
>  Labels: beginner, beginners
> Fix For: 2.0.0
>
> Attachments: hbase-16000.patch
>
>
> The docs for our CAS operations that accept a {{CompareOp}} are too vague. 
> They're copied from the part that implies {{CompareOp.EQUAL}}, which means 
> they don't tell you how your op, the provided {{value}}, and any present 
> value are compared.
> Only by digging into HRegion.java can I see that the operation is, for 
> example with CompareOp.GREATER, we get {{value.compareTo(existingKeyValue) > 
> 0}}. Javadocs should be explicit about the order of operands.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-1600) Multiple HBase clients to multiple indpendent HBase clusters from the same jvm should be allowed

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607351#comment-15607351
 ] 

Hudson commented on HBASE-1600:
---

SUCCESS: Integrated in Jenkins build HBase-Trunk_matrix #1855 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/1855/])
Revert "HBASE-1600 Table#checkAndPut() docs are too vague (Csaba (stack: rev 
9c6d123b224f63d8456680dcc5c701d04954634e)
* (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/Table.java
HBASE-16000 Table#checkAndPut() docs are too vague (Csaba Skrabak) This (stack: 
rev 8e2d410195b1797e2d7e647db431e4fde98aeaba)
* (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/Table.java


> Multiple HBase clients to multiple indpendent HBase clusters from the same 
> jvm should be allowed
> 
>
> Key: HBASE-1600
> URL: https://issues.apache.org/jira/browse/HBASE-1600
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.20.0
>Reporter: Chris K Wensel
>Assignee: Nitay Joffe
>Priority: Blocker
>
> ZK quorum servers list is static in ZooKeeperWrapper and is coupled to 
> zoo.cfg. 
> This prevents multiple clients from connecting to multiple distinct HBase 
> clusters and requires zoo.cfg to exist on a cluster that needs to connect to 
> a remote HBase cluster.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16932) Implement small scan

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607322#comment-15607322
 ] 

stack commented on HBASE-16932:
---

[~Apache9] This just went in and should fix the old findbugs issue:

commit bbe88d942ec4ac673cdaf266d4bf7e15d2b334d8
Author: Michael Stack 
Date:   Tue Oct 25 14:44:11 2016 -0700

HBASE-16944 Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16944) Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK

2016-10-25 Thread Yu Li (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607281#comment-15607281
 ] 

Yu Li commented on HBASE-16944:
---

Thanks for fixing this sir [~stack]

> Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK
> ---
>
> Key: HBASE-16944
> URL: https://issues.apache.org/jira/browse/HBASE-16944
> Project: HBase
>  Issue Type: Bug
>  Components: findbugs
>Reporter: stack
>
> Fix the annoying long-lived UL_UNRELEASED_LOCK warning out of line 3251 in 
> HRegion.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16931) Setting cell's seqId to zero in compaction flow might cause RS down.

2016-10-25 Thread Yu Li (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607278#comment-15607278
 ] 

Yu Li commented on HBASE-16931:
---

Skimmed log of failed post-commit check for trunk/branch-1/branch-1.1 and 
confirmed all caused by known fake case or timed out, no regression caused by 
change here, FWIW.

> Setting cell's seqId to zero in compaction flow might cause RS down.
> 
>
> Key: HBASE-16931
> URL: https://issues.apache.org/jira/browse/HBASE-16931
> Project: HBase
>  Issue Type: Bug
>  Components: regionserver
>Affects Versions: 2.0.0
>Reporter: binlijin
>Assignee: binlijin
>Priority: Critical
> Fix For: 2.0.0, 1.4.0, 1.2.4, 1.1.8
>
> Attachments: HBASE-16931-master.patch, HBASE-16931.branch-1.patch, 
> HBASE-16931.branch-1.v2.patch, HBASE-16931_master_v2.patch, 
> HBASE-16931_master_v3.patch, HBASE-16931_master_v4.patch, 
> HBASE-16931_master_v5.patch
>
>
> Compactor#performCompaction
>   do {
> hasMore = scanner.next(cells, scannerContext);
> // output to writer:
> for (Cell c : cells) {
>   if (cleanSeqId && c.getSequenceId() <= smallestReadPoint) {
> CellUtil.setSequenceId(c, 0);
>   }
>   writer.append(c);
> }
> cells.clear();
>   } while (hasMore);
> scanner.next will choose at most "hbase.hstore.compaction.kv.max" kvs, the 
> last cell still reference by StoreScanner.prevCell, so if cleanSeqId is 
> called when the scanner.next call StoreScanner.checkScanOrder may throw 
> exception and cause regionserver down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16931) Setting cell's seqId to zero in compaction flow might cause RS down.

2016-10-25 Thread Yu Li (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607269#comment-15607269
 ] 

Yu Li commented on HBASE-16931:
---

bq. The choice is between this going into 1.3.0 or waiting for 1.3.1, right?
Yes sir. [~busbey]

> Setting cell's seqId to zero in compaction flow might cause RS down.
> 
>
> Key: HBASE-16931
> URL: https://issues.apache.org/jira/browse/HBASE-16931
> Project: HBase
>  Issue Type: Bug
>  Components: regionserver
>Affects Versions: 2.0.0
>Reporter: binlijin
>Assignee: binlijin
>Priority: Critical
> Fix For: 2.0.0, 1.4.0, 1.2.4, 1.1.8
>
> Attachments: HBASE-16931-master.patch, HBASE-16931.branch-1.patch, 
> HBASE-16931.branch-1.v2.patch, HBASE-16931_master_v2.patch, 
> HBASE-16931_master_v3.patch, HBASE-16931_master_v4.patch, 
> HBASE-16931_master_v5.patch
>
>
> Compactor#performCompaction
>   do {
> hasMore = scanner.next(cells, scannerContext);
> // output to writer:
> for (Cell c : cells) {
>   if (cleanSeqId && c.getSequenceId() <= smallestReadPoint) {
> CellUtil.setSequenceId(c, 0);
>   }
>   writer.append(c);
> }
> cells.clear();
>   } while (hasMore);
> scanner.next will choose at most "hbase.hstore.compaction.kv.max" kvs, the 
> last cell still reference by StoreScanner.prevCell, so if cleanSeqId is 
> called when the scanner.next call StoreScanner.checkScanOrder may throw 
> exception and cause regionserver down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16932) Implement small scan

2016-10-25 Thread Duo Zhang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607262#comment-15607262
 ] 

Duo Zhang commented on HBASE-16932:
---

Let me fix the findbugs and javadoc warnings and try precommit again.

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16932) Implement small scan

2016-10-25 Thread Yu Li (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607256#comment-15607256
 ] 

Yu Li commented on HBASE-16932:
---

+1, makes sense to have a new "smallScan" API for async call, and current API 
lgtm. Thanks.

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread Duo Zhang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607210#comment-15607210
 ] 

Duo Zhang commented on HBASE-16890:
---

I think consume method does the main work here so it is reasonable that it 
spent lots of cpu time. And the contention, maybe I could use the RingBuffer of 
Disruptor to reduce the time spent in critical section. Will try later.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16886) hbase-client: scanner with reversed=true and small=true gets no result

2016-10-25 Thread Guanghao Zhang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guanghao Zhang updated HBASE-16886:
---
Affects Version/s: 1.4.0
   1.3.0
   2.0.0
   1.2.3
   1.1.7
   0.98.23

> hbase-client: scanner with reversed=true and small=true gets no result
> --
>
> Key: HBASE-16886
> URL: https://issues.apache.org/jira/browse/HBASE-16886
> Project: HBase
>  Issue Type: Bug
>Affects Versions: 2.0.0, 1.3.0, 1.4.0, 1.2.3, 1.1.7, 0.98.23
>Reporter: huzheng
>  Labels: patch
> Attachments: HBASE-16886.v0.patch, HBASE-16886.v1.patch, 
> HBASE-16886.v2.patch, HBASE-16886.v3.patch, TestReversedSmallScan.java
>
>
> Assume HBase have four regions (-oo, b), [b, c), [c, d), [d,+oo) , and all 
> rowKeys are located in region [d, +oo).  using a Reversed Small Scanner will 
> get no result.
> Attached file show this failed test case.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607175#comment-15607175
 ] 

stack commented on HBASE-16890:
---

I can spend some time on this if you want. Just say.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-16890:
--
Attachment: Screen Shot 2016-10-25 at 7.39.48 PM.png
Screen Shot 2016-10-25 at 7.39.07 PM.png
Screen Shot 2016-10-25 at 7.34.47 PM.png
classic.svg
async.svg

Here are some flame graphs of classic vs async. We are in consume a good while 
of the cpu spent.

Screenshot shows call tree in JFR for same async.svg period.

These are time spent when sampling. Hard to get a good handle on 
synchronize/lock friction. One view is if an item shows up frequently in the 
thread contention view. Again we are sampling so if it shows here my thinking 
is that it must be a pain point (JFR does the unsafe safe point sampling so its 
view is a bit closer to what is going on). I include two images from same jfr 
compares of the contention windows for async and classic.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: Screen Shot 2016-10-25 at 7.34.47 PM.png, Screen Shot 
> 2016-10-25 at 7.39.07 PM.png, Screen Shot 2016-10-25 at 7.39.48 PM.png, 
> async.svg, classic.svg, contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-15748) Don't link in static libunwind.

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-15748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607120#comment-15607120
 ] 

Hudson commented on HBASE-15748:


SUCCESS: Integrated in Jenkins build HBase-1.2-JDK7 #56 (See 
[https://builds.apache.org/job/HBase-1.2-JDK7/56/])
HBASE-15748 update CHANGES.txt for additional changes in RC1. (busbey: rev 
67592f3d062743907f8c5ae00dbbe1ae4f69e5af)
* (edit) CHANGES.txt


> Don't link in static libunwind.
> ---
>
> Key: HBASE-15748
> URL: https://issues.apache.org/jira/browse/HBASE-15748
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Elliott Clark
>Assignee: Elliott Clark
> Attachments: HBASE-15748.patch
>
>
> A static libunwind compiled with gcc prevents clang from catching exceptions. 
> So just add the dynamic one. :-/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607118#comment-15607118
 ] 

Hudson commented on HBASE-16939:


SUCCESS: Integrated in Jenkins build HBase-1.2-JDK7 #56 (See 
[https://builds.apache.org/job/HBase-1.2-JDK7/56/])
HBASE-16939 ExportSnapshot: set owner and permission on right directory 
(matteo.bertozzi: rev 6542d651bf4c612f9e1e7b328dbea07eca1a5ae0)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16112) TestFlushSnapshotFromClient and TestMobFlushSnapshotFromClient are flaky

2016-10-25 Thread Jingcheng Du (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607100#comment-15607100
 ] 

Jingcheng Du commented on HBASE-16112:
--

Thanks a lot [~tedyu].
HBASE-16841 can fix issues in mob snapshot (I guess part of the failures in 
TestMobFlushSnapshotFromClient should be related with this). But I am not sure 
if the failures in TestFlushSnapshotFromClient is caused by this. Let's see if 
there are still flaky failures after HBASE-16841 is committed.

> TestFlushSnapshotFromClient and TestMobFlushSnapshotFromClient are flaky
> 
>
> Key: HBASE-16112
> URL: https://issues.apache.org/jira/browse/HBASE-16112
> Project: HBase
>  Issue Type: Test
>Reporter: Ted Yu
>Assignee: Jingcheng Du
>Priority: Minor
> Attachments: TestFlushSnapshotFromClient-2353.out, 
> TestMobFlushSnapshotFromClient-2353.out, testSnapshotStateAfterMerge-2353.out
>
>
> On recent QA runs, TestMobFlushSnapshotFromClient failed twice.
> https://builds.apache.org/job/PreCommit-HBASE-Build/2360/#showFailuresLink
> https://builds.apache.org/job/PreCommit-HBASE-Build/2353/artifact/patchprocess/patch-unit-hbase-server.txt
> {code}
> org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient.org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient
>   Run 1: 
> TestMobFlushSnapshotFromClient>TestFlushSnapshotFromClient.tearDown:123->Object.wait:-2
>  Â» TestTimedOut
>   Run 2: 
> TestMobFlushSnapshotFromClient.org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient
>  Â» 
> org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient.testSnapshotStateAfterMerge(org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient)
>   Run 1: 
> TestMobFlushSnapshotFromClient>TestFlushSnapshotFromClient.testSnapshotStateAfterMerge:335->TestFlushSnapshotFromClient.waitRegionsAfterMerge:526
>  Â» RetriesExhausted
>   Run 2: 
> TestMobFlushSnapshotFromClient>TestFlushSnapshotFromClient.tearDown:123 Â» 
> InterruptedIO
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607101#comment-15607101
 ] 

Hudson commented on HBASE-16939:


FAILURE: Integrated in Jenkins build HBase-1.4 #496 (See 
[https://builds.apache.org/job/HBase-1.4/496/])
HBASE-16939 ExportSnapshot: set owner and permission on right directory 
(matteo.bertozzi: rev 4edd8a63d2afa57a1c7029b2d6badeede8855097)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16841) Data loss in MOB files after cloning a snapshot and deleting that snapshot

2016-10-25 Thread Jingcheng Du (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607082#comment-15607082
 ] 

Jingcheng Du commented on HBASE-16841:
--

Thanks a lot [~stack]. I'll wait for his response.

> Data loss in MOB files after cloning a snapshot and deleting that snapshot
> --
>
> Key: HBASE-16841
> URL: https://issues.apache.org/jira/browse/HBASE-16841
> Project: HBase
>  Issue Type: Bug
>  Components: mob, snapshots
>Reporter: Jingcheng Du
>Assignee: Jingcheng Du
> Attachments: HBASE-16841-V2.patch, HBASE-16841-V3.patch, 
> HBASE-16841.patch
>
>
> Running the following steps will probably lose MOB data when working with 
> snapshots.
> 1. Create a mob-enabled table by running create 't1', {NAME => 'f1', IS_MOB 
> => true, MOB_THRESHOLD => 0}.
> 2. Put millions of data.
> 3. Run {{snapshot 't1','t1_snapshot'}} to take a snapshot for this table t1.
> 4. Run {{clone_snapshot 't1_snapshot','t1_cloned'}} to clone this snapshot.
> 5. Run {{delete_snapshot 't1_snapshot'}} to delete this snapshot.
> 6. Run {{disable 't1'}} and {{delete 't1'}} to delete the table.
> 7. Now go to the archive directory of t1, the number of .link directories is 
> different from the number of hfiles which means some data will be lost after 
> the hfile cleaner runs.
> This is because, when taking a snapshot on a enabled mob table, each region 
> flushes itself and takes a snapshot, and the mob snapshot is taken only if 
> the current region is first region of the table. At that time, the flushing 
> of some regions might not be finished, and some mob files are not flushed to 
> disk yet. Eventually some mob files are not recorded in the snapshot manifest.
> To solve this, we need to take the mob snapshot at last after the snapshots 
> on all the online and offline regions are finished in 
> {{EnabledTableSnapshotHandler}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread Duo Zhang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607074#comment-15607074
 ] 

Duo Zhang commented on HBASE-16890:
---

And could JFR print out the time spent in the critical section for each thread?

I wonder the problem is the consumer thread does too much work under the 
protection of waitingConsumePayloads.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-15748) Don't link in static libunwind.

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-15748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607021#comment-15607021
 ] 

Hudson commented on HBASE-15748:


SUCCESS: Integrated in Jenkins build HBase-1.2-JDK8 #51 (See 
[https://builds.apache.org/job/HBase-1.2-JDK8/51/])
HBASE-15748 update CHANGES.txt for additional changes in RC1. (busbey: rev 
67592f3d062743907f8c5ae00dbbe1ae4f69e5af)
* (edit) CHANGES.txt


> Don't link in static libunwind.
> ---
>
> Key: HBASE-15748
> URL: https://issues.apache.org/jira/browse/HBASE-15748
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Elliott Clark
>Assignee: Elliott Clark
> Attachments: HBASE-15748.patch
>
>
> A static libunwind compiled with gcc prevents clang from catching exceptions. 
> So just add the dynamic one. :-/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607019#comment-15607019
 ] 

Hudson commented on HBASE-16939:


SUCCESS: Integrated in Jenkins build HBase-1.2-JDK8 #51 (See 
[https://builds.apache.org/job/HBase-1.2-JDK8/51/])
HBASE-16939 ExportSnapshot: set owner and permission on right directory 
(matteo.bertozzi: rev 6542d651bf4c612f9e1e7b328dbea07eca1a5ae0)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Guanghao Zhang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15607013#comment-15607013
 ] 

Guanghao Zhang commented on HBASE-16939:


Thanks for your help. [~mbertozzi]

> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread Ted Yu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-16940:
---
   Resolution: Fixed
 Hadoop Flags: Reviewed
Fix Version/s: (was: HBASE-7912)
   2.0.0
   Status: Resolved  (was: Patch Available)

Integrated to HBASE-7912 branch.

Thanks for the patch, Vlad.

> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: 2.0.0
>
> Attachments: HBASE-16940-v1.patch, HBASE-16940-v2.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606936#comment-15606936
 ] 

Hadoop QA commented on HBASE-16940:
---

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 0s 
{color} | {color:blue} Docker mode activated. {color} |
| {color:red}-1{color} | {color:red} patch {color} | {color:red} 0m 4s {color} 
| {color:red} HBASE-16940 does not apply to master. Rebase required? Wrong 
Branch? See https://yetus.apache.org/documentation/0.3.0/precommit-patchnames 
for help. {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/12835232/HBASE-16940-v2.patch |
| JIRA Issue | HBASE-16940 |
| Console output | 
https://builds.apache.org/job/PreCommit-HBASE-Build/4186/console |
| Powered by | Apache Yetus 0.3.0   http://yetus.apache.org |


This message was automatically generated.



> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch, HBASE-16940-v2.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread Vladimir Rodionov (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-16940:
--
Attachment: HBASE-16940-v2.patch

v2.

> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch, HBASE-16940-v2.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606918#comment-15606918
 ] 

Ted Yu commented on HBASE-16940:


Minor comments after brief check.
For BackupSystemTable :
{code}
-rsTimestampMap.put(server, Long.parseLong(new String(data)));
+//rsTimestampMap.put(server, Long.parseLong(new String(data)));
+rsTimestampMap.put(server, Bytes.toLong(data));
{code}
The commented out line can be dropped.
{code}
+put.addColumn(BackupSystemTable.SESSIONS_FAMILY, Bytes.toBytes("context"), 
context.toByteArray());
{code}
Line longer than 100 char


> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606904#comment-15606904
 ] 

stack commented on HBASE-16940:
---

You need something folks can make some sense of as a subject/commit message. 
Looking at a commit message that has some random number, review, and 
miscellaneous means folks have to work had dig up a provenance.

> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606900#comment-15606900
 ] 

Ted Yu commented on HBASE-16940:


Running test suite based on patch v1.

> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB 52748

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-16940:
--
Summary: Address review of "Backup/Restore (HBASE-7912, HBASE-14030, 
HBASE-14123) mega patch" posted on RB 52748   (was: Address review of 
"Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB 
)

> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 52748 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega patch" posted on RB

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-16940:
--
Summary: Address review of "Backup/Restore (HBASE-7912, HBASE-14030, 
HBASE-14123) mega patch" posted on RB   (was: Review 52748: Miscellaneous )

> Address review of "Backup/Restore (HBASE-7912, HBASE-14030, HBASE-14123) mega 
> patch" posted on RB 
> --
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606899#comment-15606899
 ] 

Hadoop QA commented on HBASE-16940:
---

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 0s 
{color} | {color:blue} Docker mode activated. {color} |
| {color:red}-1{color} | {color:red} patch {color} | {color:red} 0m 4s {color} 
| {color:red} HBASE-16940 does not apply to master. Rebase required? Wrong 
Branch? See https://yetus.apache.org/documentation/0.3.0/precommit-patchnames 
for help. {color} |
\\
\\
|| Subsystem || Report/Notes ||
| JIRA Patch URL | 
https://issues.apache.org/jira/secure/attachment/12835230/HBASE-16940-v1.patch |
| JIRA Issue | HBASE-16940 |
| Console output | 
https://builds.apache.org/job/PreCommit-HBASE-Build/4185/console |
| Powered by | Apache Yetus 0.3.0   http://yetus.apache.org |


This message was automatically generated.



> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread Vladimir Rodionov (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-16940:
--
Status: Patch Available  (was: Open)

> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread Vladimir Rodionov (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-16940:
--
Description: 
Review 52748 remaining issues.

https://reviews.apache.org/r/52748

  was:Review 52748 remaining issues.


> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.
> https://reviews.apache.org/r/52748



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Comment Edited] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread Vladimir Rodionov (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606874#comment-15606874
 ] 

Vladimir Rodionov edited comment on HBASE-16940 at 10/26/16 12:08 AM:
--

v1. Addresses the rest of comments (majority of them) from RB. 

[~tedyu], it passes all tests. Lets commit it to HBASE-7912 and I will start 
working on porting all HBASE-16825 patches to master branch. 


was (Author: vrodionov):
v1. Addresses the rest of comments (majority of them) from RB.

> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread Vladimir Rodionov (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606877#comment-15606877
 ] 

Vladimir Rodionov commented on HBASE-16940:
---

{quote}
This issue needs better subject. As is, it is cryptic.
{quote}

Its hard for the cumulative patch of 30+ RB comments.

> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread Vladimir Rodionov (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-16940:
--
Attachment: HBASE-16940-v1.patch

v1. Addresses the rest of comments (majority of them) from RB.

> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
> Attachments: HBASE-16940-v1.patch
>
>
> Review 52748 remaining issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606847#comment-15606847
 ] 

Hudson commented on HBASE-16939:


FAILURE: Integrated in Jenkins build HBase-1.1-JDK7 #1805 (See 
[https://builds.apache.org/job/HBase-1.1-JDK7/1805/])
HBASE-16939 ExportSnapshot: set owner and permission on right directory 
(matteo.bertozzi: rev 288ea316be7039c9361ffba7061d6a68154e633c)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606841#comment-15606841
 ] 

Hudson commented on HBASE-16939:


SUCCESS: Integrated in Jenkins build HBase-Trunk_matrix #1854 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/1854/])
HBASE-16939 ExportSnapshot: set owner and permission on right directory 
(matteo.bertozzi: rev c4be87d0508cda019c15f929396d053887bf0e5e)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Resolved] (HBASE-16453) Test failure TestHRegion.testWritesWhileScanning

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-16453.
---
Resolution: Incomplete

Resolving incomplete. I don't think we have this any more.

> Test failure TestHRegion.testWritesWhileScanning
> 
>
> Key: HBASE-16453
> URL: https://issues.apache.org/jira/browse/HBASE-16453
> Project: HBase
>  Issue Type: Bug
>Reporter: Anoop Sam John
>
> Latest builds show this fail
> {code}
> org.junit.runners.model.TestTimedOutException: test timed out after 10 minutes
>   at java.lang.Object.wait(Native Method)
>   at 
> org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForRead(MultiVersionConcurrencyControl.java:218)
>   at 
> org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.completeAndWait(MultiVersionConcurrencyControl.java:149)
>   at 
> org.apache.hadoop.hbase.regionserver.HRegion.getNextSequenceId(HRegion.java:2604)
>   at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2321)
>   at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2217)
>   at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2188)
>   at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2178)
>   at 
> org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1484)
>   at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1412)
>   at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1362)
>   at 
> org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:369)
>   at 
> org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileScanning(TestHRegion.java:3962)
> {code}
> An MVCC stuck.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606785#comment-15606785
 ] 

stack commented on HBASE-16890:
---

The disruptor consumer thread would feed the eventloop? Could. Another thought 
was just collect the appends and not assign the trans id until we are going to 
sync since that is when we add the edit to the WAL and given that the write 
path is different in master -- we don't update memstore until after sync has 
come home -- then this could work.

One node is bringing out the fact that asyncfswal currently does not run as 
free as dfsclient#dfsos and the latter is friction-full with its 
packet-stutter. You've made it so it is easy to work on this now. No rush -- 
asyncwal is better than dfsclient in a couple of regards -- but yeah, lets make 
it so asyncwal blows away dfsclient any way you look at it.


> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606766#comment-15606766
 ] 

Hudson commented on HBASE-16939:


FAILURE: Integrated in Jenkins build HBase-1.1-JDK8 #1889 (See 
[https://builds.apache.org/job/HBase-1.1-JDK8/1889/])
HBASE-16939 ExportSnapshot: set owner and permission on right directory 
(matteo.bertozzi: rev 288ea316be7039c9361ffba7061d6a68154e633c)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/snapshot/ExportSnapshot.java


> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread Duo Zhang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606750#comment-15606750
 ] 

Duo Zhang commented on HBASE-16890:
---

[~stack] I think a simple way to avoid the contention is to use a disruptor in 
front of the old logic. This will need one more thread. But do you guys really 
think we should do optimizations based on the one node result? In a production 
we will write at least three node?

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread Duo Zhang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606735#comment-15606735
 ] 

Duo Zhang commented on HBASE-16890:
---

[~ram_krish] But AsyncFSWAL will not do flushing unless you call the flush 
method. So the problem is still what I said above, AsyncFSWAL may flush more. 
Otherwise there is no difference.

Thanks.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12894) Upgrade Jetty to 9.2.6

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606719#comment-15606719
 ] 

stack commented on HBASE-12894:
---

The complaint about ServerConnector in Builder from FindBugs seems legit. Why 
are we acting like it is going to be set in the Builder when no one does it. I 
took a look at it and doesn't make sense to me. Maybe you have a clue Mr. 
[~yguang11] Thanks. What about the REST FindBugs complaints too? Thank you.

> Upgrade Jetty to 9.2.6
> --
>
> Key: HBASE-12894
> URL: https://issues.apache.org/jira/browse/HBASE-12894
> Project: HBase
>  Issue Type: Improvement
>  Components: REST, UI
>Affects Versions: 0.98.0
>Reporter: Rick Hallihan
>Assignee: Guang Yang
>Priority: Critical
>  Labels: MicrosoftSupport
> Fix For: 2.0.0
>
> Attachments: HBASE-12894_Jetty9_v0.patch, 
> HBASE-12894_Jetty9_v1.patch, HBASE-12894_Jetty9_v1.patch, 
> HBASE-12894_Jetty9_v2.patch, HBASE-12894_Jetty9_v3.patch
>
>
> The Jetty component that is used for the HBase Stargate REST endpoint is 
> version 6.1.26 and is fairly outdated. We recently had a customer inquire 
> about enabling cross-origin resource sharing (CORS) for the REST endpoint and 
> found that this older version does not include the necessary filter or 
> configuration options, highlighted at: 
> http://wiki.eclipse.org/Jetty/Feature/Cross_Origin_Filter
> The Jetty project has had significant updates through versions 7, 8 and 9, 
> including a transition to be an Eclipse subproject, so updating to the latest 
> version may be non-trivial. The last update to the Jetty component in 
> https://issues.apache.org/jira/browse/HBASE-3377 was a minor version update 
> and did not require significant work. This update will include a package 
> namespace update so there will likely be a larger number of required changes. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Sean Busbey (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Busbey updated HBASE-16939:

Fix Version/s: (was: 1.2.5)
   1.2.4

> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Resolved] (HBASE-16944) Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-16944.
---
Resolution: Fixed

Pushed this:

{code}
diff --git 
a/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java 
b/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
index 71cc247..7bfd652 100644
--- 
a/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
+++ 
b/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
@@ -3025,6 +3025,8 @@ public class HRegion implements HeapSize, 
PropagatingConfigurationObserver, Regi
* In here we also handle replay of edits on region recover.
* @return Change in size brought about by applying batchOp
*/
+  @edu.umd.cs.findbugs.annotations.SuppressWarnings(value="UL_UNRELEASED_LOCK",
+ justification="Findbugs seems to be confused on this.")
   @SuppressWarnings("unchecked")
   // TODO: This needs a rewrite. Doesn't have to be this long. St.Ack 20160120
   private long doMiniBatchMutate(BatchOperation batchOp) throws IOException 
{
{code}

> Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK
> ---
>
> Key: HBASE-16944
> URL: https://issues.apache.org/jira/browse/HBASE-16944
> Project: HBase
>  Issue Type: Bug
>  Components: findbugs
>Reporter: stack
>
> Fix the annoying long-lived UL_UNRELEASED_LOCK warning out of line 3251 in 
> HRegion.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16944) Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606565#comment-15606565
 ] 

stack commented on HBASE-16944:
---


CodeWarning
UL  
org.apache.hadoop.hbase.regionserver.HRegion.doMiniBatchMutate(HRegion$BatchOperation)
 does not release lock on all paths
Bug type UL_UNRELEASED_LOCK (click for details) 
In class org.apache.hadoop.hbase.regionserver.HRegion
In method 
org.apache.hadoop.hbase.regionserver.HRegion.doMiniBatchMutate(HRegion$BatchOperation)
At HRegion.java:[line 3251]

> Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK
> ---
>
> Key: HBASE-16944
> URL: https://issues.apache.org/jira/browse/HBASE-16944
> Project: HBase
>  Issue Type: Bug
>  Components: findbugs
>Reporter: stack
>
> Fix the annoying long-lived UL_UNRELEASED_LOCK warning out of line 3251 in 
> HRegion.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HBASE-16944) Fix findbugs warning in hbase-server HRegion UL_UNRELEASED_LOCK

2016-10-25 Thread stack (JIRA)

stack created HBASE-16944:
-

 Summary: Fix findbugs warning in hbase-server HRegion 
UL_UNRELEASED_LOCK
 Key: HBASE-16944
 URL: https://issues.apache.org/jira/browse/HBASE-16944
 Project: HBase
  Issue Type: Bug
  Components: findbugs
Reporter: stack


Fix the annoying long-lived UL_UNRELEASED_LOCK warning out of line 3251 in 
HRegion.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Matteo Bertozzi (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-16939:

Fix Version/s: (was: 1.2.4)
   1.2.5

> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.5, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Matteo Bertozzi (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-16939:

   Resolution: Fixed
Fix Version/s: 1.1.8
   1.2.4
   1.3.1
   1.4.0
   2.0.0
   Status: Resolved  (was: Patch Available)

> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0, 1.3.1, 1.2.4, 1.1.8
>
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16939) ExportSnapshot: set owner and permission on right directory

2016-10-25 Thread Matteo Bertozzi (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606500#comment-15606500
 ] 

Matteo Bertozzi commented on HBASE-16939:
-

committed on 1.1, 1.2, branch-1, master. holding on 1.3 since we are trying to 
get a release, i'll commit it later for 1.3.1
(for 0.98, open a backport jira for both HBASE-14445 and this one if you need 
this stuff in 0.98)

> ExportSnapshot: set owner and permission on right directory
> ---
>
> Key: HBASE-16939
> URL: https://issues.apache.org/jira/browse/HBASE-16939
> Project: HBase
>  Issue Type: Bug
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Attachments: HBASE-16939-v1.patch, HBASE-16939.patch
>
>
> {code}
> FileUtil.copy(inputFs, snapshotDir, outputFs, initialOutputSnapshotDir, 
> false, false, conf);
>   if (filesUser != null || filesGroup != null) {
> setOwner(outputFs, snapshotTmpDir, filesUser, filesGroup, true);
>   }
>   if (filesMode > 0) {
> setPermission(outputFs, snapshotTmpDir, (short)filesMode, true);
>   }
> {code}
> It copy snapshot manifest to initialOutputSnapshotDir, but it set owner on 
> snapshotTmpDir. They are different directory when skipTmp is true.
> Another problem is new cluster doesn't have .hbase-snapshot directory. So 
> after export snapshot, it should set owner on .hbase-snapshot directory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606424#comment-15606424
 ] 

stack commented on HBASE-16890:
---

I did a primitive compare with 100 threads, 10 columns on WALPE. The classic 
FSHLog in JFR shows a few instances of FSOutputSummer#write and 
DFSOS#flushOrSync in samples (7 and 3 respectively). The asyncwal by contrast 
has two orders of magnitude more points of contention showing up in samples at 
append and sync (900 and 200). The sync on waitingConsumePayloads is a 
throughput killer.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606423#comment-15606423
 ] 

stack commented on HBASE-16890:
---

I did a primitive compare with 100 threads, 10 columns on WALPE. The classic 
FSHLog in JFR shows a few instances of FSOutputSummer#write and 
DFSOS#flushOrSync in samples (7 and 3 respectively). The asyncwal by contrast 
has two orders of magnitude more points of contention showing up in samples at 
append and sync (900 and 200). The sync on waitingConsumePayloads is a 
throughput killer.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-1600) Multiple HBase clients to multiple indpendent HBase clusters from the same jvm should be allowed

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606275#comment-15606275
 ] 

Hudson commented on HBASE-1600:
---

SUCCESS: Integrated in Jenkins build HBase-Trunk_matrix #1853 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/1853/])
HBASE-1600 Table#checkAndPut() docs are too vague (Csaba Skrabak) (stack: rev 
36aa41685685f6b7d10fea63787cb8adacecc2e7)
* (edit) hbase-client/src/main/java/org/apache/hadoop/hbase/client/Table.java


> Multiple HBase clients to multiple indpendent HBase clusters from the same 
> jvm should be allowed
> 
>
> Key: HBASE-1600
> URL: https://issues.apache.org/jira/browse/HBASE-1600
> Project: HBase
>  Issue Type: Bug
>  Components: Client
>Affects Versions: 0.20.0
>Reporter: Chris K Wensel
>Assignee: Nitay Joffe
>Priority: Blocker
>
> ZK quorum servers list is static in ZooKeeperWrapper and is coupled to 
> zoo.cfg. 
> This prevents multiple clients from connecting to multiple distinct HBase 
> clusters and requires zoo.cfg to exist on a cluster that needs to connect to 
> a remote HBase cluster.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-12894) Upgrade Jetty to 9.2.6

2016-10-25 Thread Guang Yang (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-12894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606066#comment-15606066
 ] 

Guang Yang commented on HBASE-12894:


Hi [~stack],
Please let me know if there is anything you want me to do for this patch so as 
to commit it, thanks!

> Upgrade Jetty to 9.2.6
> --
>
> Key: HBASE-12894
> URL: https://issues.apache.org/jira/browse/HBASE-12894
> Project: HBase
>  Issue Type: Improvement
>  Components: REST, UI
>Affects Versions: 0.98.0
>Reporter: Rick Hallihan
>Assignee: Guang Yang
>Priority: Critical
>  Labels: MicrosoftSupport
> Fix For: 2.0.0
>
> Attachments: HBASE-12894_Jetty9_v0.patch, 
> HBASE-12894_Jetty9_v1.patch, HBASE-12894_Jetty9_v1.patch, 
> HBASE-12894_Jetty9_v2.patch, HBASE-12894_Jetty9_v3.patch
>
>
> The Jetty component that is used for the HBase Stargate REST endpoint is 
> version 6.1.26 and is fairly outdated. We recently had a customer inquire 
> about enabling cross-origin resource sharing (CORS) for the REST endpoint and 
> found that this older version does not include the necessary filter or 
> configuration options, highlighted at: 
> http://wiki.eclipse.org/Jetty/Feature/Cross_Origin_Filter
> The Jetty project has had significant updates through versions 7, 8 and 9, 
> including a transition to be an Eclipse subproject, so updating to the latest 
> version may be non-trivial. The last update to the Jetty component in 
> https://issues.apache.org/jira/browse/HBASE-3377 was a minor version update 
> and did not require significant work. This update will include a package 
> namespace update so there will likely be a larger number of required changes. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16179) Fix compilation errors when building hbase-spark against Spark 2.0

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606070#comment-15606070
 ] 

Ted Yu commented on HBASE-16179:


Here is a related thread for a poll on dropping support for Scala 2.10 :

http://search-hadoop.com/m/Spark/q3RTtwK7l612MKAk?subj=Re+Straw+poll+dropping+support+for+things+like+Scala+2+10

> Fix compilation errors when building hbase-spark against Spark 2.0
> --
>
> Key: HBASE-16179
> URL: https://issues.apache.org/jira/browse/HBASE-16179
> Project: HBase
>  Issue Type: Bug
>  Components: spark
>Reporter: Ted Yu
>Assignee: Ted Yu
>Priority: Critical
> Fix For: 2.0.0
>
> Attachments: 16179.v0.txt, 16179.v1.txt, 16179.v1.txt, 16179.v10.txt, 
> 16179.v11.txt, 16179.v4.txt, 16179.v5.txt, 16179.v7.txt, 16179.v8.txt, 
> 16179.v9.txt
>
>
> I tried building hbase-spark module against Spark-2.0 snapshot and got the 
> following compilation errors:
> http://pastebin.com/bg3w247a
> Some Spark classes such as DataTypeParser and Logging are no longer 
> accessible to downstream projects.
> hbase-spark module should not depend on such classes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16931) Setting cell's seqId to zero in compaction flow might cause RS down.

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15606042#comment-15606042
 ] 

Hudson commented on HBASE-16931:


SUCCESS: Integrated in Jenkins build HBase-1.2-JDK7 #55 (See 
[https://builds.apache.org/job/HBase-1.2-JDK7/55/])
HBASE-16931 Setting cell's seqId to zero in compaction flow might cause (liyu: 
rev 7f10379b1e6756002993c2638c866a6537a088c4)
* (edit) 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/compactions/Compactor.java


> Setting cell's seqId to zero in compaction flow might cause RS down.
> 
>
> Key: HBASE-16931
> URL: https://issues.apache.org/jira/browse/HBASE-16931
> Project: HBase
>  Issue Type: Bug
>  Components: regionserver
>Affects Versions: 2.0.0
>Reporter: binlijin
>Assignee: binlijin
>Priority: Critical
> Fix For: 2.0.0, 1.4.0, 1.2.4, 1.1.8
>
> Attachments: HBASE-16931-master.patch, HBASE-16931.branch-1.patch, 
> HBASE-16931.branch-1.v2.patch, HBASE-16931_master_v2.patch, 
> HBASE-16931_master_v3.patch, HBASE-16931_master_v4.patch, 
> HBASE-16931_master_v5.patch
>
>
> Compactor#performCompaction
>   do {
> hasMore = scanner.next(cells, scannerContext);
> // output to writer:
> for (Cell c : cells) {
>   if (cleanSeqId && c.getSequenceId() <= smallestReadPoint) {
> CellUtil.setSequenceId(c, 0);
>   }
>   writer.append(c);
> }
> cells.clear();
>   } while (hasMore);
> scanner.next will choose at most "hbase.hstore.compaction.kv.max" kvs, the 
> last cell still reference by StoreScanner.prevCell, so if cleanSeqId is 
> called when the scanner.next call StoreScanner.checkScanOrder may throw 
> exception and cause regionserver down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread ramkrishna.s.vasudevan (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605994#comment-15605994
 ] 

ramkrishna.s.vasudevan commented on HBASE-16890:


Some updates here. Read the code and discussed internally on how things work. 
Need to do some verification in cluster tomorrow. Will report back. MAy be what 
i think is that
In FSHLog case once the data was equal to (512 * 9) bytes we were flushing the 
buffer and that flush was resulting in a chunk packet ( which is part of the 
bigger chunk 64K). So only when the sync is called we may end up in non-aligned 
chunks. 
All this happens on the dfs client side. 
On the DN side when it sees the unaligned block it tries to overwrite the CRC 
of the current block with the previous block CRC. Now in the FSHLog case we may 
get only the one chunk packet as unaligned per sync call.
But in case of AsyncHLog every packet we generate will mostly be unaligned and 
so the DN needs to do the extra work of overwriting the CRC every time. This is 
what I see from code but let me profile the DN also to verify. I may be wrong. 
Will come back here. From HBase side profiling not much is revealed as per my 
tests.

> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HBASE-16943) FN Interfaces for writing tools for monitoring/operations

2016-10-25 Thread Thiruvel Thirumoolan (JIRA)

Thiruvel Thirumoolan created HBASE-16943:


 Summary: FN Interfaces for writing tools for monitoring/operations
 Key: HBASE-16943
 URL: https://issues.apache.org/jira/browse/HBASE-16943
 Project: HBase
  Issue Type: Sub-task
Reporter: Thiruvel Thirumoolan
Assignee: Thiruvel Thirumoolan
 Fix For: 2.0.0


HBASE-15532 introduces new interfaces and tools. This sub-task is only for the 
new interfaces and their implementation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HBASE-16942) FavoredNodes - Balancer improvements

2016-10-25 Thread Thiruvel Thirumoolan (JIRA)

Thiruvel Thirumoolan created HBASE-16942:


 Summary: FavoredNodes - Balancer improvements
 Key: HBASE-16942
 URL: https://issues.apache.org/jira/browse/HBASE-16942
 Project: HBase
  Issue Type: Sub-task
Reporter: Thiruvel Thirumoolan
Assignee: Thiruvel Thirumoolan
 Fix For: 2.0.0


This deals with the balancer based enhancements to favored nodes patch as 
discussed in HBASE-15532.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Created] (HBASE-16941) FavoredNodes - Split/Merge code paths

2016-10-25 Thread Thiruvel Thirumoolan (JIRA)

Thiruvel Thirumoolan created HBASE-16941:


 Summary: FavoredNodes - Split/Merge code paths
 Key: HBASE-16941
 URL: https://issues.apache.org/jira/browse/HBASE-16941
 Project: HBase
  Issue Type: Sub-task
Reporter: Thiruvel Thirumoolan
Assignee: Thiruvel Thirumoolan
 Fix For: 2.0.0


This jira is to deal with the split/merge logic discussed as part of 
HBASE-15532.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16940) Review 52748: Miscellaneous

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605935#comment-15605935
 ] 

stack commented on HBASE-16940:
---

This issue needs better subject. As is, it is cryptic.

> Review 52748: Miscellaneous 
> 
>
> Key: HBASE-16940
> URL: https://issues.apache.org/jira/browse/HBASE-16940
> Project: HBase
>  Issue Type: Sub-task
>Reporter: Vladimir Rodionov
>Assignee: Vladimir Rodionov
> Fix For: HBASE-7912
>
>
> Review 52748 remaining issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16931) Setting cell's seqId to zero in compaction flow might cause RS down.

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605931#comment-15605931
 ] 

Hudson commented on HBASE-16931:


SUCCESS: Integrated in Jenkins build HBase-1.2-JDK8 #50 (See 
[https://builds.apache.org/job/HBase-1.2-JDK8/50/])
HBASE-16931 Setting cell's seqId to zero in compaction flow might cause (liyu: 
rev 7f10379b1e6756002993c2638c866a6537a088c4)
* (edit) 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/compactions/Compactor.java


> Setting cell's seqId to zero in compaction flow might cause RS down.
> 
>
> Key: HBASE-16931
> URL: https://issues.apache.org/jira/browse/HBASE-16931
> Project: HBase
>  Issue Type: Bug
>  Components: regionserver
>Affects Versions: 2.0.0
>Reporter: binlijin
>Assignee: binlijin
>Priority: Critical
> Fix For: 2.0.0, 1.4.0, 1.2.4, 1.1.8
>
> Attachments: HBASE-16931-master.patch, HBASE-16931.branch-1.patch, 
> HBASE-16931.branch-1.v2.patch, HBASE-16931_master_v2.patch, 
> HBASE-16931_master_v3.patch, HBASE-16931_master_v4.patch, 
> HBASE-16931_master_v5.patch
>
>
> Compactor#performCompaction
>   do {
> hasMore = scanner.next(cells, scannerContext);
> // output to writer:
> for (Cell c : cells) {
>   if (cleanSeqId && c.getSequenceId() <= smallestReadPoint) {
> CellUtil.setSequenceId(c, 0);
>   }
>   writer.append(c);
> }
> cells.clear();
>   } while (hasMore);
> scanner.next will choose at most "hbase.hstore.compaction.kv.max" kvs, the 
> last cell still reference by StoreScanner.prevCell, so if cleanSeqId is 
> called when the scanner.next call StoreScanner.checkScanOrder may throw 
> exception and cause regionserver down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16934) Revert ITBLL misconfiguration check introduced in HBASE-16562

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605910#comment-15605910
 ] 

Hudson commented on HBASE-16934:


FAILURE: Integrated in Jenkins build HBase-0.98-on-Hadoop-1.1 #1284 (See 
[https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/1284/])
HBASE-16934 Revert ITBLL misconfiguration check that results in NPE. (busbey: 
rev f475e54bef96a3817e4807c8bd21f841d1492b2a)
* (edit) 
hbase-it/src/test/java/org/apache/hadoop/hbase/test/IntegrationTestBigLinkedList.java


> Revert ITBLL misconfiguration check introduced in HBASE-16562
> -
>
> Key: HBASE-16934
> URL: https://issues.apache.org/jira/browse/HBASE-16934
> Project: HBase
>  Issue Type: Bug
>  Components: integration tests
>Affects Versions: 1.1.7, 0.98.23
>Reporter: Sean Busbey
>Assignee: Sean Busbey
>Priority: Critical
> Fix For: 0.98.24, 1.1.8
>
>
> The misconfiguration check in HBASE-16562 does insufficient input validation 
> / handling, resulting in a NPE during normal operations.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-14551) Procedure v2 - Reimplement split

2016-10-25 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-14551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605896#comment-15605896
 ] 

Hadoop QA commented on HBASE-14551:
---

| (x) *{color:red}-1 overall{color}* |
\\
\\
|| Vote || Subsystem || Runtime || Comment ||
| {color:blue}0{color} | {color:blue} reexec {color} | {color:blue} 0m 15s 
{color} | {color:blue} Docker mode activated. {color} |
| {color:green}+1{color} | {color:green} @author {color} | {color:green} 0m 0s 
{color} | {color:green} The patch does not contain any @author tags. {color} |
| {color:green}+1{color} | {color:green} test4tests {color} | {color:green} 0m 
0s {color} | {color:green} The patch appears to include 9 new or modified test 
files. {color} |
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 0m 7s 
{color} | {color:blue} Maven dependency ordering for branch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 3m 
0s {color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 27s 
{color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 13m 
55s {color} | {color:green} master passed {color} |
| {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green} 0m 
39s {color} | {color:green} master passed {color} |
| {color:red}-1{color} | {color:red} findbugs {color} | {color:red} 1m 49s 
{color} | {color:red} hbase-protocol-shaded in master has 24 extant Findbugs 
warnings. {color} |
| {color:red}-1{color} | {color:red} findbugs {color} | {color:red} 1m 39s 
{color} | {color:red} hbase-server in master has 1 extant Findbugs warnings. 
{color} |
| {color:green}+1{color} | {color:green} javadoc {color} | {color:green} 0m 59s 
{color} | {color:green} master passed {color} |
| {color:blue}0{color} | {color:blue} mvndep {color} | {color:blue} 0m 8s 
{color} | {color:blue} Maven dependency ordering for patch {color} |
| {color:green}+1{color} | {color:green} mvninstall {color} | {color:green} 1m 
42s {color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} compile {color} | {color:green} 1m 28s 
{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} cc {color} | {color:green} 1m 28s 
{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} javac {color} | {color:green} 1m 28s 
{color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} checkstyle {color} | {color:green} 14m 
33s {color} | {color:green} the patch passed {color} |
| {color:green}+1{color} | {color:green} mvneclipse {color} | {color:green} 0m 
39s {color} | {color:green} the patch passed {color} |
| {color:red}-1{color} | {color:red} whitespace {color} | {color:red} 0m 0s 
{color} | {color:red} The patch has 19 line(s) that end in whitespace. Use git 
apply --whitespace=fix. {color} |
| {color:green}+1{color} | {color:green} hadoopcheck {color} | {color:green} 
28m 3s {color} | {color:green} Patch does not cause any errors with Hadoop 
2.6.1 2.6.2 2.6.3 2.6.4 2.6.5 2.7.1 2.7.2 2.7.3 or 3.0.0-alpha1. {color} |
| {color:green}+1{color} | {color:green} findbugs {color} | {color:green} 5m 
32s {color} | {color:green} the patch passed {color} |
| {color:red}-1{color} | {color:red} javadoc {color} | {color:red} 0m 17s 
{color} | {color:red} hbase-client generated 1 new + 13 unchanged - 0 fixed = 
14 total (was 13) {color} |
| {color:red}-1{color} | {color:red} javadoc {color} | {color:red} 0m 28s 
{color} | {color:red} hbase-server generated 3 new + 1 unchanged - 0 fixed = 4 
total (was 1) {color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green} 0m 28s 
{color} | {color:green} hbase-protocol-shaded in the patch passed. {color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green} 0m 54s 
{color} | {color:green} hbase-client in the patch passed. {color} |
| {color:red}-1{color} | {color:red} unit {color} | {color:red} 85m 31s {color} 
| {color:red} hbase-server in the patch failed. {color} |
| {color:green}+1{color} | {color:green} unit {color} | {color:green} 0m 23s 
{color} | {color:green} hbase-rsgroup in the patch passed. {color} |
| {color:red}-1{color} | {color:red} asflicense {color} | {color:red} 0m 46s 
{color} | {color:red} The patch generated 1 ASF License warnings. {color} |
| {color:black}{color} | {color:black} {color} | {color:black} 167m 17s {color} 
| {color:black} {color} |
\\
\\
|| Reason || Tests ||
| Failed junit tests | hadoop.hbase.security.access.TestAccessController |
|   | hadoop.hbase.regionserver.TestSplitTransactionOnCluster |
|   | hadoop.hbase.namespace.TestNamespaceAuditor |
|   | hadoop.hbase.mapreduce.TestLoadIncrementalHFilesSplitRecovery |
|   | hadoop.hbase.mapreduce.TestSecureLoadIncrementalHFilesSpli

[jira] [Commented] (HBASE-14417) Incremental backup and bulk loading

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-14417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605864#comment-15605864
 ] 

Ted Yu commented on HBASE-14417:


In IncrementalTableBackupClient, before marking the backup complete, I am 
adding code to copy the bulk loaded hfiles to destination filesystem and 
persist the list of copied files to hbase:backup table.

> Incremental backup and bulk loading
> ---
>
> Key: HBASE-14417
> URL: https://issues.apache.org/jira/browse/HBASE-14417
> Project: HBase
>  Issue Type: New Feature
>Affects Versions: 2.0.0
>Reporter: Vladimir Rodionov
>Assignee: Ted Yu
>Priority: Critical
>  Labels: backup
> Fix For: 2.0.0
>
> Attachments: 14417.v1.txt, 14417.v11.txt, 14417.v13.txt, 
> 14417.v2.txt, 14417.v6.txt
>
>
> Currently, incremental backup is based on WAL files. Bulk data loading 
> bypasses WALs for obvious reasons, breaking incremental backups. The only way 
> to continue backups after bulk loading is to create new full backup of a 
> table. This may not be feasible for customers who do bulk loading regularly 
> (say, every day).
> Google doc for design:
> https://docs.google.com/document/d/1ACCLsecHDvzVSasORgqqRNrloGx4mNYIbvAU7lq5lJE



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16841) Data loss in MOB files after cloning a snapshot and deleting that snapshot

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605758#comment-15605758
 ] 

stack commented on HBASE-16841:
---

Patch LGTM but good to get [~mbertozzi]'s opinion [~jingcheng...@intel.com] He 
is out the next few days but will be back sir.

> Data loss in MOB files after cloning a snapshot and deleting that snapshot
> --
>
> Key: HBASE-16841
> URL: https://issues.apache.org/jira/browse/HBASE-16841
> Project: HBase
>  Issue Type: Bug
>  Components: mob, snapshots
>Reporter: Jingcheng Du
>Assignee: Jingcheng Du
> Attachments: HBASE-16841-V2.patch, HBASE-16841-V3.patch, 
> HBASE-16841.patch
>
>
> Running the following steps will probably lose MOB data when working with 
> snapshots.
> 1. Create a mob-enabled table by running create 't1', {NAME => 'f1', IS_MOB 
> => true, MOB_THRESHOLD => 0}.
> 2. Put millions of data.
> 3. Run {{snapshot 't1','t1_snapshot'}} to take a snapshot for this table t1.
> 4. Run {{clone_snapshot 't1_snapshot','t1_cloned'}} to clone this snapshot.
> 5. Run {{delete_snapshot 't1_snapshot'}} to delete this snapshot.
> 6. Run {{disable 't1'}} and {{delete 't1'}} to delete the table.
> 7. Now go to the archive directory of t1, the number of .link directories is 
> different from the number of hfiles which means some data will be lost after 
> the hfile cleaner runs.
> This is because, when taking a snapshot on a enabled mob table, each region 
> flushes itself and takes a snapshot, and the mob snapshot is taken only if 
> the current region is first region of the table. At that time, the flushing 
> of some regions might not be finished, and some mob files are not flushed to 
> disk yet. Eventually some mob files are not recorded in the snapshot manifest.
> To solve this, we need to take the mob snapshot at last after the snapshots 
> on all the online and offline regions are finished in 
> {{EnabledTableSnapshotHandler}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-16000) Table#checkAndPut() docs are too vague

2016-10-25 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-16000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-16000:
--
   Resolution: Fixed
 Hadoop Flags: Reviewed
Fix Version/s: 2.0.0
   Status: Resolved  (was: Patch Available)

Pushed to master. Thanks for the fixup [~cskrabak]

> Table#checkAndPut() docs are too vague
> --
>
> Key: HBASE-16000
> URL: https://issues.apache.org/jira/browse/HBASE-16000
> Project: HBase
>  Issue Type: Bug
>  Components: documentation
>Reporter: Nick Dimiduk
>Assignee: Csaba Skrabak
>Priority: Minor
>  Labels: beginner, beginners
> Fix For: 2.0.0
>
> Attachments: hbase-16000.patch
>
>
> The docs for our CAS operations that accept a {{CompareOp}} are too vague. 
> They're copied from the part that implies {{CompareOp.EQUAL}}, which means 
> they don't tell you how your op, the provided {{value}}, and any present 
> value are compared.
> Only by digging into HRegion.java can I see that the operation is, for 
> example with CompareOp.GREATER, we get {{value.compareTo(existingKeyValue) > 
> 0}}. Javadocs should be explicit about the order of operands.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16932) Implement small scan

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605732#comment-15605732
 ] 

stack commented on HBASE-16932:
---

I think this a good first cut. I think calling it as it is -- smallScan -- is 
the way to go. Get another opinion given this is going to be public API 
([~carp84]?)

> Implement small scan
> 
>
> Key: HBASE-16932
> URL: https://issues.apache.org/jira/browse/HBASE-16932
> Project: HBase
>  Issue Type: Sub-task
>  Components: Client
>Affects Versions: 2.0.0
>Reporter: Duo Zhang
>Assignee: Duo Zhang
> Fix For: 2.0.0
>
> Attachments: HBASE-16932.patch
>
>
> As said in HBASE-16838, we should have a scan method that returns a 
> CompletaFuture for the whole result. It is suitable for a small scan.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16890) Analyze the performance of AsyncWAL and fix the same

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605707#comment-15605707
 ] 

stack commented on HBASE-16890:
---

bq. There is no contention for the synchronization in DFSClient#DFSOS for our 
FSHLog. We always call it in the consumer thread of Disruptor...

In FSHLog there is the consumer thread of Disruptor plus five syncer threads 
all banging on DFSClient#DFSOS.




> Analyze the performance of AsyncWAL and fix the same
> 
>
> Key: HBASE-16890
> URL: https://issues.apache.org/jira/browse/HBASE-16890
> Project: HBase
>  Issue Type: Sub-task
>  Components: wal
>Affects Versions: 2.0.0
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
> Fix For: 2.0.0
>
> Attachments: contention.png, contention_defaultWAL.png
>
>
> Tests reveal that AsyncWAL under load in single node cluster performs slower 
> than the Default WAL. This task is to analyze and see if we could fix it.
> See some discussions in the tail of JIRA HBASE-15536.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-14417) Incremental backup and bulk loading

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-14417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605689#comment-15605689
 ] 

Ted Yu commented on HBASE-14417:


When running TestFullRestore, I found an interesting issue: bulk load can 
happen to the table which is fully backed up - when overwrite option is 
specified.

I can think of two ways to omit these bulk loaded files:
1. under backup zookeeper subtree, create znode with tables which are being 
restored to with overwrite option. This allows the postAppend() hook to skip 
these tables for the duration of the restore
2. at the end of bulk load, issue deletes against the znodes which are added 
during the bulk load

I am leaning toward first option.

> Incremental backup and bulk loading
> ---
>
> Key: HBASE-14417
> URL: https://issues.apache.org/jira/browse/HBASE-14417
> Project: HBase
>  Issue Type: New Feature
>Affects Versions: 2.0.0
>Reporter: Vladimir Rodionov
>Assignee: Ted Yu
>Priority: Critical
>  Labels: backup
> Fix For: 2.0.0
>
> Attachments: 14417.v1.txt, 14417.v11.txt, 14417.v13.txt, 
> 14417.v2.txt, 14417.v6.txt
>
>
> Currently, incremental backup is based on WAL files. Bulk data loading 
> bypasses WALs for obvious reasons, breaking incremental backups. The only way 
> to continue backups after bulk loading is to create new full backup of a 
> table. This may not be feasible for customers who do bulk loading regularly 
> (say, every day).
> Google doc for design:
> https://docs.google.com/document/d/1ACCLsecHDvzVSasORgqqRNrloGx4mNYIbvAU7lq5lJE



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16931) Setting cell's seqId to zero in compaction flow might cause RS down.

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605663#comment-15605663
 ] 

Hudson commented on HBASE-16931:


FAILURE: Integrated in Jenkins build HBase-1.1-JDK7 #1804 (See 
[https://builds.apache.org/job/HBase-1.1-JDK7/1804/])
HBASE-16931 Setting cell's seqId to zero in compaction flow might cause (liyu: 
rev f4cbf77f28a2668743b1c73b5aeb113c3456fb13)
* (edit) 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/compactions/Compactor.java


> Setting cell's seqId to zero in compaction flow might cause RS down.
> 
>
> Key: HBASE-16931
> URL: https://issues.apache.org/jira/browse/HBASE-16931
> Project: HBase
>  Issue Type: Bug
>  Components: regionserver
>Affects Versions: 2.0.0
>Reporter: binlijin
>Assignee: binlijin
>Priority: Critical
> Fix For: 2.0.0, 1.4.0, 1.2.4, 1.1.8
>
> Attachments: HBASE-16931-master.patch, HBASE-16931.branch-1.patch, 
> HBASE-16931.branch-1.v2.patch, HBASE-16931_master_v2.patch, 
> HBASE-16931_master_v3.patch, HBASE-16931_master_v4.patch, 
> HBASE-16931_master_v5.patch
>
>
> Compactor#performCompaction
>   do {
> hasMore = scanner.next(cells, scannerContext);
> // output to writer:
> for (Cell c : cells) {
>   if (cleanSeqId && c.getSequenceId() <= smallestReadPoint) {
> CellUtil.setSequenceId(c, 0);
>   }
>   writer.append(c);
> }
> cells.clear();
>   } while (hasMore);
> scanner.next will choose at most "hbase.hstore.compaction.kv.max" kvs, the 
> last cell still reference by StoreScanner.prevCell, so if cleanSeqId is 
> called when the scanner.next call StoreScanner.checkScanOrder may throw 
> exception and cause regionserver down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16112) TestFlushSnapshotFromClient and TestMobFlushSnapshotFromClient are flaky

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605647#comment-15605647
 ] 

Ted Yu commented on HBASE-16112:


HBASE-16841 would make these tests more stable, right ?

> TestFlushSnapshotFromClient and TestMobFlushSnapshotFromClient are flaky
> 
>
> Key: HBASE-16112
> URL: https://issues.apache.org/jira/browse/HBASE-16112
> Project: HBase
>  Issue Type: Test
>Reporter: Ted Yu
>Assignee: Jingcheng Du
>Priority: Minor
> Attachments: TestFlushSnapshotFromClient-2353.out, 
> TestMobFlushSnapshotFromClient-2353.out, testSnapshotStateAfterMerge-2353.out
>
>
> On recent QA runs, TestMobFlushSnapshotFromClient failed twice.
> https://builds.apache.org/job/PreCommit-HBASE-Build/2360/#showFailuresLink
> https://builds.apache.org/job/PreCommit-HBASE-Build/2353/artifact/patchprocess/patch-unit-hbase-server.txt
> {code}
> org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient.org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient
>   Run 1: 
> TestMobFlushSnapshotFromClient>TestFlushSnapshotFromClient.tearDown:123->Object.wait:-2
>  Â» TestTimedOut
>   Run 2: 
> TestMobFlushSnapshotFromClient.org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient
>  Â» 
> org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient.testSnapshotStateAfterMerge(org.apache.hadoop.hbase.snapshot.TestMobFlushSnapshotFromClient)
>   Run 1: 
> TestMobFlushSnapshotFromClient>TestFlushSnapshotFromClient.testSnapshotStateAfterMerge:335->TestFlushSnapshotFromClient.waitRegionsAfterMerge:526
>  Â» RetriesExhausted
>   Run 2: 
> TestMobFlushSnapshotFromClient>TestFlushSnapshotFromClient.tearDown:123 Â» 
> InterruptedIO
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16931) Setting cell's seqId to zero in compaction flow might cause RS down.

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605578#comment-15605578
 ] 

Hudson commented on HBASE-16931:


FAILURE: Integrated in Jenkins build HBase-1.1-JDK8 #1888 (See 
[https://builds.apache.org/job/HBase-1.1-JDK8/1888/])
HBASE-16931 Setting cell's seqId to zero in compaction flow might cause (liyu: 
rev f4cbf77f28a2668743b1c73b5aeb113c3456fb13)
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/compactions/Compactor.java
* (edit) 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompaction.java


> Setting cell's seqId to zero in compaction flow might cause RS down.
> 
>
> Key: HBASE-16931
> URL: https://issues.apache.org/jira/browse/HBASE-16931
> Project: HBase
>  Issue Type: Bug
>  Components: regionserver
>Affects Versions: 2.0.0
>Reporter: binlijin
>Assignee: binlijin
>Priority: Critical
> Fix For: 2.0.0, 1.4.0, 1.2.4, 1.1.8
>
> Attachments: HBASE-16931-master.patch, HBASE-16931.branch-1.patch, 
> HBASE-16931.branch-1.v2.patch, HBASE-16931_master_v2.patch, 
> HBASE-16931_master_v3.patch, HBASE-16931_master_v4.patch, 
> HBASE-16931_master_v5.patch
>
>
> Compactor#performCompaction
>   do {
> hasMore = scanner.next(cells, scannerContext);
> // output to writer:
> for (Cell c : cells) {
>   if (cleanSeqId && c.getSequenceId() <= smallestReadPoint) {
> CellUtil.setSequenceId(c, 0);
>   }
>   writer.append(c);
> }
> cells.clear();
>   } while (hasMore);
> scanner.next will choose at most "hbase.hstore.compaction.kv.max" kvs, the 
> last cell still reference by StoreScanner.prevCell, so if cleanSeqId is 
> called when the scanner.next call StoreScanner.checkScanOrder may throw 
> exception and cause regionserver down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16783) Use ByteBufferPool for the header and message during Rpc response

2016-10-25 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605571#comment-15605571
 ] 

stack commented on HBASE-16783:
---

LGTM

> Use ByteBufferPool for the header and message during Rpc response
> -
>
> Key: HBASE-16783
> URL: https://issues.apache.org/jira/browse/HBASE-16783
> Project: HBase
>  Issue Type: Improvement
>Reporter: ramkrishna.s.vasudevan
>Assignee: ramkrishna.s.vasudevan
>Priority: Minor
> Attachments: HBASE-16783.patch, HBASE-16783_1.patch, 
> HBASE-16783_2.patch, HBASE-16783_3.patch, HBASE-16783_4.patch, 
> HBASE-16783_5.patch, HBASE-16783_6.patch, HBASE-16783_7.patch
>
>
> With ByteBufferPool in place we could avoid the byte[] creation in 
> RpcServer#createHeaderAndMessageBytes and try using the Buffer from the pool 
> rather than creating byte[] every time.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16841) Data loss in MOB files after cloning a snapshot and deleting that snapshot

2016-10-25 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605412#comment-15605412
 ] 

Ted Yu commented on HBASE-16841:


Patch v3 is good by me.

> Data loss in MOB files after cloning a snapshot and deleting that snapshot
> --
>
> Key: HBASE-16841
> URL: https://issues.apache.org/jira/browse/HBASE-16841
> Project: HBase
>  Issue Type: Bug
>  Components: mob, snapshots
>Reporter: Jingcheng Du
>Assignee: Jingcheng Du
> Attachments: HBASE-16841-V2.patch, HBASE-16841-V3.patch, 
> HBASE-16841.patch
>
>
> Running the following steps will probably lose MOB data when working with 
> snapshots.
> 1. Create a mob-enabled table by running create 't1', {NAME => 'f1', IS_MOB 
> => true, MOB_THRESHOLD => 0}.
> 2. Put millions of data.
> 3. Run {{snapshot 't1','t1_snapshot'}} to take a snapshot for this table t1.
> 4. Run {{clone_snapshot 't1_snapshot','t1_cloned'}} to clone this snapshot.
> 5. Run {{delete_snapshot 't1_snapshot'}} to delete this snapshot.
> 6. Run {{disable 't1'}} and {{delete 't1'}} to delete the table.
> 7. Now go to the archive directory of t1, the number of .link directories is 
> different from the number of hfiles which means some data will be lost after 
> the hfile cleaner runs.
> This is because, when taking a snapshot on a enabled mob table, each region 
> flushes itself and takes a snapshot, and the mob snapshot is taken only if 
> the current region is first region of the table. At that time, the flushing 
> of some regions might not be finished, and some mob files are not flushed to 
> disk yet. Eventually some mob files are not recorded in the snapshot manifest.
> To solve this, we need to take the mob snapshot at last after the snapshots 
> on all the online and offline regions are finished in 
> {{EnabledTableSnapshotHandler}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-16910) Avoid NPE when starting StochasticLoadBalancer

2016-10-25 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-16910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15605451#comment-15605451
 ] 

Hudson commented on HBASE-16910:


SUCCESS: Integrated in Jenkins build HBase-1.4 #495 (See 
[https://builds.apache.org/job/HBase-1.4/495/])
HBASE-16910 Avoid NPE when starting StochasticLoadBalancer (ashishsinghi: rev 
ae502a9d5ce3dc5c4a485c3ff364d433bdf29a10)
* (edit) hbase-server/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
* (edit) 
hbase-server/src/main/java/org/apache/hadoop/hbase/master/balancer/StochasticLoadBalancer.java


> Avoid NPE when starting StochasticLoadBalancer
> --
>
> Key: HBASE-16910
> URL: https://issues.apache.org/jira/browse/HBASE-16910
> Project: HBase
>  Issue Type: Bug
>  Components: Balancer
>Affects Versions: 2.0.0
>Reporter: Guanghao Zhang
>Assignee: Guanghao Zhang
>Priority: Minor
> Fix For: 2.0.0, 1.4.0
>
> Attachments: HBASE-16910-branch-1.patch, HBASE-16910-v1.patch, 
> HBASE-16910.patch
>
>
> When master start, it initialize StochasticLoadBalancer.
> {code}
> this.balancer.setClusterStatus(getClusterStatus());
> this.balancer.setMasterServices(this);
> {code}
> It first setClusterStatus(), then setMasterService(). But in setClusterStatus 
> method, it use master service which is not initialized. So it will throw NPE.
> {code}
> int tablesCount = isByTable ? services.getTableDescriptors().getAll().size() 
> : 1;
> {code}
> It happens when set isByTable is ture.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-14551) Procedure v2 - Reimplement split

2016-10-25 Thread Stephen Yuan Jiang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-14551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephen Yuan Jiang updated HBASE-14551:
---
Hadoop Flags: Reviewed
  Status: Patch Available  (was: Open)

> Procedure v2 - Reimplement split
> 
>
> Key: HBASE-14551
> URL: https://issues.apache.org/jira/browse/HBASE-14551
> Project: HBase
>  Issue Type: Sub-task
>  Components: proc-v2
>Affects Versions: 2.0.0
>Reporter: Matteo Bertozzi
>Assignee: Stephen Yuan Jiang
>Priority: Minor
> Fix For: 2.0.0
>
> Attachments: HBASE-14551.v1-master.patch
>
>
> use the proc-v2 state machine for split. also update the logic to have a 
> single meta-writer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (HBASE-14551) Procedure v2 - Reimplement split

2016-10-25 Thread Stephen Yuan Jiang (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-14551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stephen Yuan Jiang updated HBASE-14551:
---
Attachment: HBASE-14551.v1-master.patch

> Procedure v2 - Reimplement split
> 
>
> Key: HBASE-14551
> URL: https://issues.apache.org/jira/browse/HBASE-14551
> Project: HBase
>  Issue Type: Sub-task
>  Components: proc-v2
>Affects Versions: 2.0.0
>Reporter: Matteo Bertozzi
>Assignee: Stephen Yuan Jiang
>Priority: Minor
> Fix For: 2.0.0
>
> Attachments: HBASE-14551.v1-master.patch
>
>
> use the proc-v2 state machine for split. also update the logic to have a 
> single meta-writer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

1 2 >

1 - 100 of 139 matches

Mail list logo