subject:"\[jira\] \[Commented\] \(HBASE\-10015\) Major performance improvement\: Avoid synchronization in StoreScanner"

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13831814#comment-13831814
]

Lars Hofhansl commented on HBASE-10015:
---

So it seems this is universal.
Our performance dude has actually confirmed that this is an expected outcome.
I'll gather some more CPU metrics as I find time today.

[~stack], these locks are taken in StoreScanner, which is row based (unlike
StoreFileScanner and MemstoreScanner). So the frequent locks/unlocks there we'd
mostly see per row. Wider rows won't be slower, but the speedup effect would be
proportionally less. To be sure I'll validate with wider tables (maybe 20 CQs).
I will also double check that biased locking is in effect.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Attachments: 10015-0.94-lock.txt, 10015-0.94-new-sample.txt,
10015-0.94-v2.txt, 10015-0.94-v3.txt, 10015-0.94-v4.txt,
10015-0.94-withtest.txt, 10015-0.94.txt, 10015-trunk-v2.txt,
10015-trunk-v3.txt, 10015-trunk-v4.txt, 10015-trunk-v4.txt,
10015-trunk-v4.txt, 10015-trunk.txt, TestLoad.java

Did some more profiling (this time with a sampling profiler) and
StoreScanner.peek() showed up a lot in the samples. At first that was
surprising, but peek is synchronized, so it seems a lot of the sync'ing cost
is eaten there.
It seems the only reason we have to synchronize all these methods is because
a concurrent flush or compaction can change the scanner stack, other than
that only a single thread should access a StoreScanner at any given time.
So replaced updateReaders() with some code that just indicates to the scanner
that the readers should be updated and then make it the using thread's
responsibility to do the work.
The perf improvement from this is staggering. I am seeing somewhere around 3x
scan performance improvement across all scenarios.
Now, the hard part is to reason about whether this is 100% correct. I ran
TestAtomicOperation and TestAcidGuarantees a few times in a loop, all still
pass.
Will attach a sample patch.

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13832029#comment-13832029
]

Lars Hofhansl commented on HBASE-10015:
---

10 columns, 100 byte values, 1st and 4th selected.
5 runs, Mean: 13.756 Sigma: 0.04210938137755054, with lock patch
5 runs, Mean: 14.2028 Sigma: 0.08452313292821084, without

10 columns, all selected
5 runs, Mean: 8.577 Sigma: 0.09060463564299566
5 runs, Mean: 9.9846 Sigma: 0.1023749969474969, without

Per KV cost now is dominant. In no scenario have I observed this to be slower.

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

2013-11-25 Thread Ted Yu (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13832032#comment-13832032
]

Ted Yu commented on HBASE-10015:

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13832107#comment-13832107
]

Lars Hofhansl commented on HBASE-10015:
---

I force enabled biased locking. No improvement with intrinsic locking;
according to the docs it is enabled in JDK6+ anyway, was just making sure.

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

2013-11-25 Thread Enis Soztutar (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13832147#comment-13832147
]

Enis Soztutar commented on HBASE-10015:
---

If we do ref counting, can't we still finish the compaction, but not archive
the files as long as there are scanners against them. We can do it by
decoupling the store file archiving from compaction. At the worst case on RS
crash, we might end up already compacted files which won't affect semantics.
Regardless, +1 on -lock patch just for the gains.

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13832162#comment-13832162
]

Lars Hofhansl commented on HBASE-10015:
---

Yeah, that's the idea. We'd delay archiving any HFile until no scanners are
referring to it any more.

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

2013-11-24 Thread stack (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13831001#comment-13831001
]

stack commented on HBASE-10015:
---

So you are thinking this cannot be completed until after we add delayed clean
up of no-longer-used files? Only then can we safely remove synchronizations?

How many threads we talking anyways? It should be uncontended. The only
thread is the current handler asking to return scan results -- this changes as
different handlers come in on each bulk next invocation -- and then an
incidental update readers request..and that is it?

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Attachments: 10015-0.94-v2.txt, 10015-0.94-v3.txt, 10015-0.94-v4.txt,
10015-0.94-withtest.txt, 10015-0.94.txt, 10015-trunk-v2.txt,
10015-trunk-v3.txt, 10015-trunk-v4.txt, 10015-trunk-v4.txt,
10015-trunk-v4.txt, 10015-trunk.txt, TestLoad.java

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1383#comment-1383
]

Lars Hofhansl commented on HBASE-10015:
---

I think so (to your first point).

This is (almost) never contented, but StoreScanner.peek() is called *very*
frequently (including the compares in KeyValueHeap) and the memory fences
enforced by synchronized cause a slowdown.

I did notice that during flushed and compactions we do *not* register any
listeners for changed readers, so my earlier idea of just synchronizing on the
RegionScannerImpl should work after all.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Attachments: 10015-0.94-v2.txt, 10015-0.94-v3.txt, 10015-0.94-v4.txt,
10015-0.94-withtest.txt, 10015-0.94.txt, 10015-trunk-v2.txt,
10015-trunk-v3.txt, 10015-trunk-v4.txt, 10015-trunk-v4.txt,
10015-trunk-v4.txt, 10015-trunk.txt, TestLoad.java

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13831165#comment-13831165
]

Lars Hofhansl commented on HBASE-10015:
---

Was just looking at making a trunk patch.
In trunk StoreScanners for compaction *do* register a change observer? But not
in 0.94?!

So - sigh - the patch I just made for 0.94 will not work in trunk. And maybe
more importantly: Is there a lingering bug in 0.94?

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Attachments: 10015-0.94-new-sample.txt, 10015-0.94-v2.txt,
10015-0.94-v3.txt, 10015-0.94-v4.txt, 10015-0.94-withtest.txt,
10015-0.94.txt, 10015-trunk-v2.txt, 10015-trunk-v3.txt, 10015-trunk-v4.txt,
10015-trunk-v4.txt, 10015-trunk-v4.txt, 10015-trunk.txt, TestLoad.java

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13831191#comment-13831191
]

Lars Hofhansl commented on HBASE-10015:
---

Continuing my monologue here... :)

To my surprise I get a 25-30% perf improvement when I just replace intrinsic
locking (synchronized) with ReentrantLock - again for very tall tables only.
synchronized with biased locking should outperform the ReentrantLock in
uncontended cases, but it does not (all my tests are against a real
RegionServer, so the initial delay for BiasedLocking has not effect here).

Something is going on.

This is on JDK7 on old'ish 2 core machine. Will try on some other machines as
well. If this bears out on other machines as well it would be safe and quick
win.

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13831202#comment-13831202
]

Lars Hofhansl commented on HBASE-10015:
---

Verified on different machine architecture with JDK6 (using the attached
unittest).
4400ms vs 3200ms (JDK7, 2 core machine)
2600mx vs 2000ms (JDK6, 12 core machine)

Again, if somebody is still reading and could verify the latest patch with the
earlier unittest on their hardware that would be greatly appreciated.

Major performance improvement: Avoid synchronization in StoreScanner

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13830719#comment-13830719
]

Lars Hofhansl commented on HBASE-10015:
---

Something like this.

On order for the compaction to make progress we could assign scanners to
epochs, where everytime we change the readers for a store we go to a new epoch.
If all scanners for an epoch are either done or have switched to the new epoch,
we can retire the readers of that epoch.

In any case, just commit this change and keep working on it? As Vladimir points
out there are other issues with more serious performance implications.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Fix For: 0.98.0, 0.96.1, 0.94.15

Attachments: 10015-0.94-v2.txt, 10015-0.94-v3.txt, 10015-0.94-v4.txt,
10015-0.94-withtest.txt, 10015-0.94.txt, 10015-trunk-v2.txt,
10015-trunk-v3.txt, 10015-trunk-v4.txt, 10015-trunk-v4.txt,
10015-trunk-v4.txt, 10015-trunk.txt, TestLoad.java

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

2013-11-23 Thread Andrew Purtell (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13830735#comment-13830735
]

Andrew Purtell commented on HBASE-10015:

bq. On order for the compaction to make progress we could assign scanners to
epochs, where everytime we change the readers for a store we go to a new epoch.
If all scanners for an epoch are either done or have switched to the new epoch,
we can retire the readers of that epoch.

I haven't thought about this nearly as much as you have recently but that
sounds promising.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Fix For: 0.98.0, 0.96.1, 0.94.15

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

2013-11-23 Thread stack (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13830739#comment-13830739
]

stack commented on HBASE-10015:
---

+1 on committing what is done already.

epoch sounds like refcounting? Yeah, no hurry deleting the old stuff as long
as it is done eventually. Accounting would be easier if we could move/rename
files under the scanner as long is it does not disrupt (maybe I can try this).

I like your idea of lockless scanning. Would be good to put it up as a goal
even if hard to attain, if only to orientate which way progress lies.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Fix For: 0.98.0, 0.96.1, 0.94.15

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13830786#comment-13830786
]

Lars Hofhansl commented on HBASE-10015:
---

Epoch would be like reference counting per distinct sweet of readers. With just
a reference count of scanners I'd be worried that compaction would never make
any progress.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Fix For: 0.98.0, 0.96.1, 0.94.15

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner

[
https://issues.apache.org/jira/browse/HBASE-10015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13830849#comment-13830849
]

Lars Hofhansl commented on HBASE-10015:
---

Actually this is not quite right in 0.94, because it does not have HBASE-6499.
(seek does not call checkReseek). I'll make that change as well, also
checkUpdatedReaders can be folded into checkReseek for better readability.

Major performance improvement: Avoid synchronization in StoreScanner

Key: HBASE-10015
URL: https://issues.apache.org/jira/browse/HBASE-10015
Project: HBase
Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
Fix For: 0.98.0, 0.96.1, 0.94.15

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-10015) Major performance improvement: Avoid synchronization in StoreScanner