[
https://issues.apache.org/jira/browse/HBASE-576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12637353#action_12637353
]
stack commented on HBASE-576:
-----------------------------
Thanks J-D. Patch looks good. Pity couldn't be fixed better but yeah, would
need migration script. As is will save a bunch of churn. Let me commit it.
Looking at rpc, I see I broke it a while back; I removed the very reason we
subclass RPC. I replaced all our carefully planted HbaseObjectWritables with
default ObjectWritables. Means we're sending Strings instead of codes for our
parameter names.
So, did a test where a cluster had 1M rows loaded into 11 regions spread over 3
machines. A single client could random-read at ~482/second. Using above patch
and running with 8 threads, was able to read at 1531/second. Basic formula:
throughput can be multiplied by # of threads up to maximum of number of cluster
members: e.g. if 8 threads but only 3 servers, can only see 3X throughput
improvement. If 8 servers hosting regions, should see 8X.
> Investigate IPC performance
> ---------------------------
>
> Key: HBASE-576
> URL: https://issues.apache.org/jira/browse/HBASE-576
> Project: Hadoop HBase
> Issue Type: Improvement
> Components: ipc
> Affects Versions: 0.1.0, 0.1.1, 0.1.2, 0.2.0
> Reporter: Jim Kellerman
> Assignee: stack
> Attachments: htd.patch, pe.patch
>
>
> Turning off all file I/O, and running the PerformanceEvaluation test, of
> 1,048,576 sequential writes to HBase managed to achieve only 7,285 IPCs per
> second.
> Running PerformanceEvaluation sequential write test modified to do an abort
> instead of a commit, it was possible to do 68,337 operations per second. We
> are obviously spending a lot of time doing IPCs.
> We need to investigate to find the bottleneck. Marshalling and unmarshalling?
> Socket setup and teardown?
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.