[ 
https://issues.apache.org/jira/browse/PHOENIX-3224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15450454#comment-15450454
 ] 

James Taylor commented on PHOENIX-3224:
---------------------------------------

Would be good to get more info on bigger joins. Using the USE_SORT_MERGE_JOIN 
hint, you should be able to do larger joins and perf would be equivalent to 
doing a sort on each of the tables being joined. Can we try some smaller tests 
first with this hint to make sure there's not something else going on? Please 
let us know if we're missing anything, [~maryannxue].

> Observations from large scale testing.
> --------------------------------------
>
>                 Key: PHOENIX-3224
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-3224
>             Project: Phoenix
>          Issue Type: Task
>            Reporter: Lars Hofhansl
>
> We have a >1000 node physical cluster at our disposal for a short time, 
> before it'll be handed off to its intended use.
> Loaded a bunch of data (TPCs LINEITEM table, among others) and ran a bunch of 
> queries. Most tables are between 100G and 500G (uncompressed) and between 
> 600m and 2bn rows.
> The good news is that many things just worked. We sorted > 400G is < 5s with 
> HBase and Phoenix. Scans work. Joins work (as long as one side is kept under 
> 1m rows or so).
> For the issues we observers I'll file sub jiras under this.
> I'm going to write a lob post about this and attach a link here.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to