[jira] [Commented] (PHOENIX-1751) Perform aggregations, sorting, etc, in the preScannerOpen instead of postScannerOpen

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429250#comment-15429250 ] James Taylor commented on PHOENIX-1751: --- Please review, [~samarthjain]. This shoul

[jira] [Commented] (PHOENIX-1751) Perform aggregations, sorting, etc, in the preScannerOpen instead of postScannerOpen

2016-08-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429242#comment-15429242 ] Hadoop QA commented on PHOENIX-1751: {color:red}-1 overall{color}. Here are the res

[jira] [Updated] (PHOENIX-1751) Perform aggregations, sorting, etc, in the preScannerOpen instead of postScannerOpen

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Taylor updated PHOENIX-1751: -- Attachment: PHOENIX-1751_v3.patch New patch that fixes the couple of test failures (which were

[jira] [Assigned] (PHOENIX-1751) Perform aggregations, sorting, etc, in the preScannerOpen instead of postScannerOpen

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Taylor reassigned PHOENIX-1751: - Assignee: James Taylor > Perform aggregations, sorting, etc, in the preScannerOpen inst

[jira] [Updated] (PHOENIX-1751) Perform aggregations, sorting, etc, in the preScannerOpen instead of postScannerOpen

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Taylor updated PHOENIX-1751: -- Summary: Perform aggregations, sorting, etc, in the preScannerOpen instead of postScannerOpen

[jira] [Comment Edited] (PHOENIX-3187) Support multi-byte characters for CHAR datatype

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-3187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429145#comment-15429145 ] James Taylor edited comment on PHOENIX-3187 at 8/20/16 1:34 AM: --

[jira] [Commented] (PHOENIX-3187) Support multi-byte characters for CHAR datatype

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-3187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429145#comment-15429145 ] James Taylor commented on PHOENIX-3187: --- Can you elaborate more on why you need mu

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread Nick Dimiduk
It's TPC-DS, not -H, but this is what I was using way back when to run perf tests over Phoenix and the query server while I was developing on it. The first project generates, loads the data via mapreduce and the second tool wraps up use of jmeter to run queries in parallel. https://github.com/ndim

[jira] [Commented] (PHOENIX-930) duplicated columns cause query exception and drop table exception

2016-08-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429132#comment-15429132 ] Hadoop QA commented on PHOENIX-930: --- {color:red}-1 overall{color}. Here are the result

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread Andrew Purtell
> Maybe there's such a test harness that already exists for TPC? TPC provides tooling but it's all proprietary. The generated data can be kept separately (Druid does it at least - http://druid.io/blog/2014/03/17/benchmarking-druid.html ​). I'd say there would be one time setup: generation of data

[jira] [Commented] (PHOENIX-930) duplicated columns cause query exception and drop table exception

2016-08-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429119#comment-15429119 ] Hadoop QA commented on PHOENIX-930: --- {color:red}-1 overall{color}. Here are the result

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread James Taylor
On Fri, Aug 19, 2016 at 3:01 PM, Andrew Purtell wrote: > > I have a long interest in 'canned' loadings. Interesting ones are hard to > > come by. If Phoenix ran any or a subset of TPCs, I'd like to try it. > > Likewise > > > But I don't want to be the first to try it. I am not a Phoenix expert. >

[jira] [Updated] (PHOENIX-3148) Reduce size of PTable so that more tables can be cached in the metada cache.

2016-08-19 Thread Thomas D'Silva (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-3148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas D'Silva updated PHOENIX-3148: Attachment: PHOENIX-3148-v2.patch Attached v2 patch which has missing files. > Reduce siz

[jira] [Updated] (PHOENIX-3148) Reduce size of PTable so that more tables can be cached in the metada cache.

2016-08-19 Thread Thomas D'Silva (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-3148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas D'Silva updated PHOENIX-3148: Attachment: PHOENIX-3148.patch [~jamestaylor] Can you please review? I was not able to ru

[jira] [Updated] (PHOENIX-930) duplicated columns cause query exception and drop table exception

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Taylor updated PHOENIX-930: - Attachment: PHOENIX-930_v5.patch Reattaching to get test run > duplicated columns cause query ex

[jira] [Resolved] (PHOENIX-3185) Error: ERROR 514 (42892): A duplicate column name was detected in the object definition or ALTER TABLE statement. columnName=TEST_TABLE.C1 (state=42892,code=514)

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Taylor resolved PHOENIX-3185. --- Resolution: Duplicate Duplicate of PHOENIX-930. > Error: ERROR 514 (42892): A duplicate col

[jira] [Updated] (PHOENIX-930) duplicated columns cause query exception and drop table exception

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Taylor updated PHOENIX-930: - Attachment: PHOENIX-930_v4.patch Thanks for the patches, [~yhxx511] and [~kalyanhadoop]. Please f

[jira] [Commented] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428993#comment-15428993 ] Hadoop QA commented on PHOENIX-808: --- {color:red}-1 overall{color}. Here are the result

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread larsh
Thanks John. How was the region server Java process' heap configured?(Mujtaba is out today, not sure he'll listen in before Monday) On our regular SKUs we configure the region servers with 31GB of heap (the machines have more RAM than that), but I am not sure about which test cluster we used fo

[jira] [Commented] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428949#comment-15428949 ] James Taylor commented on PHOENIX-808: -- +1. Looks good, [~samarthjain]. > Create sn

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread Andrew Purtell
> I have a long interest in 'canned' loadings. Interesting ones are hard to > come by. If Phoenix ran any or a subset of TPCs, I'd like to try it. Likewise > But I don't want to be the first to try it. I am not a Phoenix expert. Same here, I'd just email dev@phoenix with a report that TPC query

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread Stack
On Fri, Aug 19, 2016 at 1:19 PM, James Taylor wrote: > On Fri, Aug 19, 2016 at 11:37 AM, Stack wrote: > > > On Thu, Aug 18, 2016 at 5:54 PM, James Taylor > > wrote: > > > > > The data loaded fine for us. > > > > > > Mind describing what you did to get it to work and with what versions and > > c

[jira] [Updated] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread Samarth Jain (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Samarth Jain updated PHOENIX-808: - Attachment: PHOENIX-808_v2_master.patch Patch for master branch. > Create snapshot of system tab

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread John Leach
Sorry for the delay on this end… Each Region Server has 24 Gigs of Ram, 12 cores plus 12 virtual cores. Would you please provide appropriate configurations for an analytic and data load benchmark? I am hearing HBase 1.2 and latest Phoenix release. If you are using open source hbase, woul

[jira] [Commented] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428876#comment-15428876 ] Hadoop QA commented on PHOENIX-808: --- {color:red}-1 overall{color}. Here are the result

[jira] [Updated] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread Samarth Jain (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Samarth Jain updated PHOENIX-808: - Attachment: PHOENIX-808_v2_nowhitespacediff.patch PHOENIX-808_v2.patch Updated pa

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread larsh
I think Stack it trying to help and was just asking whether Mujtaba did something special to load the data (and perhaps how it took for us and on how many nodes we did that).(If it loaded fine for us and there was nothing special we had to do, I agree that there's no way (or need) to troubleshoo

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread James Taylor
On Fri, Aug 19, 2016 at 11:37 AM, Stack wrote: > On Thu, Aug 18, 2016 at 5:54 PM, James Taylor > wrote: > > > The data loaded fine for us. > > > Mind describing what you did to get it to work and with what versions and > configurations and with what TPC loading and how much of the workload was >

Re: Issues while Running Apache Phoenix against TPC-H data

2016-08-19 Thread Stack
On Thu, Aug 18, 2016 at 5:54 PM, James Taylor wrote: > The data loaded fine for us. Mind describing what you did to get it to work and with what versions and configurations and with what TPC loading and how much of the workload was supported? Was it a one-off project? > If TPC is not represe

[jira] [Commented] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428577#comment-15428577 ] James Taylor commented on PHOENIX-808: -- I suppose for 4.8.1, we probably shouldn't r

[jira] [Commented] (PHOENIX-808) Create snapshot of system tables prior to upgrade and restore on any failure

2016-08-19 Thread Samarth Jain (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428575#comment-15428575 ] Samarth Jain commented on PHOENIX-808: -- Thanks for the review, [~jamestaylor]. Shoul

Re: [ANNOUNCE] Apache Phoenix 4.8.0 released

2016-08-19 Thread James Taylor
This is good feedback, Afshin. Thanks for letting us know. I've updated the download page to provide a link to the new fixes/features. Would be great if this link could be dynamic (i.e. always point to the release notes from the last released version). Anyone know how to do this? I've also updated

Re: [DISCUSS] About dynamic column in spark connector

2016-08-19 Thread Josh Mahonin
Hi Xiaopeng, You're not the only one who's interested in phoenix-spark and dynamic columns! I responded to a similar question on the phoenix-users group recently: http://search-hadoop.com/m/9UY0h2bNa0FLjndj1&subj=Re+Phoenix+spark+and+dynamic+columns I'm not aware of any ongoing work for this feat

Re: [DISCUSS] About dynamic column in spark connector

2016-08-19 Thread Josh Elser
Hi Xiaopeng, Thanks for the message. Using the dev list for this type of discussion is great. I am not familiar with the implementation (and the "gotchas" that might exist in this improvement), but I would encourage you to try to reach out on that JIRA issue you mentioned. Maybe Josh M or Ra

Re: [ANNOUNCE] Apache Phoenix 4.8.0 released

2016-08-19 Thread Josh Elser
(-cc other lists) Hi Afshin, The release notes you referenced are more meant to alert users about any issues in the new release that you may run into over previous releases. "Release notes provide details on issues and their fixes which may have an impact on prior Phoenix behavior" - Josh

[jira] [Commented] (PHOENIX-2161) Can't change timeout

2016-08-19 Thread James Taylor (JIRA)
[ https://issues.apache.org/jira/browse/PHOENIX-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15428320#comment-15428320 ] James Taylor commented on PHOENIX-2161: --- You'd set phoenix.query.timeoutMs to caus