[GitHub] drill pull request #887: DRILL-5688: Add repeated map support to column acce...

2017-07-26 Thread paul-rogers
GitHub user paul-rogers opened a pull request: https://github.com/apache/drill/pull/887 DRILL-5688: Add repeated map support to column accessors Restructures the existing "column accessor" code to adopt a JSON-like structure that works for all of Drill's data types. This PR focused

[jira] [Created] (DRILL-5691) multiple count distinct query planning error at physical phase

2017-07-26 Thread weijie.tong (JIRA)
weijie.tong created DRILL-5691: -- Summary: multiple count distinct query planning error at physical phase Key: DRILL-5691 URL: https://issues.apache.org/jira/browse/DRILL-5691 Project: Apache Drill

[jira] [Created] (DRILL-5690) RepeatedDecimal18Vector does not pass scale, precision to data vector

2017-07-26 Thread Paul Rogers (JIRA)
Paul Rogers created DRILL-5690: -- Summary: RepeatedDecimal18Vector does not pass scale, precision to data vector Key: DRILL-5690 URL: https://issues.apache.org/jira/browse/DRILL-5690 Project: Apache

Re: Drill query planning error

2017-07-26 Thread weijie tong
another tips ,the coun distinct query examples mentioned above are all transferred by my rule,that is from Aggregate,Aggregate,Project,Scan to Project,Scan.Jira will be logged soon later as I am on road. On Thu, 27 Jul 2017 at 8:17 AM Julian Hyde wrote: > I think the

[GitHub] drill pull request #886: Update 010-performance-tuning-introduction.md

2017-07-26 Thread paul-rogers
Github user paul-rogers commented on a diff in the pull request: https://github.com/apache/drill/pull/886#discussion_r129728610 --- Diff: _docs/performance-tuning/010-performance-tuning-introduction.md --- @@ -3,9 +3,9 @@ title: "Performance Tuning Introduction" date:

[GitHub] drill pull request #886: Update 010-performance-tuning-introduction.md

2017-07-26 Thread paul-rogers
Github user paul-rogers commented on a diff in the pull request: https://github.com/apache/drill/pull/886#discussion_r129728474 --- Diff: _docs/performance-tuning/010-performance-tuning-introduction.md --- @@ -3,9 +3,9 @@ title: "Performance Tuning Introduction" date:

Re: Drill query planning error

2017-07-26 Thread Julian Hyde
I think the “multiple distinct count” code path also gets triggered if there’s a mixture of distinct and non-distinct aggregates. > On Jul 26, 2017, at 5:16 PM, Aman Sinha wrote: > > It sounds like you have narrowed it down further and it could be an issue > with the

Re: Drill query planning error

2017-07-26 Thread Aman Sinha
It sounds like you have narrowed it down further and it could be an issue with the scalar check when 2 or more cartesian joins are present. Please file a JIRA with relevant details. On Wed, Jul 26, 2017 at 3:13 PM, weijie tong wrote: > Thanks for pointing out the

Re: [VOTE] Release Apache Drill 1.11.0 - rc0

2017-07-26 Thread Padma Penumarthy
+1 (non-binding) Tried in embedded mode on my mac. Ran some queries. Downloaded and built on CentOS VM. Installed the build on the 4 node cluster. Ran some queries on parquet files. Thanks, Padma > On Jul 26, 2017, at 2:54 PM, Kunal Khatua wrote: > > +1 (non-binding) >

Re: Drill query planning error

2017-07-26 Thread weijie tong
Thanks for pointing out the possible reasons @Aman @Julian . I am not sure that's the real problem . As I initially mentioned,sorry for not clearly described ,queries like "select count(*),sum(a),count(distinct b) from t where dt=xxx " will be right.but "select count(*),sum(a),count (distinct

RE: [VOTE] Release Apache Drill 1.11.0 - rc0

2017-07-26 Thread Kunal Khatua
+1 (non-binding) Downloaded and built in CentOS. Tried a handful of queries and it worked fine. Tried repro for DRILL-5420 (CPU churn) and no issues. Ran a few 100 query cancellations and limit queries too, but didn't see any issues. (guessing this resolves DRILL-5435 and DRILL-5609 as well).

Re: [VOTE] Release Apache Drill 1.11.0 - rc0

2017-07-26 Thread Jinfeng Ni
Downloaded src and built with unit test on CentOS 6.4. Deploy to a 2 nodes cluster. Run a couple of tpcds queries over scale factor 1 parquet data. Tried query cancel while running tpcds queries. Run yelp tutorial queries over json data. Verified maven artifacts with a simple Drill application.

[jira] [Resolved] (DRILL-5160) Memory leak in Parquet async reader when Snappy fails

2017-07-26 Thread Kunal Khatua (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kunal Khatua resolved DRILL-5160. - Resolution: Fixed Assignee: Kunal Khatua (was: Parth Chandra) Reviewer:

Re: Drill query planning error

2017-07-26 Thread Aman Sinha
Yes, the RelMdMaxRowCount statistic would be useful for this. Thanks for pointing that out. I'll see if we can leverage that. The grouping sets is not yet supported in Drill, but that would be a better solution since it avoids extra scans. -Aman On Wed, Jul 26, 2017 at 11:25 AM, Julian Hyde

[GitHub] drill pull request #886: Update 010-performance-tuning-introduction.md

2017-07-26 Thread saurabhmahapatra94
GitHub user saurabhmahapatra94 opened a pull request: https://github.com/apache/drill/pull/886 Update 010-performance-tuning-introduction.md This is my first edit to the documentation You can merge this pull request into a Git repository by running: $ git pull

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129659864 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java --- @@ -424,32 +432,121 @@ public void

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129655324 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java --- @@ -0,0 +1,97 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129562100 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java --- @@ -0,0 +1,97 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129653217 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java --- @@ -0,0 +1,97 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129654257 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to the

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129653855 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to the

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129545551 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to the

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129654624 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to the

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129545316 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -132,25 +134,64 @@ public static ParquetTableMetadata_v3

Re: Drill query planning error

2017-07-26 Thread Julian Hyde
Aman, Thanks for moving dev@calcite to Bcc. This is properly a Drill question. A blanket restriction on cartesian joins is a blunt instrument. Sometimes cartesian joins are valid, safe, and the best plan for a query. This is a case in point. Users shouldn’t have to change config parameters to

[jira] [Resolved] (DRILL-4685) RpcException: Data not accepted downstream

2017-07-26 Thread Roman (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Roman resolved DRILL-4685. -- Resolution: Cannot Reproduce > RpcException: Data not accepted downstream >

Re: Drill query planning error

2017-07-26 Thread Aman Sinha
[Since this is Drill specific, I put dev@calcite on BCC]. If you have two aggregates: Count(distinct a), Count(distinct b), the Calcite logical plan consists of a cartesian join of 2 subqueries each of which first does a group-by on the distinct column followed by a count aggregate. By default,

Drill query planning error

2017-07-26 Thread weijie tong
HI all: I materialize the count distinct query result to a cache, then when user query the count distinct , a specific rule will translate the query to the cache. It turns out right when the query has only one count (distinct ) operator ,but when it has two count (distinct ) ,it causes error

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129539105 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java --- @@ -424,32 +432,121 @@ public void

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129528313 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129530761 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -132,25 +134,64 @@ public static

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129525648 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java --- @@ -0,0 +1,97 @@ +/* + *

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129535500 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129529192 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129526081 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java --- @@ -0,0 +1,97 @@ +/* + *

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread arina-ielchiieva
Github user arina-ielchiieva commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129526936 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java --- @@ -0,0 +1,115 @@ +/* + * Licensed to

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129485543 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java --- @@ -38,24 +40,31 @@ private static

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129486188 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -1851,9 +1922,73 @@ private static String relativize(String

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129409026 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -132,23 +135,57 @@ public static ParquetTableMetadata_v3

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129104084 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -132,23 +135,57 @@ public static ParquetTableMetadata_v3

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129103339 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/dfs/MetadataContext.java --- @@ -41,6 +41,10 @@ private PruneStatus

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129486097 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -1851,9 +1922,73 @@ private static String relativize(String

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129485798 --- Diff: exec/java-exec/src/test/java/org/apache/drill/BaseTestQuery.java --- @@ -639,14 +644,18 @@ protected static void

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129485867 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -1851,9 +1922,73 @@ private static String relativize(String

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129103929 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java --- @@ -132,23 +135,57 @@ public static ParquetTableMetadata_v3

[GitHub] drill pull request #877: DRILL-5660: Drill 1.10 queries fail due to Parquet ...

2017-07-26 Thread vdiravka
Github user vdiravka commented on a diff in the pull request: https://github.com/apache/drill/pull/877#discussion_r129485531 --- Diff: exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java --- @@ -38,24 +40,31 @@ private static

Re: [HANGOUT] Topics for 7/25/17

2017-07-26 Thread yuliya Feldman
Sorry for the late chime in.Just a note - regarding s3 - even after upgrade to hadoop 2.8.x you may need to separately update versions of aws, as one provided with the upgrade is not supporting all the newly added regions. Thanks,Yuliya From: Arina Yelchiyeva