GitHub user paul-rogers opened a pull request:
https://github.com/apache/drill/pull/887
DRILL-5688: Add repeated map support to column accessors
Restructures the existing "column accessor" code to adopt a JSON-like
structure that works for all of Drill's data types. This PR focused
here is the jira issue link:
https://issues.apache.org/jira/browse/DRILL-5691
On Thu, Jul 27, 2017 at 8:32 AM, weijie tong
wrote:
> another tips ,the coun distinct query examples mentioned above are all
> transferred by my rule,that is from Aggregate,Aggregate,Project,Scan to
> Project,Scan.
weijie.tong created DRILL-5691:
--
Summary: multiple count distinct query planning error at physical
phase
Key: DRILL-5691
URL: https://issues.apache.org/jira/browse/DRILL-5691
Project: Apache Drill
Paul Rogers created DRILL-5690:
--
Summary: RepeatedDecimal18Vector does not pass scale, precision to
data vector
Key: DRILL-5690
URL: https://issues.apache.org/jira/browse/DRILL-5690
Project: Apache Drill
another tips ,the coun distinct query examples mentioned above are all
transferred by my rule,that is from Aggregate,Aggregate,Project,Scan to
Project,Scan.Jira will be logged soon later as I am on road.
On Thu, 27 Jul 2017 at 8:17 AM Julian Hyde wrote:
> I think the “multiple distinct count
Github user paul-rogers commented on a diff in the pull request:
https://github.com/apache/drill/pull/886#discussion_r129728610
--- Diff: _docs/performance-tuning/010-performance-tuning-introduction.md
---
@@ -3,9 +3,9 @@ title: "Performance Tuning Introduction"
date:
p
Github user paul-rogers commented on a diff in the pull request:
https://github.com/apache/drill/pull/886#discussion_r129728474
--- Diff: _docs/performance-tuning/010-performance-tuning-introduction.md
---
@@ -3,9 +3,9 @@ title: "Performance Tuning Introduction"
date:
p
I think the “multiple distinct count” code path also gets triggered if there’s
a mixture of distinct and non-distinct aggregates.
> On Jul 26, 2017, at 5:16 PM, Aman Sinha wrote:
>
> It sounds like you have narrowed it down further and it could be an issue
> with the scalar check when 2 or more
It sounds like you have narrowed it down further and it could be an issue
with the scalar check when 2 or more cartesian joins are present.
Please file a JIRA with relevant details.
On Wed, Jul 26, 2017 at 3:13 PM, weijie tong
wrote:
> Thanks for pointing out the possible reasons @Aman @Julian .
+1 (non-binding)
Tried in embedded mode on my mac. Ran some queries.
Downloaded and built on CentOS VM.
Installed the build on the 4 node cluster. Ran some queries on parquet files.
Thanks,
Padma
> On Jul 26, 2017, at 2:54 PM, Kunal Khatua wrote:
>
> +1 (non-binding)
>
> Downloaded and b
Thanks for pointing out the possible reasons @Aman @Julian . I am not sure
that's the real problem .
As I initially mentioned,sorry for not clearly described ,queries like
"select count(*),sum(a),count(distinct b) from t where dt=xxx " will be
right.but "select count(*),sum(a),count (distinct b),c
+1 (non-binding)
Downloaded and built in CentOS. Tried a handful of queries and it worked fine.
Tried repro for DRILL-5420 (CPU churn) and no issues. Ran a few 100 query
cancellations and limit queries too, but didn't see any issues. (guessing this
resolves DRILL-5435 and DRILL-5609 as well).
Downloaded src and built with unit test on CentOS 6.4.
Deploy to a 2 nodes cluster.
Run a couple of tpcds queries over scale factor 1 parquet data.
Tried query cancel while running tpcds queries.
Run yelp tutorial queries over json data.
Verified maven artifacts with a simple Drill application.
Al
[
https://issues.apache.org/jira/browse/DRILL-5160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kunal Khatua resolved DRILL-5160.
-
Resolution: Fixed
Assignee: Kunal Khatua (was: Parth Chandra)
Reviewer: Parth
Yes, the RelMdMaxRowCount statistic would be useful for this. Thanks for
pointing that out. I'll see if we can leverage that. The grouping sets
is not yet supported in Drill, but that would be a better solution since it
avoids extra scans.
-Aman
On Wed, Jul 26, 2017 at 11:25 AM, Julian Hyde
GitHub user saurabhmahapatra94 opened a pull request:
https://github.com/apache/drill/pull/886
Update 010-performance-tuning-introduction.md
This is my first edit to the documentation
You can merge this pull request into a Git repository by running:
$ git pull https://github.co
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129659864
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java
---
@@ -424,32 +432,121 @@ public void testMoveCach
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129655324
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java
---
@@ -0,0 +1,97 @@
+/*
+ * Licensed to
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129562100
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java
---
@@ -0,0 +1,97 @@
+/*
+ * Licensed to
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129653217
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java
---
@@ -0,0 +1,97 @@
+/*
+ * Licensed to
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129654257
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to the Apach
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129653855
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to the Apach
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129545551
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to the Apach
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129654624
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to the Apach
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129545316
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -132,25 +134,64 @@ public static ParquetTableMetadata_v3
ge
Aman,
Thanks for moving dev@calcite to Bcc. This is properly a Drill question.
A blanket restriction on cartesian joins is a blunt instrument. Sometimes
cartesian joins are valid, safe, and the best plan for a query. This is a case
in point. Users shouldn’t have to change config parameters to g
[
https://issues.apache.org/jira/browse/DRILL-4685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Roman resolved DRILL-4685.
--
Resolution: Cannot Reproduce
> RpcException: Data not accepted downstream
>
[Since this is Drill specific, I put dev@calcite on BCC].
If you have two aggregates: Count(distinct a), Count(distinct b), the
Calcite logical plan consists of a cartesian join of 2 subqueries each of
which first does a group-by on the distinct column followed by a count
aggregate. By default,
HI all:
I materialize the count distinct query result to a cache, then when user
query the count distinct , a specific rule will translate the query to the
cache. It turns out right when the query has only one count (distinct )
operator ,but when it has two count (distinct ) ,it causes error .Th
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129539105
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java
---
@@ -424,32 +432,121 @@ public void test
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129528313
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to t
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129530761
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -132,25 +134,64 @@ public static ParquetTableMetadat
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129525648
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java
---
@@ -0,0 +1,97 @@
+/*
+ * Lice
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129535500
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to t
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129529192
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to t
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129526081
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java
---
@@ -0,0 +1,97 @@
+/*
+ * Lice
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129525442
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataVersion.java
---
@@ -0,0 +1,97 @@
+/*
+ * Lice
Github user arina-ielchiieva commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129526936
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/MetadataVersions.java
---
@@ -0,0 +1,115 @@
+/*
+ * Licensed to t
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129485543
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java
---
@@ -38,24 +40,31 @@
private static final
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129486188
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -1851,9 +1922,73 @@ private static String relativize(String
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129409026
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -132,23 +135,57 @@ public static ParquetTableMetadata_v3
ge
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129104084
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -132,23 +135,57 @@ public static ParquetTableMetadata_v3
ge
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129103339
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/dfs/MetadataContext.java
---
@@ -41,6 +41,10 @@
private PruneStatus pruneSta
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129486097
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -1851,9 +1922,73 @@ private static String relativize(String
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129485798
--- Diff: exec/java-exec/src/test/java/org/apache/drill/BaseTestQuery.java
---
@@ -639,14 +644,18 @@ protected static void
copyDirectoryIntoTempSpace(String
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129485867
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -1851,9 +1922,73 @@ private static String relativize(String
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129103929
--- Diff:
exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/Metadata.java
---
@@ -132,23 +135,57 @@ public static ParquetTableMetadata_v3
ge
Github user vdiravka commented on a diff in the pull request:
https://github.com/apache/drill/pull/877#discussion_r129485531
--- Diff:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java
---
@@ -38,24 +40,31 @@
private static final
48 matches
Mail list logo