Re: Calcite: Trait propagation using relset iteration versus remove extraneous trait creation

2016-03-14 Thread Aman Sinha
I think the goal should be to achieve the trait propagation without relying on the add followed by remove strategy. Consider a simple query with 2 table join followed by a group-by. If I want to use merge join and streaming aggregate, there will be 1 pair of {hash-distribute, sort} added on bot

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Aditya
> > However, if the field x is sometimes 'x' and sometimes 'X', we're going to > get different results between the first query and the second. That is why I > think we need to guarantee that even when optimization rules fail, we have > the same plan meaning. In essence, all plans should be valid. If y

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Jacques Nadeau
I believe it also suffers from the same issues. -- Jacques Nadeau CTO and Co-Founder, Dremio On Mon, Mar 14, 2016 at 4:29 PM, Neeraja Rentachintala < nrentachint...@maprtech.com> wrote: > How is this handled for the MongoDB storage plugin, which I believe is a case > sensitive DB as well? > > On Mon, M

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Neeraja Rentachintala
How is this handled for the MongoDB storage plugin, which I believe is a case-sensitive DB as well? On Mon, Mar 14, 2016 at 4:27 PM, Jacques Nadeau wrote: > I don't think it is that simple since there are some types of things that > we can't pushdown that will cause inconsistent results. > > For exampl

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Jacques Nadeau
I don't think it is that simple since there are some types of things that we can't pushdown that will cause inconsistent results. For example, assuming that all values of x are positive, the following two queries should return the same result select * from hbase where x = 5 select * from hbase wh

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Jinfeng Ni
+1 (binding) - Download src tgz and do a full maven build on CentOS - Run yelp tutorial queries. - Verify query profiles on Web-UI - Run a couple of partition-pruning-related queries. All look good. Jinfeng On Mon, Mar 14, 2016 at 2:48 PM, Jacques Nadeau wrote: > +1 (binding) > > - Download src

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Jinfeng Ni
Project pushdown should always happen. If you see that project pushdown does not happen for your HBase query, then it's a bug. However, if you submit two physical plans, one with project pushdown and one without, and they return different results for an HBase query, I'll not call thi

[jira] [Resolved] (DRILL-4050) Add zip archives to the list of artifacts in verify_release.sh

2016-03-14 Thread Aditya Kishore (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aditya Kishore resolved DRILL-4050. --- Resolution: Fixed Fix Version/s: 1.7.0 This has been merged into master. > Add zip arc

[GitHub] drill pull request: DRILL-4050: Add zip archives to the list of ar...

2016-03-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/drill/pull/249 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabl

Calcite: Trait propagation using relset iteration versus remove extraneous trait creation

2016-03-14 Thread Jacques Nadeau
Hey All, I've been thinking about the SubsetTransformer pattern [1] that we use in Drill to ensure trait propagation. It was discussed here in Calcite [2]. Julian felt that the correct solution (and the patch he ultimately applied) was to use a create-and-then-remove behavior. Take a look at his

[GitHub] drill pull request: DRILL-4050: Add zip archives to the list of ar...

2016-03-14 Thread adityakishore
Github user adityakishore commented on the pull request: https://github.com/apache/drill/pull/249#issuecomment-196542189 This enhanced version of the script allows integrated download and verification of a Drill release. It can be used to verify both the main release artifacts and mav

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Jacques Nadeau
Agree with Zelaine, plan changes/optimizations shouldn't change results. This is a bug. Drill is focused on being case-insensitive, case-preserving. Each storage plugin implements its own case sensitivity policy when working with columns/fields, and that policy should be documented. It isn't practical to make

[GitHub] drill pull request: DRILL-4504: Create an event loop for each of [...

2016-03-14 Thread jacques-n
Github user jacques-n commented on a diff in the pull request: https://github.com/apache/drill/pull/429#discussion_r56082440 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/client/DrillClient.java --- @@ -74,73 +74,148 @@ /** * Thin wrapper around a UserClie

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Jacques Nadeau
+1 (binding) - Download src tgz and build and test - Download binary tgz, test execution of a number of queries and verify profiles - Enable socket level logging and confirm new planning phase + time logging -- Jacques Nadeau CTO and Co-Founder, Dremio On Mon, Mar 14, 2016 at 1:45 PM, Chun Ch

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Jinfeng Ni
Abhishek, great question. Here is what I understand regarding the case sensitivity policy. Drill's case sensitivity policy (case insensitive and case preserving) applies to the execution engine in Drill; it does not enforce the case sensitivity policy on all storage plugins. A storage plugin coul
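
The difference between the two policies described in this message can be illustrated with a small sketch. It is only an illustration, not Drill or HBase code: the class name CaseSensitivitySketch, both helper methods, and the column names are hypothetical. It contrasts a case-insensitive, case-preserving lookup (the engine-side policy described above) with an exact-match lookup (how a case-sensitive source would resolve the same name).

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; none of these names come from Drill.
public class CaseSensitivitySketch {

    // Case-insensitive, case-preserving: "x" and "X" are the same key,
    // and the spelling of the first column seen is what gets returned.
    static String resolveInsensitive(List<String> storedNames, String requested) {
        Map<String, String> byName = new TreeMap<>(String.CASE_INSENSITIVE_ORDER);
        for (String name : storedNames) {
            byName.putIfAbsent(name, name);
        }
        return byName.get(requested); // null when no column matches
    }

    // Case-sensitive: only an exact spelling matches.
    static String resolveSensitive(List<String> storedNames, String requested) {
        for (String name : storedNames) {
            if (name.equals(requested)) {
                return name;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        // A source that stores both spellings as distinct columns.
        List<String> columns = Arrays.asList("x", "X", "Name");

        System.out.println(resolveInsensitive(columns, "X")); // resolves to the first spelling stored
        System.out.println(resolveSensitive(columns, "X"));   // resolves to the exact spelling
        System.out.println(resolveSensitive(columns, "name")); // no exact match
    }
}
```

When the same column reference is resolved by one policy in one plan and by the other policy in another plan, the two plans can read different columns, which is the kind of inconsistency this thread is concerned with.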

[GitHub] drill pull request: DRILL-4504: Create an event loop for each of [...

2016-03-14 Thread sudheeshkatkam
Github user sudheeshkatkam commented on a diff in the pull request: https://github.com/apache/drill/pull/429#discussion_r56080621 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/client/DrillClient.java --- @@ -74,73 +74,148 @@ /** * Thin wrapper around a Use

Re: Working with Case-Sensitive Data-sources

2016-03-14 Thread Zelaine Fong
Abhishek, I guess you're arguing that Drill's current behavior of honoring the case sensitive nature of the underlying data source (in this case, HBase and MapR-DB) will be confusing for Drill users who are accustomed to Drill's case insensitive behavior. I can see arguments both ways. But the

[GitHub] drill pull request: DRILL-4479: Use varchar for default column whe...

2016-03-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/drill/pull/420

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Chun Chang
+1 (non-binding) -ran functional and advanced automation On Mon, Mar 14, 2016 at 1:09 PM, Sudheesh Katkam wrote: > +1 (non-binding) > > * downloaded and built from source tar-ball; ran unit tests successfully > on Ubuntu > * ran simple queries (including cancellations) in embedded mode on Mac;

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Sudheesh Katkam
+1 (non-binding) * downloaded and built from source tar-ball; ran unit tests successfully on Ubuntu * ran simple queries (including cancellations) in embedded mode on Mac; verified states in web UI * ran simple queries (including cancellations) on a 3 node cluster; verified states in web UI *

[GitHub] drill pull request: DRILL-4504: Create an event loop for each of [...

2016-03-14 Thread hnfgns
Github user hnfgns commented on a diff in the pull request: https://github.com/apache/drill/pull/429#discussion_r56060382 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/client/DrillClient.java --- @@ -74,73 +74,148 @@ /** * Thin wrapper around a UserClient

[GitHub] drill pull request: DRILL-4504: Create an event loop for each of [...

2016-03-14 Thread hnfgns
Github user hnfgns commented on a diff in the pull request: https://github.com/apache/drill/pull/429#discussion_r56058208 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/client/DrillClient.java --- @@ -74,73 +74,148 @@ /** * Thin wrapper around a UserClient

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Abhishek Girish
+1 (non-binding) - Tested Drill in distributed mode (built with MapR profile). - Ran functional tests from Drill-Test-Framework [1] - Tested Web UI (basic sanity) - Tested Sqlline Looks good. [1] https://github.com/mapr/drill-test-framework On Mon, Mar 14, 2016 at 11:23 AM, Venki Korukanti wr

[jira] [Created] (DRILL-4510) IllegalStateException: Failure while reading vector. Expected vector class of org.apache.drill.exec.vector.NullableIntVector but was holding vector class org.apache.dril

2016-03-14 Thread Chun Chang (JIRA)
Chun Chang created DRILL-4510: - Summary: IllegalStateException: Failure while reading vector. Expected vector class of org.apache.drill.exec.vector.NullableIntVector but was holding vector class org.apache.drill.exec.vector.NullableVarCharVector

[GitHub] drill pull request: DRILL-4479: Use varchar for default column whe...

2016-03-14 Thread hnfgns
Github user hnfgns commented on the pull request: https://github.com/apache/drill/pull/420#issuecomment-196464641 +1

[jira] [Resolved] (DRILL-4490) Count(*) function returns as optional instead of required

2016-03-14 Thread Jinfeng Ni (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jinfeng Ni resolved DRILL-4490. --- Resolution: Fixed Fix Version/s: 1.6.0 Fixed in commit: 46e3de790da8f9c6d2d18e7e40fd37c01b3b168

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Venki Korukanti
+1 Installed tar.gz on a 3 node cluster. Ran queries on data located in HDFS. Enabled auth in WebUI, ran a few queries, and verified that auth and querying work fine. Logged bugs for 2 minor issues/improvements (DRILL-4508 & DRILL-4509

[jira] [Created] (DRILL-4509) Ignore unknown storage plugin configs while starting Drillbit

2016-03-14 Thread Venki Korukanti (JIRA)
Venki Korukanti created DRILL-4509: -- Summary: Ignore unknown storage plugin configs while starting Drillbit Key: DRILL-4509 URL: https://issues.apache.org/jira/browse/DRILL-4509 Project: Apache Drill

RE: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Norris Lee
+1 (Non-binding) Built from source on CentOS. Tested the ODBC driver with queries against hive and DFS (json, parquet, tsv, csv, directories). Norris -Original Message- From: Hsuan Yi Chu [mailto:hyi...@maprtech.com] Sent: Monday, March 14, 2016 10:42 AM To: dev@drill.apache.org; adity

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Hsuan Yi Chu
+1 mvn clean install on linux vm; Tried some queries; Looks good. On Mon, Mar 14, 2016 at 9:58 AM, Aditya wrote: > While I did verify the signature and structure of the maven artifacts, I > think Jacques was referring to verifying the functionality, which I have not. > > On Mon, Mar 14, 2016 at 8:1

[jira] [Created] (DRILL-4508) Null proof all AutoCloseable.close() methods

2016-03-14 Thread Venki Korukanti (JIRA)
Venki Korukanti created DRILL-4508: -- Summary: Null proof all AutoCloseable.close() methods Key: DRILL-4508 URL: https://issues.apache.org/jira/browse/DRILL-4508 Project: Apache Drill Issue T

[jira] [Resolved] (DRILL-4474) Inconsistent behavior while using COUNT in select (Apache drill 1.2.0)

2016-03-14 Thread Jinfeng Ni (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jinfeng Ni resolved DRILL-4474. --- Resolution: Fixed Fix Version/s: 1.6.0 Fixed in commit: 49ae6d363efe78df4e89f7913d1d560e9627b32

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Aditya
While I did verify the signature and structure of the maven artifacts, I think Jacques was referring to verifying the functionality, which I have not. On Mon, Mar 14, 2016 at 8:12 AM, Parth Chandra wrote: > Aditya has verified the maven artifacts. Would it make sense to extend the > vote by another

Re: [VOTE] Extended - Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Parth Chandra
Hello everyone, I'm extending the deadline to verify the release candidate. While there are enough votes to pass rc0, I'm inclined to allow additional time to let the broader community get a chance. Voting will now end at *8:10 am PDT, March 15th, 2016*. Thanks Parth On Mon, Mar 14, 2016 at

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Parth Chandra
Aditya has verified the maven artifacts. Would it make sense to extend the vote by another day to let more people verify the release? On Mon, Mar 14, 2016 at 7:08 AM, Jacques Nadeau wrote: > I haven't had a chance to validate yet. Has anyone checked the maven > artifacts yet? > On Mar 14, 201

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Jacques Nadeau
I haven't had a chance to validate yet. Has anyone checked the maven artifacts yet? On Mar 14, 2016 6:37 AM, "Aditya" wrote: > +1 (binding). > > * Verified checksum and signature of all release artifacts in[1] and maven > artifacts in [2] and the artifacts are signed using Parth's public key (ID

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Aditya
+1 (binding). * Verified checksum and signature of all release artifacts in [1] and maven artifacts in [2] and the artifacts are signed using Parth's public key (ID 9BAA73B0). * Verified that build and tests pass using the source artifact. * Verified that Drill can be launched in embedded mode usin

[jira] [Created] (DRILL-4507) TO_TIMESTAMP does not generate TIMESTAMP data type in metadata

2016-03-14 Thread Ian Hellstrom (JIRA)
Ian Hellstrom created DRILL-4507: Summary: TO_TIMESTAMP does not generate TIMESTAMP data type in metadata Key: DRILL-4507 URL: https://issues.apache.org/jira/browse/DRILL-4507 Project: Apache Drill

Working with Case-Sensitive Data-sources

2016-03-14 Thread Abhishek Girish
Hello all, As I understand it, Drill by design is case-insensitive w.r.t. column names within a table or file [1]. While this provides great flexibility and works well with many data-sources, there are issues when working with case-sensitive data-sources such as HBase / MapR-DB. Consider the followi

Re: [VOTE] Release Apache Drill 1.6.0 - rc0

2016-03-14 Thread Abdel Hakim Deneche
+1 built from source with mapr profile and deployed on 2 nodes, then ran window functions from Drill's test framework. Also took a quick look at the WebUI. Everything looks fine. On Sun, Mar 13, 2016 at 5:53 PM, Parth Chandra wrote: > Added GPG key > > On Sat, Mar 12, 2016 at 6:48 PM, Aditya wr