Re: [DISCUSS] Design Documents

2015-10-19 Thread Aman Sinha
I feel like there are 2 broad issues: - there should be a targeted group of people for a design review of a particular feature or enhancement. The target group should be based on expertise in a component. After all, this is what one would do in a physical in-person design review - we would invit

[jira] [Created] (DRILL-3957) Need help to find available BI Tool (open source) to connect to Apache Drill Data

2015-10-19 Thread ganesh semalty (JIRA)
ganesh semalty created DRILL-3957: - Summary: Need help to find available BI Tool (open source) to connect to Apache Drill Data Key: DRILL-3957 URL: https://issues.apache.org/jira/browse/DRILL-3957 Pro

[jira] [Created] (DRILL-3956) TEXT MySQL type unsupported

2015-10-19 Thread Andrew (JIRA)
Andrew created DRILL-3956: - Summary: TEXT MySQL type unsupported Key: DRILL-3956 URL: https://issues.apache.org/jira/browse/DRILL-3956 Project: Apache Drill Issue Type: Bug Components: Stor

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread StevenMPhillips
Github user StevenMPhillips commented on the pull request: https://github.com/apache/drill/pull/189#issuecomment-149391542 Yeah, I think that makes sense. I will go ahead and do that. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHu

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread jinfengni
Github user jinfengni commented on the pull request: https://github.com/apache/drill/pull/189#issuecomment-149390932 If the only bug you saw is in Scalar Replacement after turning on CSE for other operators, would it make sense to open a different JIRA to track the SR bug, and fix the

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread StevenMPhillips
Github user StevenMPhillips commented on a diff in the pull request: https://github.com/apache/drill/pull/189#discussion_r42445825 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/expr/EvaluationVisitor.java --- @@ -106,19 +163,32 @@ public HoldingContainer visitFunct

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread StevenMPhillips
Github user StevenMPhillips commented on a diff in the pull request: https://github.com/apache/drill/pull/189#discussion_r42445808 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/expr/EqualityVisitor.java --- @@ -0,0 +1,322 @@ +/** + * Licensed to the Apache So

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread StevenMPhillips
Github user StevenMPhillips commented on the pull request: https://github.com/apache/drill/pull/189#issuecomment-149390304 I actually have done some additional work on this branch to extend it to work with other operators. But I ran into the Scalar Replacement bug that was mentioned i

Re: List type

2015-10-19 Thread Ted Dunning
Julien, Riffing off other recent threads, another important case is List of {int OR List of int} On Mon, Oct 19, 2015 at 1:28 PM, Julien Le Dem wrote: > I'm looking at the type system in Drill and I have the following question: > Why is there a LIST type and a REPEATED field? > It sounds li

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread jinfengni
Github user jinfengni commented on a diff in the pull request: https://github.com/apache/drill/pull/189#discussion_r42444043 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/expr/EvaluationVisitor.java --- @@ -106,19 +163,32 @@ public HoldingContainer visitFunctionCal

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread jinfengni
Github user jinfengni commented on a diff in the pull request: https://github.com/apache/drill/pull/189#discussion_r42443753 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/expr/EqualityVisitor.java --- @@ -0,0 +1,322 @@ +/** + * Licensed to the Apache Software

[GitHub] drill pull request: DRILL-3912: Common subexpression elimination

2015-10-19 Thread jinfengni
Github user jinfengni commented on the pull request: https://github.com/apache/drill/pull/189#issuecomment-149385017 Seems to me this pull request would remove the common subexpression elimination(CSE) for Project and Filter only; it would not do CSE for other operators, like Join, Ag

Re: select from table with options

2015-10-19 Thread Ted Dunning
Wouldn't a table function be a better option? Something like this perhaps? select * from delimitedFile(dfs.`default`.`/path/to/file/something.psv`, '|') ? Or how about fake-o parameters that the delimited record scanner knows how to push down into the scanning of the data? That would look like

[jira] [Created] (DRILL-3955) Possible bug in creation of Drill columns for HBase column families

2015-10-19 Thread Daniel Barclay (Drill) (JIRA)
Daniel Barclay (Drill) created DRILL-3955: - Summary: Possible bug in creation of Drill columns for HBase column families Key: DRILL-3955 URL: https://issues.apache.org/jira/browse/DRILL-3955 P

[jira] [Created] (DRILL-3954) HBase tests use only 1 region, don't detect bug(s) in dummy-column NullableIntVector creation/resolution

2015-10-19 Thread Daniel Barclay (Drill) (JIRA)
Daniel Barclay (Drill) created DRILL-3954: - Summary: HBase tests use only 1 region, don't detect bug(s) in dummy-column NullableIntVector creation/resolution Key: DRILL-3954 URL: https://issues.apache.org/

[GitHub] drill pull request: DRILL-3802: Throw unsupported error for Rollup...

2015-10-19 Thread amansinha100
Github user amansinha100 commented on the pull request: https://github.com/apache/drill/pull/209#issuecomment-149369397 Gave one minor comment; rest of the changes look good to me. +1

[GitHub] drill pull request: DRILL-3802: Throw unsupported error for Rollup...

2015-10-19 Thread amansinha100
Github user amansinha100 commented on a diff in the pull request: https://github.com/apache/drill/pull/209#discussion_r42435960 --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/planner/sql/parser/UnsupportedOperatorsVisitor.java --- @@ -305,6 +313,105 @@ public SqlNode

[jira] [Created] (DRILL-3953) Apache Drill - Memory Issue when using against hbase db on Windows machine

2015-10-19 Thread Pete (JIRA)
Pete created DRILL-3953: --- Summary: Apache Drill - Memory Issue when using against hbase db on Windows machine Key: DRILL-3953 URL: https://issues.apache.org/jira/browse/DRILL-3953 Project: Apache Drill

Re: List type

2015-10-19 Thread Hanifi Gunes
Sounds great. I will look at both the union vector and type promotion stuff very soon. It would be nice if we could work on bringing up ListVector alive as well. I will file a JIRA for this. On Mon, Oct 19, 2015 at 2:43 PM, Steven Phillips wrote: > In the work I did for the Union types, (see PR

Re: List type

2015-10-19 Thread Steven Phillips
In the work I did for the Union types (see PR https://github.com/apache/drill/pull/207), I actually went down that exact path. In that branch, if Union type is enabled, any vectors created through the ComplexWriter interface will not create any Repeated type vectors. On Mon, Oct 19, 2015 at 2:29 P

Re: List type

2015-10-19 Thread Hanifi Gunes
If I am not wrong, currently we use i) RepeatedInt for single, ii) RepeatedList of RepeatedInt for double, and iii) RepeatedList of RepeatedList of RepeatedInt for triple arrays. I think we should refactor the vector design in such a way that we will only have a ListVector, eliminating the need for all Repeated
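
The idea in the message above, one list type that nests instead of a separate Repeated variant per dimension, can be sketched roughly as follows. This is only an illustration: `IntType`, `ListType`, `nested_list`, `describe` and `dimensions` are hypothetical names invented here, not Drill classes, and the sketch says nothing about how Drill's value vectors are actually laid out.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class IntType:
    """Leaf element type (hypothetical, for illustration only)."""


@dataclass(frozen=True)
class ListType:
    """A list whose elements all have the type `element` (hypothetical)."""
    element: "Union[IntType, ListType]"


def nested_list(depth: int) -> Union[IntType, ListType]:
    """Build an int type wrapped in `depth` levels of ListType."""
    t: Union[IntType, ListType] = IntType()
    for _ in range(depth):
        t = ListType(t)
    return t


def describe(t: Union[IntType, ListType]) -> str:
    """Render the type as a readable string."""
    if isinstance(t, ListType):
        return f"List<{describe(t.element)}>"
    return "Int"


def dimensions(t: Union[IntType, ListType]) -> int:
    """Count how many list levels wrap the leaf type."""
    count = 0
    while isinstance(t, ListType):
        count += 1
        t = t.element
    return count


single = nested_list(1)  # one-dimensional array of int
double = nested_list(2)  # two-dimensional array of int
triple = nested_list(3)  # three-dimensional array of int
```

With a single recursive type, the three cases listed in the message differ only in nesting depth, so no dimension-specific type is needed.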

select from table with options

2015-10-19 Thread Julien Le Dem
I'm looking into passing information on how to interpret a file through the select clause in Drill. Something along the lines of: *select * from dfs.`default`.`/path/to/file/something.psv?type=text&delimiter=|`;* (In this example, we want to specify a specific delimiter, but that would apply to any
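
As a rough sketch of the proposal above, the options after the `?` could be separated from the file path like this. The helper name `split_table_options` is hypothetical, and this is plain string handling in Python, not how Drill parses table names.

```python
from urllib.parse import parse_qsl


def split_table_options(table_ref: str):
    """Split 'path?key=value&key=value' into (path, options dict).

    Hypothetical helper for illustration; not part of Drill.
    """
    path, _, query = table_ref.partition("?")
    # keep_blank_values=True so an option given as 'key=' is not dropped.
    options = dict(parse_qsl(query, keep_blank_values=True))
    return path, options


path, options = split_table_options(
    "/path/to/file/something.psv?type=text&delimiter=|"
)
```

One caveat with this style: a delimiter such as `&` or `+` would have to be percent-encoded, because those characters have their own meaning inside a query string.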

List type

2015-10-19 Thread Julien Le Dem
I'm looking at the type system in Drill and I have the following question: Why is there a LIST type and a REPEATED field? It sounds like there should be only one of those two concepts. Could someone describe how the following are represented? - one dimensional list of int - 2 dimensional list of ints -

Re: [DISCUSS] Design Documents

2015-10-19 Thread Parth Chandra
Agreed that we need to have a better response from the dev community when a proposal is put forward, but the absence of a response does not imply that we need not write any design down. The document has many consumers down the road (new contributors, documentation, and even end users who want to un

Re: Partial aggregation in Drill-on-Phoenix

2015-10-19 Thread Maryann Xue
Made another two check-ins to https://github.com/jacques-n/drill/pull/5, first one for the changes James had suggested. The second check-in included some test cases that failed to use Phoenix partial aggregate because of https://issues.apache.org/jira/browse/CALCITE-926. I also reproduced the prob

Re: [DISCUSS] Design Documents

2015-10-19 Thread Ted Dunning
On Mon, Oct 19, 2015 at 2:17 AM, wrote: > When working internally here, we write our design docs upfront using > markdown. > ... > I can't say how appropriate this is for an open-source effort. > Well, the whole Drill web-site is done in Mark-down. Design documents could easily be done as pull

[GitHub] drill pull request: DRILL-3802: Throw unsupported error for Rollup...

2015-10-19 Thread hsuanyi
Github user hsuanyi commented on the pull request: https://github.com/apache/drill/pull/209#issuecomment-149294066 +1(non-binding) LGTM

[jira] [Created] (DRILL-3952) Improve Window Functions performance when not all batches are required to process the current batch

2015-10-19 Thread Deneche A. Hakim (JIRA)
Deneche A. Hakim created DRILL-3952: --- Summary: Improve Window Functions performance when not all batches are required to process the current batch Key: DRILL-3952 URL: https://issues.apache.org/jira/browse/DRILL

[GitHub] drill pull request: DRILL-3769: Allow to_date() to use castDate()'...

2015-10-19 Thread hsuanyi
Github user hsuanyi commented on a diff in the pull request: https://github.com/apache/drill/pull/205#discussion_r42401544 --- Diff: exec/java-exec/src/test/resources/testframework/testCastFunctions/testToDateForTimeStamp.tsv --- @@ -0,0 +1 @@ +1969-12-31 --- End diff --

[jira] [Created] (DRILL-3951) Lexical Errors in ODBC Queries

2015-10-19 Thread Charles Givre (JIRA)
Charles Givre created DRILL-3951: Summary: Lexical Errors in ODBC Queries Key: DRILL-3951 URL: https://issues.apache.org/jira/browse/DRILL-3951 Project: Apache Drill Issue Type: Bug

[jira] [Created] (DRILL-3950) CAST(...) * (Interval Constant) gives Internal Exception

2015-10-19 Thread Sean Hsuan-Yi Chu (JIRA)
Sean Hsuan-Yi Chu created DRILL-3950: Summary: CAST(...) * (Interval Constant) gives Internal Exception Key: DRILL-3950 URL: https://issues.apache.org/jira/browse/DRILL-3950 Project: Apache Drill

[GitHub] drill pull request: DRILL-3802: Throw unsupported error for Rollup...

2015-10-19 Thread jinfengni
GitHub user jinfengni opened a pull request: https://github.com/apache/drill/pull/209 DRILL-3802: Throw unsupported error for Rollup, Cube, Grouping_Sets, … …Grouping, Grouping_ID, Group_ID. You can merge this pull request into a Git repository by running: $ git pull https:

[GitHub] drill pull request: DRILL-3769: Allow to_date() to use castDate()'...

2015-10-19 Thread mehant
Github user mehant commented on a diff in the pull request: https://github.com/apache/drill/pull/205#discussion_r42399236 --- Diff: exec/java-exec/src/test/resources/testframework/testCastFunctions/testToDateForTimeStamp.tsv --- @@ -0,0 +1 @@ +1969-12-31 --- End diff --

[GitHub] drill pull request: DRILL-3769: Allow to_date() to use castDate()'...

2015-10-19 Thread mehant
Github user mehant commented on the pull request: https://github.com/apache/drill/pull/205#issuecomment-149287371 +1

[jira] [Resolved] (DRILL-3949) new storage plugin config is not saved on OS X

2015-10-19 Thread Jacques Nadeau (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-3949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacques Nadeau resolved DRILL-3949. --- Resolution: Not A Problem This is expected behavior. OS X (and most systems) clear /tmp/ on a

Re: [DISCUSS] Design Documents

2015-10-19 Thread Parth Chandra
On Sun, Oct 18, 2015 at 4:44 PM, Jacques Nadeau wrote: > Parth, > > Thanks for bringing this up. We definitely need to do a better job of > discussing development decisions. I think whether this is done as a set of > descriptions and comments on JIRA or a formal doc on Google is less > important

Who wants to write a simple rest script to post a Slack digest?

2015-10-19 Thread Jacques Nadeau
Hey Guys, I was looking at Slack and there doesn't seem to be a straightforward way to send an email to the list once a week that shows all the messages for the dev and user channels. Anybody want to write a little script that does this? The APIs are here: https://api.slack.com/events/message Note

[GitHub] drill pull request: DRILL-3947: Use setSafe() for date, time, time...

2015-10-19 Thread zfong
Github user zfong commented on the pull request: https://github.com/apache/drill/pull/208#issuecomment-149259793 Aman -- yup, saw that. Thanks. On Mon, Oct 19, 2015 at 8:57 AM, Aman Sinha wrote: > @zfong , please see my last explanation

[GitHub] drill pull request: DRILL-3947: Use setSafe() for date, time, time...

2015-10-19 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/drill/pull/208

[GitHub] drill pull request: DRILL-3947: Use setSafe() for date, time, time...

2015-10-19 Thread amansinha100
Github user amansinha100 commented on the pull request: https://github.com/apache/drill/pull/208#issuecomment-149257300 @zfong, please see my last explanation about why the repro does not occur at small scale. Hence, adding a unit test won't help ... unless the static constant that c

[jira] [Created] (DRILL-3949) new storage plugin is not saved on OS X

2015-10-19 Thread Kristine Hahn (JIRA)
Kristine Hahn created DRILL-3949: Summary: new storage plugin is not saved on OS X Key: DRILL-3949 URL: https://issues.apache.org/jira/browse/DRILL-3949 Project: Apache Drill Issue Type: Bug

Re: [MongoDB] - Why not returning the _id when using *

2015-10-19 Thread Tugdual Grall
> > > >> I do not understand why the '_id' is not returned when I do: > > >> select * from mongo.db.collection > > >> Any reason? > In most of the scenarios _id is system generated and not used for > analytics. That's why, we thought of ignoring this field as of "select *", > but user can explicit

Re: [MongoDB] - Why not returning the _id when using *

2015-10-19 Thread AnilKumar B
Hi Tugdual, >> I do not understand why the '_id' is not returned when I do: >> select * from mongo.db.collection >> Any reason? In most of the scenarios _id is system generated and not used for analytics. That's why, we thought of ignoring this field as of "select *", but user can explicitly me

RE: [DISCUSS] Design Documents

2015-10-19 Thread Michael.FULKE
Hi - this is always a challenge, when people just love writing code. When working internally here, we write our design docs upfront using markdown. Then we raise a pull request in Stash into the design docs folder. At this point the team comments on the PR, just as they would on a code change. The

Re: Apache Drill

2015-10-19 Thread Kasper Sørensen
@Jacques - I might be interested, but I am new to Drill - just saw the demo at Apache: Big Data EU and felt that what we do in MetaModel and what you do in Drill is hugely complementary. But I need someone to help me get started and thought this might be a good spot for it. @Ted - The mapping is s

[GitHub] drill pull request: DRILL-3947: Use setSafe() for date, time, time...

2015-10-19 Thread amansinha100
Github user amansinha100 commented on the pull request: https://github.com/apache/drill/pull/208#issuecomment-149123146 @mehant , yes the underlying issue is simple: the null bit vector for the NullableDateVector is allocated at 4096 bytes (actually, the UInt1Vector is used for this,
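
The preview above is cut off, so the full explanation is not shown here. As a generic illustration of why a fixed 4096-byte allocation matters, the toy class below contrasts a bounds-checked write with a write that grows the buffer first. `ToyByteVector`, `set` and `set_safe` are hypothetical names for this sketch; it is not Drill's vector code and makes no claim about how Drill's `setSafe()` is implemented.

```python
class ToyByteVector:
    """Toy fixed-capacity byte buffer (hypothetical, not a Drill class)."""

    def __init__(self, capacity: int = 4096):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._buf = bytearray(capacity)

    @property
    def capacity(self) -> int:
        return len(self._buf)

    def set(self, index: int, value: int) -> None:
        """Write without growing; fails once index reaches capacity."""
        if index >= len(self._buf):
            raise IndexError(f"index {index} >= capacity {len(self._buf)}")
        self._buf[index] = value

    def set_safe(self, index: int, value: int) -> None:
        """Double the buffer until the index fits, then write."""
        while index >= len(self._buf):
            self._buf.extend(bytes(len(self._buf)))
        self._buf[index] = value

    def get(self, index: int) -> int:
        return self._buf[index]


vec = ToyByteVector(4096)
vec.set(4095, 1)       # last slot of the initial allocation
vec.set_safe(4096, 1)  # first slot past it; the buffer doubles to 8192
```

In this toy model a plain `set` works for every index below 4096 and only fails on larger inputs, which is the kind of behaviour that would not show up in a small test.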