Drill logical plan optimization

2015-05-27 Thread Rajkumar Singh
Hi, I am looking for measures or parameters to look at when optimizing a Drill logical query plan that I want to resubmit through the Drill UI. Could you please point me to some docs that I can go through? Rajkumar Singh MapR Technologies

Re: what's the differenct between drill and optiq

2015-05-27 Thread Jacques Nadeau
Andrew, As others have pointed out, there are definitely differences in how each community project leverages Calcite (remember, Apache Kylin, Phoenix and I believe Flink also use it). Remember, Calcite, at its core, is a developer's toolkit that other applications/systems incorporate. Whi

Re: what's the differenct between drill and optiq

2015-05-27 Thread Ted Dunning
Andrew, Sorry for being cryptic. Hanifi is more clear. My point was directed at the differences between where Hive may ultimately go and where Drill is now. Hanifi was providing a good summary of where Drill is now. As he said, Calcite does query parsing and planning. Ultimately, it will do t

Re: what's the differenct between drill and optiq

2015-05-27 Thread Jinfeng Ni
Besides the different optimization rule sets and cost model (Calcite is very extensive in that sense) used by Hive and Drill, I see the following differences in how Calcite is used in these two systems. 1. Drill supports queries against schema-less storage (text, JSON, etc.). For Hive or other

RE: what's the differenct between drill and optiq

2015-05-27 Thread Andrew Brust
That makes sense. Just having trouble mapping that back on Ted's comment. But I tend to think that's me and my ignorance. -Original Message- From: Hanifi Gunes [mailto:hgu...@maprtech.com] Sent: Wednesday, May 27, 2015 4:48 PM To: user Subject: Re: what's the differenct between drill a

RE: what's the differenct between drill and optiq

2015-05-27 Thread Andrew Brust
I have to admit that I'm not following your point all that well. Is there something you could link me to so I could read up and better understand without using up your time? -Original Message- From: Ted Dunning [mailto:ted.dunn...@gmail.com] Sent: Wednesday, May 27, 2015 11:35 AM To: u

Re: what's the differenct between drill and optiq

2015-05-27 Thread Hanifi Gunes
Calcite does parsing & planning of queries. Drill executes in a very flexible distributed columnar fashion with late binding. On Wed, May 27, 2015 at 8:34 AM, Ted Dunning wrote: > Andrew, > > What Hive does not have is the extensions that Drill has that allow SQL to > be type flexible. The ALL

Monitoring long / stuck CTAS

2015-05-27 Thread Matt
Attempting to create a Parquet-backed table with a CTAS from a 44GB tab-delimited file in HDFS. The process seemed to be running, as CPU and IO were seen on all 4 nodes in this cluster, and .parquet files were being created in the expected path. However, in the last two hours or so, all nodes sho

Re: DrillBits Communication

2015-05-27 Thread Kristine Hahn
> > With respect to user interacting with drillbit, the following documentation > will give you some idea. > https://cwiki.apache.org/confluence/display/DRILL/Drill+Interfaces Correction: The cwiki site is obsolete. Here's the latest Drill interfaces docs: http://drill.apache.org/docs/odbc-jdbc-

Re: DrillBits Communication

2015-05-27 Thread Neeraja Rentachintala
Sanjeev If you are new to Drill, I suggest looking at the high level architecture doc below to get started. http://drill.apache.org/architecture/ With respect to user interacting with drillbit, the following documentation will give you some idea. https://cwiki.apache.org/confluence/display/DRILL/Dr

RE: JAVA API for Drill

2015-05-27 Thread Norris Lee
Hi Nishith, Take a look at the DrillClient.java and .cpp/.hpp classes of the project for the Java and C++ libraries respectively. Norris -Original Message- From: Nishith Maheshwari [mailto:nsh...@gmail.com] Sent: Wednesday, May 27, 2015 1:45 AM To: user@drill.apache.org Subject: Re: J

Re: Query local files on cluster? [Beginner]

2015-05-27 Thread Andries Engelbrecht
OK, that is the simplest way to get going and see how the solution works for you. It can be a little confusing between local FS and working with a cluster. I have found that when dealing with large data volumes it is much easier to use the NFS on the MapR cluster to move data directly to the DFS

Re: Query local files on cluster? [Beginner]

2015-05-27 Thread Matt
Drill can process a lot of data quickly, and for best performance and consistency you will likely find that the sooner you get the data to the DFS the better. Already most of the way there. Initial confusion came from the features to query the local / native filesystem, and how that does not f

DrillBits Communication

2015-05-27 Thread Sanjeev Verma
Hi, can somebody help me find documentation on how a drillbit communicates with the other drillbits in the cluster, and also on how users interact with a drillbit? Thanks, Sanjeev

Re: what's the differenct between drill and optiq

2015-05-27 Thread Ted Dunning
Andrew, What Hive does not have is the extensions that Drill has that allow SQL to be type flexible. The ALL type and all of the implications both in terms of implementation and user impact it has are a really big deal. On Wed, May 27, 2015 at 6:08 AM, Andrew Brust < andrew.br...@bluebadgeinsi

Re: Query local files on cluster? [Beginner]

2015-05-27 Thread Andries Engelbrecht
You will be better off using the Drill cluster as a whole vs. trying to play with local vs. DFS storage. A couple of ideas: As previously mentioned, you can use the robust NFS on MapR to easily place the CSV/files on the DFS, and then use Drill with CTAS to convert the files to Parquet on the DFS
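
The CTAS conversion suggested above can be sketched as follows. This is only an illustration under stated assumptions: the helper name `csv_to_parquet_ctas`, the target table `dfs.tmp.events_parquet`, the source path `/data/events.csv`, and the column names are all hypothetical and do not come from the thread. It relies on Drill exposing the fields of a delimited text file through the `columns` array and on the `store.format` session option selecting the output format.

```python
def csv_to_parquet_ctas(target_table, source_path, column_names):
    """Return the SQL statements for a CSV-to-Parquet CTAS in Drill.

    target_table, source_path and column_names are hypothetical inputs
    chosen for illustration; adjust them to your own workspace and file.
    """
    # Delimited text files are read as a single `columns` array, so each
    # field is picked by position and given a readable alias.
    select_list = ", ".join(
        f"columns[{i}] AS `{name}`" for i, name in enumerate(column_names)
    )
    return [
        # Ask Drill to write CTAS output as Parquet for this session.
        "ALTER SESSION SET `store.format` = 'parquet'",
        f"CREATE TABLE {target_table} AS "
        f"SELECT {select_list} FROM dfs.`{source_path}`",
    ]


if __name__ == "__main__":
    for statement in csv_to_parquet_ctas(
        "dfs.tmp.`events_parquet`", "/data/events.csv", ["id", "name"]
    ):
        print(statement + ";")
```

The generated statements can be pasted into sqlline or the Drill web UI. Building them in a helper simply keeps the positional `columns[n]` mapping in one place.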

Re: DrillBits Communication

2015-05-27 Thread Rajkumar Singh
BitCom allows one drillbit to communicate with another; it basically decides which drillbit is the client and which is the server based on who initiated the connection. Please start by looking at BitCom.java and BitComImpl.java to learn more about it. Rajkumar Singh MapR Technologies > On May 27, 2015, at 7:08 P

Re: what's the differenct between drill and optiq

2015-05-27 Thread Andrew Brust
Thanks! Sent from my phone - Reply message - From: "PHANI KUMAR YADAVILLI" To: "user@drill.apache.org" Subject: what's the differenct between drill and optiq Date: Wed, May 27, 2015 8:33 AM Yes, Hive uses Calcite. You can refer to the Hive documentation. On May 27, 2015 6:01 PM, "Andrew Brust

RE: what's the differenct between drill and optiq

2015-05-27 Thread PHANI KUMAR YADAVILLI
Yes, Hive uses Calcite. You can refer to the Hive documentation. On May 27, 2015 6:01 PM, "Andrew Brust" wrote: > Folks at Hortonworks told me that Hive now uses Calcite as well. Can > anyone here confirm or deny that? > > -Original Message- > From: Rajkumar Singh [mailto:rsi...@maprtech.com] >

RE: what's the differenct between drill and optiq

2015-05-27 Thread Andrew Brust
Folks at Hortonworks told me that Hive now uses Calcite as well. Can anyone here confirm or deny that? -Original Message- From: Rajkumar Singh [mailto:rsi...@maprtech.com] Sent: Wednesday, May 27, 2015 6:52 AM To: user@drill.apache.org Subject: Re: what's the differenct between drill an

Re: what's the differenct between drill and optiq

2015-05-27 Thread Rajkumar Singh
Optiq (now known as Calcite) is an API for query parsing, planning, and optimization; Drill uses it for SQL parsing, validation, and optimization. The Drill query planner applies its own custom planner rules to build the logical query plan. Rajkumar Singh > On May 27, 2015, at 12:04 PM, 陈礼剑 wrote:
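
To look at the logical plan that the planner produces, one option is to wrap a query in `EXPLAIN PLAN WITHOUT IMPLEMENTATION FOR` and submit it to a drillbit. The sketch below only builds the statement and an HTTP request for Drill's REST endpoint (`/query.json`). The host `localhost:8047`, the helper names, and the sample query against `cp.employee.json` are assumptions made for illustration, and it assumes the REST interface is enabled on your Drill version.

```python
import json
import urllib.request


def explain_logical(sql):
    """Wrap a query so Drill returns its logical plan instead of running it."""
    return "EXPLAIN PLAN WITHOUT IMPLEMENTATION FOR " + sql.strip().rstrip(";")


def build_request(base_url, query, query_type="SQL"):
    """Build (but do not send) a POST request for Drill's REST query endpoint.

    base_url is a hypothetical drillbit web address such as
    http://localhost:8047.
    """
    body = json.dumps({"queryType": query_type, "query": query}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + "/query.json",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send the request to a running drillbit and decode the JSON reply."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    statement = explain_logical("SELECT * FROM cp.`employee.json` LIMIT 5;")
    request = build_request("http://localhost:8047", statement)
    # Printing only; call send(request) when a drillbit is actually reachable.
    print(request.full_url)
    print(request.data.decode("utf-8"))
```

Keeping request construction separate from `send` makes it possible to inspect the exact payload before anything is submitted to the cluster.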

what's the differenct between drill and optiq

2015-05-27 Thread 陈礼剑
Hi: I just want to know the difference between Drill and Optiq. Does Drill just 'extend' Optiq to support many other 'stores' (Hadoop, MongoDB, ...)? ---from davy Thanks.

Re: JAVA API for Drill

2015-05-27 Thread Nishith Maheshwari
Thank you Martin and Rajkumar for your prompt responses. I am actually looking to see whether some API is available that provides this functionality. The documentation mentions at https://drill.apache.org/docs/architecture-introduction - *You can connect to Apache Drill through the following int

Re: JAVA API for Drill

2015-05-27 Thread Rajkumar Singh
Have you tried the drill-jdbc driver? I suggest using Java JDBC connectivity to query Drill with the drill-jdbc driver. I have not tried this for querying HBase through Drill, but it should work if you have correctly configured the HBase storage plugin in Drill. Thanks Rajkumar Singh > On M