Re: File System Storage Plugin

2015-06-09 Thread Abhishek Girish
Hey Kartik, You could refer to http://drill.apache.org/docs/file-system-storage-plugin/ for details. Regards, Abhishek On Tuesday, June 9, 2015, Kartik Bhatia wrote: > Hi, > I have installed Apache Drill on my local windows workstation. > How do I install Storage Plugin for distributed file sy

Re: File System Storage Plugin

2015-06-09 Thread Kristine Hahn
Please see http://apache.github.io/drill/docs/file-system-storage-plugin/ and please let us know if you have any problems with these instructions. Kristine Hahn Sr. Technical Writer 415-497-8107 @krishahn On Tue, Jun 9, 2015 at 5:00 PM, Kartik Bhatia wrote: > Hi, > I have installed Apache Dril

Re: Drill vs MapR-DB/HBASE : fragments per region for table-scan

2015-06-09 Thread Steven Phillips
It's currently not supported. It would require an enhancement to the storage plugin. Feel free to file an enhancement jira, or f you want to experiment with it yourself, you can take a look at the HBaseGroupScan code, which is where the assignment logic is located. Currently, we don't split the wo

File System Storage Plugin

2015-06-09 Thread Kartik Bhatia
Hi, I have installed Apache Drill on my local windows workstation. How do I install Storage Plugin for distributed file system(hdfs) with Apache Drill to query files on hdfs system? Thanks Kartik ~~ This e-mail message from State Co

Re: Query on setting up Apache Drill and nested query for json file

2015-06-09 Thread Ted Dunning
On Tue, Jun 9, 2015 at 11:42 AM, Jason Altekruse wrote: > *We do not currently have a shortcut to read files in the directory where > you launched Drill.* > This has made me grumpy in the past, but I really think that Drill got it right here. The real problem is that with a large parallel progr

Re: Issue getting Drill up and running on Windows

2015-06-09 Thread Kristine Hahn
Additional note: you can ignore the configuration error message ( https://issues.apache.org/jira/browse/DRILL-3120), this is expected as shown below: C:\apache-drill-1.0.0\bin>sqlline.bat -u "jdbc:dri l:schema=dfs;zk=local" DRILL_ARGS - " -u jdbc:drill:schema=dfs;zk=local" HADOOP_HOME not detecte

Re: Issue getting Drill up and running on Windows

2015-06-09 Thread Andries Engelbrecht
The connection string is wrong. Use sqlline.bat -u "jdbc:drill:zk=local" On Jun 9, 2015, at 1:31 PM, Benjamin Etten wrote: > I just installed Drill last night on my Mac (after seeing a fantastic > presentation, I am SO blown away!). No problems there... however, I am > having issues getting it

Issue getting Drill up and running on Windows

2015-06-09 Thread Benjamin Etten
I just installed Drill last night on my Mac (after seeing a fantastic presentation, I am SO blown away!). No problems there... however, I am having issues getting it to run on Windows. I have checked the list of pre-reqs, including Environment Variables. When I execute the following, (see below

Re: Query on setting up Apache Drill and nested query for json file

2015-06-09 Thread Jason Altekruse
Hi Rob, Thanks for putting so much effort into getting Drill set up for your use case, we know that there are still some sharp edges in Drill and detailed information about use cases that are hard to set up help us to improve the docs and core project. As a quick answer, I think you might have ru

Drill vs MapR-DB/HBASE : fragments per region for table-scan

2015-06-09 Thread Andy Pernsteiner
I've been doing some experimentation with Drill (1.0) against MaprDB. I believe I'd see similar behavior vs HBASE, although I haven't tried yet. I have a 11GB table, which is split into 8 regions (not perfectly balanced, some have 2x the # of records as others). When I run a drill query which

Re: MapR Drill - mongodb collections does not show up

2015-06-09 Thread Kamesh
Jacques, We are also planning to work on writing BSON data types directly into value vectors. Let us know, if you have already started working on it, otherwise we would like to work on it. On Tue, Jun 9, 2015 at 9:23 PM, Jacques Nadeau wrote: > Yeah, I'm looking at the avoiding the BSON > JSON

Re: from_unixtime in drill explorer/ODBC

2015-06-09 Thread Jason Altekruse
This is pretty well implied with Christopher's message, but Drill ships with a Hive storage plugin which puts Hive jars on the default class path. Just as with Drill native UDFs we pick up the default Hive functions in these jars and register them. Another one that was causing some issues was the H

Re: MapR Drill - mongodb collections does not show up

2015-06-09 Thread Jacques Nadeau
Yeah, I'm looking at the avoiding the BSON > JSON steps. On Tue, Jun 9, 2015 at 12:17 AM, Adam Gilmore wrote: > ​We haven't quite hit it with that much data in Mongo. We're compressing > Mongo data to Parquet in smaller chunks so we never query Mongo with that > much. > > We've found it a bit t

Re: Query on setting up Apache Drill and nested query for json file

2015-06-09 Thread Andries Engelbrecht
Rob, What errors are in your drillbit.log and drillbit.out files? This will help to understand why you are seeing the issues with getting Drill running. To “spool” the query output you can user !record where it writes to the relative path from where sqlline was started I would recommend that

Re: Query on setting up Apache Drill and nested query for json file

2015-06-09 Thread Neeraja Rentachintala
Rob Before answering specific questions, I just wanted to check how did you setup Drill. Did you just download the tar from the Drill site and launching in embedded mode? I just did the same and am able to query the file you attached. I just put in my local file system under the directory and refer

Re: Drill authentication with pyodbc

2015-06-09 Thread Christopher Matta
My fault, I had a greedy regular expression that was replacing the space in the value “Basic Authentication” with nothing. It’s working now. Some people, when confronted with a problem, think “I know, I’ll use regular expressions.” Now they have two problems. Thanks guys. Chris mattacma...@mapr

Query on setting up Apache Drill and nested query for json file

2015-06-09 Thread Rob E
Hello, We are trying to 'flatten' out a multi level (nested) json file that is the result of a REST API call. We were going to try flatten out this to a CSV, load it into Oracle tables and then do traditional SQL queries. The initial approach was going to be done using a combination of CSVKIT (in2

Re: from_unixtime in drill explorer/ODBC

2015-06-09 Thread Christopher Matta
The function from_unixtime() is actually a Hive UDF and drill doesn’t yet know how to deal with the 2 extra bytes returned from it. It’s better to use the Drill function to_timestamp(): select to_timestamp(1432912733) from `sys`.`version` Chris mattacma...@mapr.com 215-701-3146 On Tue, Jun 9, 20

from_unixtime in drill explorer/ODBC

2015-06-09 Thread Uli Bethke
The following query run through Drill Explorer/ODBC returns NULL, while running it from SQLLINE it works fine. select from_unixtime(1432912733) from `sys`.`version`; Is this a limitation of the ODBC driver? Running on Drill 1.0.0 on Windows in Embedded mode. Using 1.0.0 ODBC driver

Re: MapR Drill - mongodb collections does not show up

2015-06-09 Thread Adam Gilmore
​We haven't quite hit it with that much data in Mongo. We're compressing Mongo data to Parquet in smaller chunks so we never query Mongo with that much. We've found it a bit too slow querying Mongo with Drill, mostly I think from the fact that it goes from BSON -> JSON and then the JSON is parsed