bootstrap-storage-plugins.json - Missing documentation?

2016-03-31 Thread Peder Jakobsen | gmail
Hi, the documentation mentions that you can load storage plugin configurations at startup. This is good, because every time I reboot, I currently have to manually cut and paste my configuration into the web interface. But, while the documentation references an example of bootstrap-storage-plugins

Re: Creating a single parquet or csv file using CTAS command?

2016-02-04 Thread Peder Jakobsen | gmail
rt from just a > > basic hosting solution? As an example you will find MapR with Drill > > available on demand on Amazon AWS and Azure. You may want to look at that > > to spin up a node(s), load/process/download your data, and then spin it > > down. Might be worth a look, p

Re: Creating a single parquet or csv file using CTAS command?

2016-02-04 Thread Peder Jakobsen | gmail
onfig for your use > case. > > --Andries > > > On Feb 4, 2016, at 10:12 AM, Peder Jakobsen | gmail > wrote: > > > > Sorry, bad typo: I have 50GB of data, NOT 500GB ;). And I usually only > > query a 1 GB subset of this data using Drill. > > > &g

Re: Creating a single parquet or csv file using CTAS command?

2016-02-04 Thread Peder Jakobsen | gmail
Sorry, bad typo: I have 50GB of data, NOT 500GB ;). And I usually only query a 1 GB subset of this data using Drill. On Thu, Feb 4, 2016 at 1:04 PM, Peder Jakobsen | gmail wrote: > On Thu, Feb 4, 2016 at 11:15 AM, Andries Engelbrecht < > aengelbre...@maprtech.com> wrote: >

Re: Creating a single parquet or csv file using CTAS command?

2016-02-04 Thread Peder Jakobsen | gmail
On Thu, Feb 4, 2016 at 11:15 AM, Andries Engelbrecht < aengelbre...@maprtech.com> wrote: > Is there a reason to create a single file? Typically you may want more > files to improve parallel operation on distributed systems like drill. > Good question. I'm not actually using Drill for "big data

Creating a single parquet or csv file using CTAS command?

2016-02-04 Thread Peder Jakobsen | gmail
Hi, is there a way to force drill to create a single file when performing a CTAS command (or some other method). Right now, I'm creating CSV files, and then have to perform and extra step to stitch 1_0_0.parquet 1_1_0.parquet 1_2_0.parquet etc. together into a single file. Thank you. Peder

Can Drill connect to an MS Access Database file using ODBC?

2016-01-19 Thread Peder Jakobsen | gmail
I have an .mdb (Access file) that ideally I'd like for Drill to see as just another data source. Is this possible? It seems like the MapR driver is for connecting to Drill via ODBC only, which doesn't cover my use case. Thanks :)

Re: Does drill recognize new line correctly?

2016-01-06 Thread Peder Jakobsen | gmail
Hi, if you are working in Unix, use *iconv* to remove newlines, and other things like BOMS, converting to UTF-8 etc. Perhaps Google "iconv remove newlines from csv"..? iconv is quick, you can process Gibabytes of nested csv files in minutes. If you are using Windows, I'm not sure. Peder On

Re: SQL Lookup table - how to mimic?

2016-01-06 Thread Peder Jakobsen | gmail
uot;,"AU","AUS","36",".au\r"] | > > +-+ > > 10 rows selected (0.905 seconds) > > 0: jdbc:drill:> select * from maprfs.cmatta.`country_data.csv` limit 10; > > +---

Re: SQL Lookup table - how to mimic?

2016-01-06 Thread Peder Jakobsen | gmail
; 10 rows selected (0.905 seconds) > 0: jdbc:drill:> select * from maprfs.cmatta.`country_data.csv` limit 10; > ++ > | columns | > +

SQL Lookup table - how to mimic?

2016-01-05 Thread Peder Jakobsen | gmail
Does drill have something analogous to SQL CONSTRAINT and REFERENCES for doing table type lookups? I have a large data that uses numerical ISO country codes; they need to be replaced with country names that are mapped in a small CSV file (iso_code, country_name). Any suggestions? Thanks you,

Re: ODBC in Virtual Box connects to Drill on host Unix system?

2016-01-04 Thread Peder Jakobsen | gmail
; > What BI tool are you running on Debian? > > Thanks, > Tomer > > On Thu, Dec 31, 2015 at 8:40 AM, Peder Jakobsen | gmail < > pjakob...@gmail.com > > wrote: > > > Hi Norris, > > > > Just discovered that the ODBC driver is only available for Cent

Re: MS Windows: Hadoop binaries required to run drill?

2016-01-03 Thread Peder Jakobsen | gmail
; indicate why Drill is failing to initialize these settings. > > Can you possibly try this under the Administrator account if one exists? > > -- > Jacques Nadeau > CTO and Co-Founder, Dremio > > On Thu, Dec 31, 2015 at 2:50 PM, Peder Jakobsen | gmail < > pjakob...@gmail.c

Documentation update: 64bit Windows is a basic requirement for running Drill on Windows?

2016-01-03 Thread Peder Jakobsen | gmail
It doesn't seem possible to run Drill on 32 bit versions of Windows, as implied in the instructions on installing the ODBC driver here. https://drill.apache.org/docs/installing-the-driver-on-windows/ I'd like to update the documentation to make this more explicit via a Github pull request. I jus

Re: MS Windows: Hadoop binaries required to run drill?

2016-01-02 Thread Peder Jakobsen | gmail
; indicate why Drill is failing to initialize these settings. > > Can you possibly try this under the Administrator account if one exists? > > -- > Jacques Nadeau > CTO and Co-Founder, Dremio > > On Thu, Dec 31, 2015 at 2:50 PM, Peder Jakobsen | gmail < > pjakob...@gmail.com >

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
t if one exists? > > -- > Jacques Nadeau > CTO and Co-Founder, Dremio > > On Thu, Dec 31, 2015 at 2:50 PM, Peder Jakobsen | gmail < > pjakob...@gmail.com > > wrote: > > > I spoke too soon, perhaps (but it must be a simple I/O issue on startup, > >

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
line 1, column 15 to line 1, column 16: Table 'cp.employee.json' not found Error: VALIDATION ERROR: From line 1, column 15 to line 1, column 16: Table 'cp.employee.json' not found [Error Id: 9d6af232-fc40-40ec-8a2a-8d082923b776 on Funky:31010] (state=,code=0) On Thu, Dec 31,

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
though I have not programmed in Java since 2003 ? ;) I can't imagine that I/O in java has changed all that much. Peder :) On Thu, Dec 31, 2015 at 5:30 PM, Peder Jakobsen | gmail wrote: > No. I've tried this many times. > > So at startup, it's supposed to create th

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
> On Thu, Dec 31, 2015 at 2:22 PM, Peder Jakobsen | gmail > wrote: > > OK, at startup, Drill creates an empty file called hive.sys.drill that's > > located in C:\tmp\drill\sys.storage_plugins > > > > Perhaps it's not surprising that we get "Unable

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
file appears to be empty. On Linux, lots of stuff is included in this drill path: profiles, sys.options & sys.storage_plugins Hope this helps P. On Thu, Dec 31, 2015 at 5:13 PM, Peder Jakobsen | gmail wrote: > I deleted everything in C:\Windows\Temp. Note, when I start drill > a

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
der, > > What's in your C:\Windows\Temp directory? Is there something that > looks like it's from Drill? If there is, try deleting it and running > again. > > --Nathan > > On Thu, Dec 31, 2015 at 1:53 PM, Peder Jakobsen | gmail > wrote: > > Hi Tomer and Jacq

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
xception: Unable to deserialize "/tmp/drill/sys.storage_plugins/hive.sys.drill" (state=,code=0) On Thu, Dec 31, 2015 at 4:33 PM, Ted Dunning wrote: > Can you say a bit more about your platform and environment? > > What about the errors you are seeing? > > > > On Thu,

Re: MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
> > > No. Drill does not depend on Hadoop > > > > Can you try these instructions and see if they work: > > > > http://www.dremio.com/blog/installing-apache-drill-on-microsoft-windows/ > > > > > > > On Dec 31, 2015, at 11:32 AM, Peder Jakobsen | gm

MS Windows: Hadoop binaries required to run drill?

2015-12-31 Thread Peder Jakobsen | gmail
Hi, does hadoop have to be installed in order to run apache drill on windows? So far, I'm 3 hours into "Run Apache Drill in 10 Minutes", and still no luck. Once I figure it out, I will update the documentation for windows if you like. Thanks, Peder J.

Re: ODBC in Virtual Box connects to Drill on host Unix system?

2015-12-31 Thread Peder Jakobsen | gmail
: > Hey Peder, > > Yep, I tried using Tableau on a Windows VM (VMWare) connecting to Drill > running on a Host Linux system and it worked perfectly fine. > > Norris > > -Original Message- > From: Peder Jakobsen | gmail [mailto:pjakob...@gmail.com] > Sent: Tues

Re: Drill 1.4 fails to start on Windows 7, JDK 1.7.0 - "Unable to deserialize"

2015-12-31 Thread Peder Jakobsen | gmail
than > > On Tue, Dec 29, 2015 at 2:04 PM, Peder Jakobsen | gmail > wrote: > > C:\devel\apache-drill-1.4.0\bin>sqlline.bat -u "jdbc:drill:zk=local" > > DRILL_ARGS - " -u jdbc:drill:zk=local" > > HADOOP_HOME not detected... > > HBASE_H

Fixed: Apache Drill won't start on windows 7

2015-12-29 Thread Peder Jakobsen | gmail
This appears to be a common issue that was addressed here: https://issues.apache.org/jira/browse/DRILL-1907 Adam Gilmore added a comment - 08/Jan/15 05:30 Fixed it - basically it's a problem that, by default, Drill tries

ODBC in Virtual Box connects to Drill on host Unix system?

2015-12-29 Thread Peder Jakobsen | gmail
Hi, I recently installed Windows 7 in VirtalBox in order to connect Tableau to Drill. Unfortunately Apache Drill 1.4 won't run on Windows at the moment, awaiting a response from the mailing list. But, I wonder if it's possible to simply connect via ODBC from VirtualBox Windows to Drill running o

Drill 1.4 fails to start on Windows 7, JDK 1.7.0 - "Unable to deserialize"

2015-12-29 Thread Peder Jakobsen | gmail
C:\devel\apache-drill-1.4.0\bin>sqlline.bat -u "jdbc:drill:zk=local" DRILL_ARGS - " -u jdbc:drill:zk=local" HADOOP_HOME not detected... HBASE_HOME not detected... Calculating Drill classpath... Error: Failure in starting embedded Drillbit: java.lang.RuntimeException: Unable to deserialize "/tmp/dr

Re: Python Driver Contribution Idea

2015-12-28 Thread Peder Jakobsen | gmail
I'm quite motivated to help. And yes, it would make for a very nice masters thesis: Good API design is a seriously useful thing to become an expert at :) Cheers, Peder Jakobsen On Mon, Dec 28, 2015 at 10:45 AM, Peder Jakobsen | gmail < pjakob...@gmail.com> wrote: > Two thum

Re: Python Driver Contribution Idea

2015-12-28 Thread Peder Jakobsen | gmail
Two thumbs up for this project. An immediate benefit is the ability to take advantage of the enhanced interactive features of the iPython shell. Perhaps the next step is to model the design after a similar Rest API wrapper, for example, python-twitter: https://github.com/bear/python-twitter On

Re: Learning to use the C++ client libraries

2015-12-28 Thread Peder Jakobsen | gmail
Drill. > > > > Here’s an iPython Notebook that demonstrates the RESTful interface and > > Drill. ( > > > https://github.com/cgivre/DS/blob/master/Querying%20Apache%20Drill%20via%20the%20RESTful%20Interface.ipynb > > ) > > > > — C > > > > > On De

Learning to use the C++ client libraries

2015-12-27 Thread Peder Jakobsen | gmail
Hi, Coming from the Python/Ruby /bash shell world, I find working SQLLine a hard pill to swallow; it's it's very "DOS" like, aka almost featureless. For this reason, I'd like to explore writing a python wrapper around the Drill C++ client libraries. What's the best way to learn to use this libra

Where is the Drill Explorer and / or docs on installing it?

2015-12-27 Thread Peder Jakobsen | gmail
Hi, The docs refer to the Drill Explorer, which for OS X is supposed to be located in the Applications folder: https://drill.apache.org/docs/connecting-drill-explorer-to-data/ But there is no mention in the documentation on how to install this tool (I think)? Are the docs referring to and old ve

Cloud Hosting

2015-12-08 Thread Peder Jakobsen | gmail
Hi, Are there any companies offering Drill cloud hosting? If not, is there a document that spells out best practices, or makes recommendations, eg. OpenShift vs. Amazon vs. DigitalOcean etc. Mapr seems to have some stuff, but are they the only game in town ? https://aws.amazon.com/marketplace/