Re: Running Drill with Tachyon+S3

2015-07-26 Thread David Tucker
One quick hint : The file $DRILL_HOME/bin/hadoop-excludes.txt has a list of jars that ARE NOT loaded during the bootstrap of Drill … and jets3t is one of them. Commenting out the jets3t line in that file and restarting the drill bits will at least get you past the first java dependency problem

Re: Best Performance of drill

2015-07-25 Thread David Tucker
One more thing to remember ... S3 is an object store, not a file system in the traditional sense. That means that when a drill-bit accesses a file from S3, the whole thing is transferred ... whether it's 100 bytes or 100 megabytes. The advantages of the parquet format are far more obvious in

Re: Transitioning to shared profile location

2015-07-14 Thread David Tucker
:05 PM, Hao Zhu wrote: > > We just need to copy that profile in corresponding mfs location, then you > can view the profile from the UI: > http://:8047/profiles/ > > It is working fine for me. > > Thanks, > Hao > > > On Tue, Jul 14, 2015 at 3:15 PM,

Transitioning to shared profile location

2015-07-14 Thread David Tucker
I decided to transition to shared profile locations for my drill cluster. I updated the drill-override.conf with a blobroot setting sys.store.provider.zk.blobroot: "maprfs:///tmp/drill/profiles” and made sure the directory existed with permissions for everyone. I restarted the drill-bi

Re: Using Azure WASB

2015-06-25 Thread David Tucker
You'll need to make sure of two things : WASB jars are included by default with HDFS 2.6 and later. If you're using an earlier version (or simply a stand-alone installation of Drill), you'll need to grab the jar files and put them in the class path. Your Azure credentials must be in core-site.x

Re: AWS S3 region

2015-06-13 Thread David Tucker
Check the Amazon sites, but I believe that the EU-Central region has some constraints about data migration. It may well be that S3 buckets (even those defined as “public”) will not expose their contents to instances OUTSIDE the EU-Central region. — David On Jun 13, 2015, at 12:53 AM, Subrat

Re: Controlling the number of drill bits used in a query

2015-06-05 Thread David Tucker
The downside of that isolation is that the storage plugin configuration of the primary cluster is lost. If you connect DIRECTLY to the drill-bit rather than via zookeeper, then that drill-bit will be the foreman of your queries. For small data sets, the foreman will not involve any other drill

Re: Connection timeout 1.0.0

2015-05-26 Thread David Tucker
For many Linux services, this can be an unstable configuration. Better to use ifconfig eth0 to identify the configured IP address and add that entry to /etc/hosts. Some DHCP client packages will do this automatically, since the IP can change with every reboot. — David On May 26, 201

Re: Drill connect to S3 with AWS EMR role

2015-05-11 Thread David Tucker
The current version of Hadoop in EMR (both Apache and MapR) does not support the IAM authentication to S3 without the credentials in core-site. I believe the support has been integrated into Hadoop 2.6 … so when the EMR distributions upgrade to that level, the access you request should be supp

Re: SQL Comments

2015-04-28 Thread David Tucker
And that works … thank you Kristine ! On Apr 28, 2015, at 11:30 AM, Kristine Hahn wrote: > The new docs http://tshiran.github.io/drill/docs/lexical-structure/ will > say: > First paragraph, last bullet: /* This is a comment. */ > > Kristine Hahn > Sr. Technical Writer > 415-497-8107 @krishahn

Re: SQL Comments

2015-04-28 Thread David Tucker
Glad I’m not the only one fighting with this :) — David On Apr 28, 2015, at 11:28 AM, Ramana Inukonda wrote: > Sorry, > > Scratch that. Somehow had a notion that worked. Does not seem to be the > case. > > Regards > Ramana > > > On Tue, Apr 28, 2015 at 11:25 AM, Ramana Inukonda > wrote: >

Re: Drill to query Client-side encrypted data from S3

2015-04-07 Thread David Tucker
Ganesh, When you say the keys are “custom controlled”, does that mean that only special logic within your Java application allows the data to be properly accessed ? There are several mechanisms within the S3 API such that encryption/decryption occur transparently to the application. If your

Re: Hive SerDe Error

2015-03-31 Thread David Tucker
I believe you’ll need to put the custom jar in DRILL’s classpath (it does not include $HIVE_HOME/lib be default since there’s no guarantee it will be on all cluster nodes). I’ve been successful putting the extra libraries I need for object-store access into $DRILL_HOME/jars/3rdparty. — Davi

Re: Query performance and clustering

2015-03-25 Thread David Tucker
I’ll second Adnries’ comment about measurable performance in AWS : you should not expect consistency there (especially with instance types that are smaller than a physical server, such as the c3.xlarge instances you’re using). How does the memory utilization look during your queries ? Memory p

Re: Strange error when trying to use REST interface to configure a storage plug-in

2015-01-14 Thread David Tucker
ntType? It could be putting in the Content-Type header in > additiin to the one you specified. > On Jan 14, 2015 12:00 AM, "David Tucker" wrote: > >> Has anyone seen the error >>Error 415 Unsupported Media Type >> when trying to use the REST interfa

Strange error when trying to use REST interface to configure a storage plug-in

2015-01-13 Thread David Tucker
Has anyone seen the error Error 415 Unsupported Media Type when trying to use the REST interface to create a new plug-in ? — David DETAILS The json file describing the plug-in is : { "name" : "mdb", "config" : { "type" : "hbase", "config" : { "hbase.table.namespace.ma

Re: Best practices for DRILL_CLASSPATH setting

2014-12-01 Thread David Tucker
te: > >> Yes, drill-env.sh would be the place to put this, regardless of how Drill >> is deployed. >> >> On Mon, Dec 1, 2014 at 1:08 PM, David Tucker wrote: >> >>> >>> I can see the mechanics for adding additional jars to the invocation of &g

Best practices for DRILL_CLASSPATH setting

2014-12-01 Thread David Tucker
I can see the mechanics for adding additional jars to the invocation of the drillbit via the DRILL_CLASSPATH environment variable (used in $DRILL_HOME/bin/drill-config.sh). For drill deployments within a MapR cluster (where the mapr-drill package is installed, so the drillbit is managed by the

Client-only installation

2014-12-01 Thread David Tucker
Is there a process for installing the sqlline tool independent of the complete drillbit ? The scenario I’m considering is a cluster of 10 nodes where only 5 would have drill-bits installed but users will expect to connect to the drill cluster arbitrarily from any node out of the 10. — David