Re: Drill on Azure

2016-08-02 Thread Santosh Kulkarni
That's great! Thanks Neeraja.

On Tue, Aug 2, 2016 at 6:18 PM, Neeraja Rentachintala <
nrentachint...@maprtech.com> wrote:

> Just happened to see this great set of comprehensive blog/tutorials on how
> to deploy Drill on Azure and use it with a variety of sources on Azure.
> Wanted to share the link with the other users that might be interested in
> this topic.
>
>
> https://blogs.msdn.microsoft.com/data_otaku/2016/05/27/deploying-apache-drill-on-azure/
>


Drill view in shell mode

2016-07-28 Thread Santosh Kulkarni
How to see a view created in Drill Explorer thru Drill shell? Is there any
any command for Drill views?

Thanks,

Santosh


Re: Connecting to Drill ODBC DSN takes exceptionally long time

2016-07-27 Thread Santosh Kulkarni
Thanks Neeraja.
We are using TDC file with the ODBC driver.



On Tue, Jul 26, 2016 at 12:06 AM, Neeraja Rentachintala <
nrentachint...@maprtech.com> wrote:

> Andries did a great job on putting together the material below on this
> topic. This info will be helpful to you to optimize metadata access
> experience from Tableay.
> Additionally make sure you are using the Tableau TDC file that ships with
> Drill ODBC drier.
>
>
> https://community.mapr.com/community/answers/blog/2016/07/20/drill-best-practices-for-bi-and-analytical-tools
>
> -Neeraja
>
> On Mon, Jul 25, 2016 at 9:17 PM, Santosh Kulkarni <
> santoshskulkarn...@gmail.com> wrote:
>
> > While connecting Tableau to Drill ODBC DSN, it takes almost 5 minutes to
> > connect to Drill.I created 2 DSN, one for Zookeeper Quorum and other one
> > for Direct to drillbit, Both take very long time to connect successfully
> to
> > Drill.
> >
> > Also, after the connection just to open Schema and the tables within the
> > schema it takes another few minutes. Underlying datasource is Hive.
> >
> > Any thoughts on what causes this issue?
> >
> > Thanks,
> >
> > Santosh
> >
>


Connecting to Drill ODBC DSN takes exceptionally long time

2016-07-25 Thread Santosh Kulkarni
While connecting Tableau to Drill ODBC DSN, it takes almost 5 minutes to
connect to Drill.I created 2 DSN, one for Zookeeper Quorum and other one
for Direct to drillbit, Both take very long time to connect successfully to
Drill.

Also, after the connection just to open Schema and the tables within the
schema it takes another few minutes. Underlying datasource is Hive.

Any thoughts on what causes this issue?

Thanks,

Santosh


Re: Drill on Hive table

2016-07-17 Thread Santosh Kulkarni
For the most part it is resolved by kinit - R command from the shell
prompt. We also created a script to renew the Kerberos ticket daily. But
there were couple of instances when I still see this issue. Especially for
Hive table in parquet format.

On Tue, Jul 5, 2016 at 8:05 AM, Joseph Swingle <joeswin...@gmail.com> wrote:

> Did you get this resolved?  I have the same issue
>
> On Sun, Jul 3, 2016 at 10:49 PM, Santosh Kulkarni <
> santoshskulkarn...@gmail.com> wrote:
>
> > Hello,
> >
> > For a simple query select count(*) from table_name, Drill gives an error.
> > Error: SYSTEM ERROR: IOException: Can't get Master Kerberos principal for
> > use as renewer.
> >
> > I also ran the kinit -kt command and confirmed the ticket is active.
> >
> > So, if the Kerberos ticket is active and if I can query few other tables
> > why Drill gives error for certain tables although the file formats,
> > compression, table type are the same? Number of columns in this table is
> > 273. Does Drill has any limitations on the number of columns?
> >
> > Tried select count(col1) from table_name, it gives the same error.
> >
> > Another observation:
> > Describe table_name in hive and Drill shows different datatypes for the
> > columns.
> > e.g. String, tinyint in Hive shows as Character varying and Integer in
> > Drill. Is this by Design?
> >
> > Appreciate your help.
> >
> > Thanks,
> >
> > Santosh
> >
>


Re: CHAR data type

2016-07-04 Thread Santosh Kulkarni
Here is the link:

https://drill.apache.org/docs/supported-data-types/



On Mon, Jul 4, 2016 at 12:06 PM, rahul challapalli <
challapallira...@gmail.com> wrote:

> Can you point us to where you are looking? The documentation should only
> say that "CHAR" datatype in hive is supported from Drill 1.7 onward.
>
> - Rahul
>
> On Mon, Jul 4, 2016 at 9:53 AM, Santosh Kulkarni <
> santoshskulkarn...@gmail.com> wrote:
>
> > Thanks Shankar. I was looking in Drill documentation but did not realize
> to
> > check in 1.7 Release notes.
> >
> >
> >
> > On Mon, Jul 4, 2016 at 10:02 AM, Shankar Mane <
> shankar.m...@games24x7.com>
> > wrote:
> >
> > > It is being supported since 1.7.0. Please check this link
> > > https://drill.apache.org/docs/apache-drill-1-7-0-release-notes/
> > >
> > > On 04-Jul-2016 8:07 PM, "Santosh Kulkarni" <
> santoshskulkarn...@gmail.com
> > >
> > > wrote:
> > >
> > > Hello,
> > >
> > > While running another simple query for select count(*) from table_name,
> > > Drill gave an error for Unsupported Hive data type CHAR.
> > >
> > > The column is of CHAR(6) data type. Drill documentation shows CHAR as
> > > supported data type.
> > >
> > > This is on Drill version 1.6
> > >
> > > Thanks,
> > >
> > > Santosh
> > >
> >
>


Re: CHAR data type

2016-07-04 Thread Santosh Kulkarni
Thanks Shankar. I was looking in Drill documentation but did not realize to
check in 1.7 Release notes.



On Mon, Jul 4, 2016 at 10:02 AM, Shankar Mane <shankar.m...@games24x7.com>
wrote:

> It is being supported since 1.7.0. Please check this link
> https://drill.apache.org/docs/apache-drill-1-7-0-release-notes/
>
> On 04-Jul-2016 8:07 PM, "Santosh Kulkarni" <santoshskulkarn...@gmail.com>
> wrote:
>
> Hello,
>
> While running another simple query for select count(*) from table_name,
> Drill gave an error for Unsupported Hive data type CHAR.
>
> The column is of CHAR(6) data type. Drill documentation shows CHAR as
> supported data type.
>
> This is on Drill version 1.6
>
> Thanks,
>
> Santosh
>


CHAR data type

2016-07-04 Thread Santosh Kulkarni
Hello,

While running another simple query for select count(*) from table_name,
Drill gave an error for Unsupported Hive data type CHAR.

The column is of CHAR(6) data type. Drill documentation shows CHAR as
supported data type.

This is on Drill version 1.6

Thanks,

Santosh


Drill on Hive table

2016-07-03 Thread Santosh Kulkarni
Hello,

For a simple query select count(*) from table_name, Drill gives an error.
Error: SYSTEM ERROR: IOException: Can't get Master Kerberos principal for
use as renewer.

I also ran the kinit -kt command and confirmed the ticket is active.

So, if the Kerberos ticket is active and if I can query few other tables
why Drill gives error for certain tables although the file formats,
compression, table type are the same? Number of columns in this table is
273. Does Drill has any limitations on the number of columns?

Tried select count(col1) from table_name, it gives the same error.

Another observation:
Describe table_name in hive and Drill shows different datatypes for the
columns.
e.g. String, tinyint in Hive shows as Character varying and Integer in
Drill. Is this by Design?

Appreciate your help.

Thanks,

Santosh


Drill on Kerberos (Hive Storage plugin)

2016-07-03 Thread Santosh Kulkarni
Hi,

Drill query gives the following error for Hive storage plugin.

show databases;

Error: SYSTEM ERROR: GSSException: No valid credentials provided (Mechanism
Level: Failed to find any Kerberos tgt)
This worked before.

After renewing the Kerberos connection using kinit -R command, query shows
results. Tried few hours later and it gave the same error again.

By default the system generates the user ticket. Ensured the ticket is not
expired  using klist -e.

Hive works fine without any issues for the same query.

Any pointers how to fix this issue?

Thanks,

Santosh


Re: Apache Drill vs PrestoDB

2016-06-25 Thread Santosh Kulkarni
The Quora post gives a good design overview. However, I was looking for
more of operational and performance comparison given the same dataset and
most of the points you mentioned.



On Wed, Jun 8, 2016 at 12:56 PM, rahul challapalli <
challapallira...@gmail.com> wrote:

> The post on quora gives a good overview. It would be helpful if you can
> provide some insights into what you are trying to achieve. Few questions to
> that end
>
>   1. Who will be the users of your application
>   2. Where does your data live and in what format
>   3. What is the scale of data you want to the tool to handle
>   4. Interactive queries or long running queries(> 1Hr)
>   5. Maximum no of concurrent users you expect
>   6. Authentication/Authorization requirements
>   7. Any SLA's around query response times
>   8. Any specific BI tools that need to be supported
>
> - Rahul
>
> On Tue, Jun 7, 2016 at 8:07 PM, Santosh Kulkarni <
> santoshskulkarn...@gmail.com> wrote:
>
> > Hi,
> >
> > While searching for comparison between Drill and Presto, google search
> > gives a high level design comparison posted on Quora.
> >
> > Does anyone has more detailed comparison on these 2 tools?
> >
> > Thanks in advance.
> >
> > Santosh
> >
>


Apache Drill vs PrestoDB

2016-06-07 Thread Santosh Kulkarni
Hi,

While searching for comparison between Drill and Presto, google search
gives a high level design comparison posted on Quora.

Does anyone has more detailed comparison on these 2 tools?

Thanks in advance.

Santosh


Storage plugin for Cassandra and ElasticSearch

2016-06-07 Thread Santosh Kulkarni
Hello All,

I was looking for storage plugin for Cassandra and ElasticSearch. Both
Cassandra and ElasticSearch has a large and growing user community.
Wondering if anyone can shed some light on the availability of these
plugins in Drill?

Thanks,

Santosh


Which organizations are using Apache Drill?

2016-06-07 Thread Santosh Kulkarni
Hello,

We plan to conduct a POC on Apache Drill and would like to know which large
organizations are using Drill? What are some of the challenges/lessons
learnt along with way?

What has been your experience?

Thanks,

Santosh