what am I missing?

2016-02-22 Thread Ted Schwartz
I'm new to drill and trying to get up and running. My goal is to access drill from a JDBC client. I'm a bit confused when starting drill. If I use drill-embedded, it appears it only allows one connection and that connection is started along with drill. So if instead I try to run in distributed

Re: what am I missing?

2016-02-22 Thread Abdel Hakim Deneche
To run Drill in distributed mode you need to have Zookeeper up and running. This shouldn't be too complicated; you can find more details here: https://zookeeper.apache.org/doc/r3.1.2/zookeeperStarted.html#sc_InstallingSingleMode. On my Mac I used brew and it took care of everything. On Mon, Feb 2

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-02-22 Thread Neeraja Rentachintala
Jacques, I think the current proposal by Sudheesh is an API-level change to pass this additional end user id during connection establishment. Can you elaborate on what you mean by random query? -Neeraja On Sun, Feb 21, 2016 at 5:07 PM, Jacques Nadeau wrote: > Sudheesh, thanks for putting this

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-02-22 Thread Jacques Nadeau
Maybe I misunderstood the design document. I thought this was how the user would be changed: "Provide a way to change the user after the connection is made (details) through a session option" Did I miss something? -- Jacques Nadeau CTO and Co-Founder, Dremio On Mon, Feb 22, 2016 at 9:06 AM

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-02-22 Thread Sudheesh Katkam
“… when creating this connection, as part of the connection properties (JDBC, C++ Client), the application passes the end user’s identity (e.g. username) …” I had written the change user as a session option as part of the enhancement only, where you’ve pointed out a better way. I addressed your

Re: what am I missing?

2016-02-22 Thread Andries Engelbrecht
When running in clustered mode you need ZK running and must point to it appropriately. It seems you don't have ZK running in your environment. You can, however, connect multiple sessions to Drill in embedded mode. Simply point directly to the drillbit as opposed to using ZK. I.e. Start Drill with drill-embed
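
The difference between pointing at the drillbit directly and going through ZK shows up in the JDBC connection string. Below is a minimal sketch, not taken from the thread: the class name DrillUrls and its helper methods are hypothetical, and the host names and ports (31010 for the drillbit user port, 2181 for the ZooKeeper client port) are assumed defaults that may differ in your installation.

```java
// Hypothetical helper for illustration only; not part of Drill.
public final class DrillUrls {
    private DrillUrls() {}

    // Direct connection to a single drillbit, bypassing ZooKeeper.
    // This is the form to use against a drillbit started in embedded mode.
    public static String direct(String host, int port) {
        return "jdbc:drill:drillbit=" + host + ":" + port;
    }

    // Connection that discovers drillbits through ZooKeeper (distributed mode).
    // zkHostPort is the ZooKeeper host and port, e.g. "localhost:2181".
    public static String viaZooKeeper(String zkHostPort) {
        return "jdbc:drill:zk=" + zkHostPort;
    }

    public static void main(String[] args) {
        // Assumed default ports; adjust to your environment.
        System.out.println(direct("localhost", 31010));
        System.out.println(viaZooKeeper("localhost:2181"));
    }
}
```

With the Drill JDBC driver on the classpath, either string can be passed to java.sql.DriverManager.getConnection(url). The direct form does not require ZK to be running.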

Drill join performance

2016-02-22 Thread Dmitry Krivov
Hello, I have loaded (via CTAS) Star Schema Benchmark generated CSV data (scale factor 50) into Parquet files. For one of the benchmark queries, like: select d.d_year, c.c_region, sum(l.lo_extendedprice*l.lo_discount) as revenue from dfs.tpch.lineorder_part l, dfs.tpch.dates d, dfs.tpch.custom

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-02-22 Thread Neeraja Rentachintala
It seems to me that for phase 1, we should only have this as connection level property and have the list of proxy users as a static bootstrap option. Drill doesn't have a very granular privilege model other than admins vs non-admins, so till then exposing this via system options seems like a risk t

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-02-22 Thread Keys Botzum
Connection level identity setting is only viable if the scalability concerns I raised in the doc and Jacques indirectly raised are addressed. Historically DB connections have been so expensive that most applications created pools of connections and reused them across users. That model doesn't w

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-02-22 Thread Jacques Nadeau
Got it, makes sense. Do you have an interface proposal? I didn't see that. Also, what do you think about my comment and Keys' response about moving pooling to the Driver and then making "connection" lightweight? -- Jacques Nadeau CTO and Co-Founder, Dremio On Mon, Feb 22, 2016 at 9:59 AM, Sudhee

Re: Drill join performance

2016-02-22 Thread Abdel Hakim Deneche
Hello Dmitry, Welcome to Drill's community :) What version of Drill are you using? Also, can you share the query profile of your query? It helps to show what is taking most of the time. Thanks On Mon, Feb 22, 2016 at 10:54 AM, Dmitry Krivov wrote: > Hello > > I have load (as CTAS) into parquet-

RE: what am I missing?

2016-02-22 Thread Geercken, Uwe
Hi Ted, The host and port are the Zookeeper host and port. If you run Drill in distributed mode, you will have to have Zookeeper running. Drill uses Zookeeper to store configuration data. Go on and download and install Zookeeper. Then run it: "bin/zkServer start". It will run by default on port