Re: Making Query to Remote MongoDB Faster

2016-03-01 Thread Rifat Mahmud
Created an issue: https://issues.apache.org/jira/browse/DRILL-4462 It seems the number of collections doesn't matter. The query takes the same amount of time (around 27 seconds) on a database containing only 2 collections. On Tue, Mar 1, 2016 at 10:17 PM, Jacques Nadeau wrote: > If the time is all spent

Re: [DISCUSS] New Feature: Drill Client Impersonation

2016-03-01 Thread Sudheesh Katkam
Thank you all for the feedback. + I am naming this feature User Delegation (since Client Impersonation can be confused with User Impersonation). + I updated the design document. + I opened a pull request (#400

Fwd: Drill Custom Authentication Startup Exception

2016-03-01 Thread Elango Ganesan
Hi All, I tried creating a custom authenticator with Drill 1.5.0 by building a jar based on the source code and Drill configuration below. I placed the jar in $DRILLHOME/jars/ drill.exec: { cluster-id: "drillbits1", zk.connect: "dmdvhc0002d:2181,dmdvhc0003d:2181,dmdvhc0004d:2181" security.use

Re: Memory Challenges

2016-03-01 Thread John Omernik
As a follow-up to the spilling discussion and the Drill hangout, I've opened a feature request to allow a "try hash, if out of memory, then execute as sort based" mode: https://issues.apache.org/jira/browse/DRILL-4460 The only question I have here, is the setting to run a query as a sort based query one of the op

Re: Drill Bit Heap Space Issues

2016-03-01 Thread John Omernik
That seems to allow longer queries (timeframe-wise). Interestingly, to_date(dir0) gives no issues, but plain dir0 gives heap space issues. Is this expected? Is this a minor bug (that would need a JIRA)? On Tue, Mar 1, 2016 at 11:32 AM, Jason Altekruse wrote: > To_date takes a long assumed to be a unix ti

Memory Challenges

2016-03-01 Thread John Omernik
As I am exploring memory challenges on some queries, I noticed a few things, had some questions, and thought I'd summarize here for the group (and hopefully get some questions answered). Basically, I was trying to run a query with a few aggregations, and it kept failing. I tried upping the memory

Drill Hangout Starting

2016-03-01 Thread Jacques Nadeau
https://plus.google.com/hangouts/_/dremio.com/drillhangout?authuser=0 -- Jacques Nadeau CTO and Co-Founder, Dremio

Re: Drill Bit Heap Space Issues

2016-03-01 Thread Jason Altekruse
To_date takes a long assumed to be a unix timestamp, so the error you are getting here is from an implicit cast trying to turn the string into a long before converting it to a date. You can provide a second parameter to tell it how you would like to parse your string to properly parse these kinds o
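
The distinction described above can be illustrated outside Drill. The following is only a rough Python analogy, not Drill's implementation: the `to_date` helper is hypothetical, the timestamp unit (seconds) is an assumption, and Python's `%Y-%m-%d` format codes are not the pattern syntax Drill expects. It shows why a date-like string fails on the numeric path and parses once a format is supplied.

```python
from datetime import datetime, timezone


def to_date(value, fmt=None):
    """Hypothetical stand-in for a to_date-style function (illustration only)."""
    if fmt is None:
        # One-argument form: the input is treated as a Unix timestamp
        # (seconds here, by assumption), so a string is first cast to an
        # integer. A string such as "2015-11-12" fails that cast.
        return datetime.fromtimestamp(int(value), tz=timezone.utc).date()
    # Two-argument form: the caller states how the string should be parsed.
    return datetime.strptime(value, fmt).date()
```

Calling `to_date("2015-11-12")` raises a `ValueError` from the integer cast, which plays the role of the `NumberFormatException` reported in this thread, while `to_date("2015-11-12", "%Y-%m-%d")` parses the string as a date.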

Re: Drill Bit Heap Space Issues

2016-03-01 Thread John Omernik
In the view I have select to_date(dir0) as sub_date... When I run a query, I get "Error: SYSTEM ERROR: NumberFormatException: 2015-11-12" even though I am using a where clause of sub_date >= '2016-02-20', although I think this has to do with the planning slowness I've spoken about. On Tue, Mar 1,

Re: Making Query to Remote MongoDB Faster

2016-03-01 Thread Jacques Nadeau
If the time is all spent in planning, that won't have much impact as query planning is not distributed. It sounds like we may be taking a while to plan queries against Mongo when there are a large number of collections. Can you open a JIRA that we can take a look at? -- Jacques Nadeau CTO and Co-

Rolling Window

2016-03-01 Thread François Méthot
Hi, We need to manage a rolling window of Parquet data within Drill. Our Parquet files are partitioned by hour. Once HDFS reaches a certain usage threshold, we want to delete the oldest partition folder. A simple approach would be to run a cron job that checks the HDFS usage and deletes the oldest
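
The selection step of such a cron job might look like the minimal sketch below. Everything in it is an assumption for illustration: the function name, the hour-partition naming scheme (names that sort chronologically as strings), and the idea that partition sizes and current usage have already been gathered by some other means. It only decides which folders to drop; the actual deletion from HDFS is left out.

```python
from typing import Dict, List


def partitions_to_delete(
    partition_sizes: Dict[str, int],
    used_bytes: int,
    threshold_bytes: int,
) -> List[str]:
    """Pick the oldest partitions to drop until usage is at or below the threshold."""
    doomed: List[str] = []
    # Assumed naming: hour partitions such as "2016-03-01-13" sort
    # chronologically when compared as plain strings.
    for name in sorted(partition_sizes):
        if used_bytes <= threshold_bytes:
            break
        doomed.append(name)
        used_bytes -= partition_sizes[name]
    return doomed
```

Keeping the decision separate from the deletion makes it possible to log or dry-run the list before anything is removed.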

Re: Making Query to Remote MongoDB Faster

2016-03-01 Thread Jinfeng Ni
If the query time is mainly spent on query planning, then running in distributed mode in a cluster will not help shorten the latency, as the query planning is done on the Foreman, which is just a single node in the cluster. Can you please try to query a MongoDB database with fewer collections? From the

Re: Drill Bit Heap Space Issues

2016-03-01 Thread Jacques Nadeau
In the view. -- Jacques Nadeau CTO and Co-Founder, Dremio On Tue, Mar 1, 2016 at 6:02 AM, John Omernik wrote: > In the view or in the query? > > On Mon, Feb 29, 2016 at 9:05 PM, Jacques Nadeau > wrote: > > > Can you try to convert src_date to a date type? > > > > -- > > Jacques Nadeau > > CTO

Re: JDBC plugin case sensitive table names

2016-03-01 Thread Paul Mogren
https://issues.apache.org/jira/browse/DRILL-4458 Thanks, Paul On 3/1/16, 12:42 AM, "Jacques Nadeau" wrote: >That definitely looks like a bug. Can you file a JIRA? I'll get someone to >look at it. > >thanks, >Jacques > > >-- >Jacques Nadeau >CTO and Co-Founder, Dremio > >On Mon, Feb 29, 2016

Re: Drill Bit Heap Space Issues

2016-03-01 Thread John Omernik
In the view or in the query? On Mon, Feb 29, 2016 at 9:05 PM, Jacques Nadeau wrote: > Can you try to convert src_date to a date type? > > -- > Jacques Nadeau > CTO and Co-Founder, Dremio > > On Mon, Feb 29, 2016 at 10:28 AM, John Omernik wrote: > > > I am running 6 drill bits, they were running