Re: How to avoid case sensitivity in group by

2017-02-09 Thread Dechang Gu
How about the following:
0: jdbc:drill:schema=dfs.tpchRaw> select lower(state), sum(price) from junk group by lower(state);
+----------+---------+
| EXPR$0   | EXPR$1  |
+----------+---------+
| alaska   | 165     |
| florida  | 170     |
| texas    | 80      |
+----------+---------+
3 rows
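The idea in the query above (normalize the key with lower() in both the select list and the group by) can be tried without a Drill cluster. The following is only a sketch: it uses Python's built-in sqlite3 module rather than Drill, and the rows in ROWS are hypothetical, chosen so that the totals match the output shown in the thread. The actual contents of the junk table are not given.

```python
import sqlite3

# Hypothetical rows: the thread does not show what the `junk` table contains.
# Mixed-case spellings of the same state are what the query has to merge.
ROWS = [
    ("Alaska", 100), ("ALASKA", 65),
    ("Florida", 70), ("florida", 100),
    ("Texas", 80),
]


def grouped_totals(rows):
    """Sum price per state, treating state names case-insensitively."""
    conn = sqlite3.connect(":memory:")
    conn.execute("create table junk (state text, price integer)")
    conn.executemany("insert into junk values (?, ?)", rows)
    # Grouping on lower(state) folds 'Alaska' and 'ALASKA' into one group.
    query = (
        "select lower(state) as state, sum(price) as total "
        "from junk group by lower(state) order by state"
    )
    result = conn.execute(query).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for state, total in grouped_totals(ROWS):
        print(state, total)
```

Adding column aliases, as in this sketch, also replaces generated column names such as EXPR$0 with readable ones.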

Re: How to avoid case sensitivity in group by

2017-02-08 Thread Dechang Gu
Sanjiv,
Can you share the query and the output you get through SQL Server?
Thanks,
Dechang
From: Sanjiv Kumar
Sent: Wednesday, February 8, 2017 5:01:27 AM
To: user@drill.apache.org
Subject: Re: How to avoid case sensitivity in group by

Re: Problem in WHERE clause querying HBase

2017-02-07 Thread Dechang Gu
Hi Andrea, Thank you for trying out Drill and identifying the issue. According to the Drill documentation, column names in HBase should be case-sensitive: https://drill.apache.org/docs/lexical-structure/ Please file a JIRA to track the issue so that the Drill developers can fix it. Thanks, Dechang Lexical

Re: drill return count(*) is wrong

2016-12-21 Thread Dechang Gu
I tried something similar to your query, the result looks correct to me:
0: jdbc:drill:schema=dfs.tpchPar100> select t.n, count(*) cnt from (select distinct c_address addr, c_nationkey n from customer limit 10) t group by t.n;
+-----+------+
| n   | cnt  |
+-----+------+
| 7

Re: Aggregation performance

2016-12-19 Thread Dechang Gu
0.007s 0.012s 0.000s 0.000s 0.000s 116KB 116KB
> 05-xx-02 PROJECT 0.000s 0.001s 0.004s 0.000s 0.001s 0.004s 0.000s 0.000s 0.000s 100KB 100KB
> 05-xx-03 UNKNOWN_OPERATOR 0.000s 0.000s 0.000s 1.350s 4.903s 7.817s 0.000s 0.000s 0.000s 280KB 280KB
>
> Thanks & regard

Re: Aggregation performance

2016-12-15 Thread Dechang Gu
Yousuf,
Which version of Drill are you running? Can you share the profile of the query?
Thanks,
Dechang
On Thu, Dec 15, 2016 at 3:27 AM, yousuf wrote:
> Hello experts
>
> As a POC project, I've built a drill cluster on 5 VMs, each with the
> following specs
>
> 32 GB

Re: Distributed Mode In Window Operating System

2016-08-04 Thread Dechang Gu
As far as I know, all of our Drill deployments are on Linux. The following blog post shows how to deploy Drill on Azure, which may provide some insight:
https://blogs.msdn.microsoft.com/data_otaku/2016/05/27/deploying-apache-drill-on-azure/
On Thu, Aug 4, 2016 at 12:25 AM,

Re: Drill CPU Usage

2016-08-02 Thread Dechang Gu
On Tue, Aug 2, 2016 at 12:49 AM, Avi Haleva wrote:
> Hi,
> I'm in the process of evaluating Drill as an analytic repository.
> I've noticed that even when idle, the drillbit process consumes 20% of a
> single core constantly.
>
> Is that expected behavior or have I miss

Re: concurrent get connection in different node

2016-08-01 Thread Dechang Gu
You could connect each query to a different drillbit, for example in a round-robin fashion. See:
https://drill.apache.org/docs/using-the-jdbc-driver/#using-the-jdbc-url-format-for-a-direct-drillbit-connection
On Fri, Jul 29, 2016 at 2:27 AM, qiang li wrote:
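A minimal sketch of the round-robin idea, assuming the direct-connection URL form jdbc:drill:drillbit=<host>:<port> described on the page linked above. The host names are hypothetical, and port 31010 is assumed to be the user port configured on the cluster. The sketch only produces the URLs in rotation; opening the actual JDBC connection is left to whatever client is in use.

```python
from itertools import cycle

# Hypothetical drillbit hosts; replace with the nodes of your own cluster.
DRILLBITS = ["drill-node1", "drill-node2", "drill-node3"]
# Assumed user port; adjust if your cluster is configured differently.
PORT = 31010


def direct_url(host, port=PORT):
    """Build a JDBC URL that connects straight to one drillbit."""
    return f"jdbc:drill:drillbit={host}:{port}"


def round_robin_urls(hosts, port=PORT):
    """Return an endless iterator that rotates through the drillbit URLs."""
    return cycle([direct_url(host, port) for host in hosts])


if __name__ == "__main__":
    urls = round_robin_urls(DRILLBITS)
    # Each new query would take the next URL, spreading connections evenly.
    for _ in range(4):
        print(next(urls))
```

Calling next() on the iterator before each new connection spreads the foreman role across the nodes instead of sending every query to the same drillbit.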

Re: Query execution in Drill

2016-06-03 Thread Dechang Gu
On Fri, Jun 3, 2016 at 7:41 AM, Sebastian Hamann wrote:
> Hi,
> I am currently trying to write a shell script to automatically set up drill
> (embedded for now) and execute queries on it.
> 1. Is there an option to execute SQL from command line or file, like in
> Hive?
> (I mean

Re: Performance tuning for TPC-H Q1 on a three nodes cluster

2016-05-25 Thread Dechang Gu
data will be created under the table lineitem directory. Can you please create the metadata file and provide it to me so we can check the row group distribution?
Thanks,
Dechang
>
> My HDFS block size is 256MB, and all my files are of 100MB on average,
> perhaps not small IO?
>

Re: Performance tuning for TPC-H Q1 on a three nodes cluster

2016-05-24 Thread Dechang Gu
and attached to this mail. Thank you.
On Mon, May 23, 2016 at 11:12 PM, Dechang Gu <d...@maprtech.com> wrote:
Hi Yijie,
This is Dechang at MapR. I work on Drill performance. From what you described, it looks like the scan took most of the time. How are the files

Re: Performance tuning for TPC-H Q1 on a three nodes cluster

2016-05-23 Thread Dechang Gu
Hi Yijie,
This is Dechang at MapR. I work on Drill performance. From what you described, it looks like the scan took most of the time. How are the files distributed on the disks? Is there any skew? How many disks are there? If possible, can you provide the profile for the run?
Thanks,
Dechang
On