Re: Drill + gzipped-CSV performance

2015-10-07 Thread Ted Dunning
On Wed, Oct 7, 2015 at 2:03 PM, Jason Altekruse wrote: > Here is a presentation with some helpful information (I haven't read all > of it, but the table on slide 7 gives a nice overview of features in each > codec).

Re: Drill + gzipped-CSV performance

2015-10-07 Thread Jacques Nadeau
Another issue you might be running into: I have seen situations where gzip does not use the native library for decompression. You should check whether the native library is actually being used. -- Jacques Nadeau CTO and Co-Founder, Dremio On Wed, Oct 7, 2015 at 8:27 AM, Andy Pernsteiner

Re: Query Builder & Productivity tools

2015-10-07 Thread Fabian Wilckens
I personally had good results so far with visualization tools ... in the Drill UI you can see the queries that have been executed. On Wed, Oct 7, 2015 at 12:33 PM, Boris Chmiel < boris.chm...@yahoo.com.invalid> wrote: > Hello users, > I wonder which productivity tools are you using to build

Query Builder & Productivity tools

2015-10-07 Thread Boris Chmiel
Hello users, I wonder which productivity tools you are using to build your queries with Drill. I found myself using a text editor for building queries, sqlline to run files and the console to remove CTAS results when needed. I am ok with those tools but from an end user perspective - let's say a

Re: Querying k/v pairs in a CSV column

2015-10-07 Thread Tugdual Grall
You can create a custom function to do that: - https://drill.apache.org/docs/develop-custom-functions/ This will end up with something like: select * from *.csv where GET_VALUE(col[x] , 'var1' ) = 'value' (I will publish, in the next few days, a similar function parsing URL in values in this repo
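The lookup that a GET_VALUE-style function would perform can be sketched as follows. This is only an illustration of the parsing logic in Python, not the Drill function itself: an actual Drill custom function would be written in Java as described in the linked documentation. The name `get_value`, its parameters, and the space and `=` separators are assumptions taken from the example data in this thread.

```python
from typing import Optional


def get_value(cell: str, key: str, pair_sep: str = " ", kv_sep: str = "=") -> Optional[str]:
    """Return the value stored under `key` in a 'k=v k=v' string, or None if absent.

    Hypothetical helper: separators are assumed from the thread's sample data.
    """
    for pair in cell.split(pair_sep):
        # partition() splits on the first separator only, so values may contain '='.
        name, sep, value = pair.partition(kv_sep)
        if sep and name == key:
            return value
    return None


if __name__ == "__main__":
    cell = "var1=test var2=other var3=x"
    print(get_value(cell, "var2"))
```

Returning None for a missing key keeps the function usable in a filter, since a comparison against a literal simply fails for rows that lack the key.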

Re: Querying k/v pairs in a CSV column

2015-10-07 Thread Tomer Shiran
I think it would be much better to leverage something like CONVERT_FROM(col, 'MyKeyValueFormat') where the entire key/value cell is parsed into Drill's internal structure just like we do with JSON. That exposes all those key/value pairs such that you can leverage them in other SQL operators. On
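The idea of parsing the whole cell into a structure, rather than extracting one key at a time, can be sketched as below. This is a Python illustration under stated assumptions, not Drill code: 'MyKeyValueFormat' is the placeholder name used in the message above, `parse_kv_cell` is a hypothetical helper, and the space and `=` separators are assumed from the sample data in this thread.

```python
import json


def parse_kv_cell(cell: str) -> dict:
    """Parse 'k=v k=v ...' into a dict, skipping tokens that have no '='.

    Hypothetical helper illustrating a whole-cell conversion.
    """
    result = {}
    for pair in cell.split():
        name, sep, value = pair.partition("=")
        if sep:
            result[name] = value
    return result


if __name__ == "__main__":
    parsed = parse_kv_cell("var1=test var2=test var3=test")
    # Once the cell is a map, every key is available at once, much like a JSON object.
    print(json.dumps(parsed))
```

With the cell exposed as a map, each key can be filtered, projected, or grouped on without writing a separate lookup per key.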

Re: Query Builder & Productivity tools

2015-10-07 Thread Andries Engelbrecht
Drill Explorer and Squirrel are handy tools to explore the data and create Views, etc on DFS or complex data sources. A text editor with Drill Explorer or SQLline can be handy, but you may want to look at how to configure JDBC on your system to use Squirrel.

Querying k/v pairs in a CSV column

2015-10-07 Thread michael.england
Hi, If I have a delimiter-separated file (PSV/TSV etc) with a number of columns, each with a single value aside from the last column, which contains a bunch of key/value pairs in a string (e.g. var1=test var2=test var3=test), is it possible to query based on individual key/value items in the last column? I

Re: Querying k/v pairs in a CSV column

2015-10-07 Thread Tugdual Grall
Hi, Can you clarify how to use our own format in CONVERT_FROM? 1- Is it supported out of the box? (I cannot find that in the doc - https://drill.apache.org/docs/supported-data-types/#data-types-for-convert_to-and-convert_from-functions - but I may not be looking in the proper place.) 2- it is another

Re: JDBC driver for MySQL - storage plugin config and push down ?

2015-10-07 Thread Neeraja Rentachintala
Andrew, thanks for the update. Also, is it possible for you to share a brief set of instructions on how to configure/use this plugin with a database, let's say MySQL (or Postgres or ..)? There have been a few questions on the threads around this and they could benefit from a quick summary. -Neeraja On

Re: Drill + gzipped-CSV performance

2015-10-07 Thread Andy Pernsteiner
In thinking this through, a slowdown is probably somewhat expected when data has to be decompressed (especially gzip) as part of running a Drill query. Andy Pernsteiner, Manager, Field Enablement, www.mapr.com From: Andy
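One rough way to get a feel for the decompression cost mentioned above, outside of Drill, is to time reading the same CSV bytes with and without gzip. The sketch below uses only Python's standard library on synthetic in-memory data; the helpers `make_csv` and `time_read` are hypothetical, and the numbers say nothing about Drill's own readers or about whether a native gzip library is in use.

```python
import gzip
import io
import time


def make_csv(rows: int) -> bytes:
    """Build synthetic CSV bytes (made-up data, for timing only)."""
    return "".join(f"{i},name{i},{i * 3}\n" for i in range(rows)).encode()


def time_read(data: bytes, compressed: bool) -> float:
    """Return seconds spent reading `data` to the end in 1 MiB chunks."""
    start = time.perf_counter()
    raw = io.BytesIO(data)
    stream = gzip.open(raw, "rb") if compressed else raw
    with stream:
        while stream.read(1 << 20):
            pass
    return time.perf_counter() - start


plain = make_csv(200_000)
packed = gzip.compress(plain)

if __name__ == "__main__":
    print(f"plain:   {len(plain)} bytes, {time_read(plain, False):.4f} s")
    print(f"gzipped: {len(packed)} bytes, {time_read(packed, True):.4f} s")
```

The gzipped read does strictly more CPU work per byte of output, which is the slowdown described above; whether that is offset by reading fewer bytes from disk depends on the storage and is not measured here.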

Re: JDBC driver for MySQL - storage plugin config and push down ?

2015-10-07 Thread Andrew Selden
There are 2 issues with the latest build that make the JDBC storage plugin unusable. I will be publishing a fix for both later today. The first issue is that the assembly phase wasn’t including the drill-module.conf for the JDBC plugin, so you could never actually register a JDBC source. The