Re: Connecting to Hive provided by AWS EMR

2015-06-26 Thread Venki Korukanti
Hi, What port is your Hive metastore listening? The default port is 9083. In your case you provided 1 (as part of hive.metastore.uris). Can you double check if that is the correct one. Also you need provide fs.default.name and other s3 related settings in Hive storage plugin config. Thanks V

Connecting to Hive provided by AWS EMR

2015-06-26 Thread Paul Mogren
I have scoured the Drill website and mailing list, and Google, and have come up with no advice. Can you help? I started up an EMR cluster with AWS Hive 0.13.1 installed, started the metastore service: hive/bin/hive ‹service metastore, created a table: CREATE TABLE apachelog ( host STRING, ID

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Hao Zhu
I think double quotes should be default. So anyway this is a bug that we need to fix. You can use whatever other quote to separate the columns, but you have to manually remove the double quotes. On Fri, Jun 26, 2015 at 12:21 PM, Chi-Lang Ngo wrote: > Thanks for the work-around. > > Didn't exp

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Chi-Lang Ngo
Thanks for the work-around. Didn't expect that you have to use single quote to have it process double quote :-) When I tried to specify double-quote (literally as well as escaping via \" or "") as "quote", the storage config UI didn't like that. On 26 June 2015 at 19:16, Hao Zhu wrote: > I ca

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Jason Altekruse
This is a reasonable hack for some cases, but I'm pretty sure this is going to break the most common purpose of having quotes at all. If you put the delimiter (tab) between quotes you are going to have it splitting on those characters where it shouldn't be. There is also the issue that the quotes

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Hao Zhu
I can reproduce the issue but I also have a workaround for it: *1. When storage plugin for "tsv" is default:* "tsv": { "type": "text", "extensions": [ "tsv" ], "delimiter": "\t" }, > select columns[0],columns[1] from `test.tsv`; +--+-+ | EXP

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Kristine Hahn
I think you might have a problem with your tsv file using spaces instead of tabs. CSV file contents: hello,1,2,3 hello,1,2,3 hello,1,2,3 TSV file contents (actual tab character, not spaces): hello 1 2 3 hello 1 2 3 hello 1 2 3 0: jdbc:drill:zk=local> select * from `/Users/khahn/Downloads/csv_test

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Kristine Hahn
There are some attributes that were introduced in Drill 1.0 that are partly documented (sorry no example): - http://drill.apache.org/docs/plugin-configuration-basics/#list-of-attributes-and-definitions (see "formats" . . . "quote") - http://drill.apache.org/docs/plugin-configuratio

Re: Access to file name while querying directories

2015-06-26 Thread Jacques Nadeau
Not at the moment. It has been a well requested. I can't find the JIRA at the moment but I believe we have one. Maybe someone can give feedback on which JIRA it is so you can follow it. On Fri, Jun 26, 2015 at 8:44 AM, Chi-Lang Ngo wrote: > Hi, > > Is there a way to access the file name as va

Access to file name while querying directories

2015-06-26 Thread Chi-Lang Ngo
Hi, Is there a way to access the file name as variable when querying for files in directories ? (similar to dir0, dir1 is exposed https://drill.apache.org/docs/querying-directories/) CL

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Chi-Lang Ngo
Filled https://issues.apache.org/jira/browse/DRILL-3393 On 26 June 2015 at 16:11, Jacques Nadeau wrote: > It definitely seems like a bug given your example. Can you file a jira? > > > > On Fri, Jun 26, 2015 at 7:27 AM, Chi-Lang Ngo wrote: > > > Hi, > > > > I'm having problem querying tab-delim

Re: drill configuration setting - rows overwriting one another

2015-06-26 Thread Jacques Nadeau
There were a number of changes in how SQLline displayed tables that had changing widths to make presentation better. There is a possibility that those changes are causing a problem. Can you file a JIRA with a simple reproduction? On Fri, Jun 26, 2015 at 6:47 AM, Jim Scott wrote: > That certainl

Re: Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Jacques Nadeau
It definitely seems like a bug given your example. Can you file a jira? On Fri, Jun 26, 2015 at 7:27 AM, Chi-Lang Ngo wrote: > Hi, > > I'm having problem querying tab-delimited (tsv) files which has quotes. > > Drill doesn't seem to recognise quotes in tsv while working fine for csv > files.

Re: drill scan all data problem

2015-06-26 Thread Jinfeng Ni
We are adding support of partitioning in CTAS, which may help in your case. CREATE TABLE Parquet_Table ( Column0, Column1, ...) PARTITION BY (Column0) FROM your_csv_file Then, the query would use partition pruning and see improved performance: SELECT * from Parquet_Table WHERE Column0 = '1'; Th

Re: drill scan all data problem

2015-06-26 Thread Andries Engelbrecht
How big are the files and what system are you running on? Can you provide a Drill show files for the directory listed? —Andries On Jun 26, 2015, at 12:47 AM, 陈礼剑 wrote: > Hi: > > > I have a csv file with 20,000,000 row. And create parquet file for each > 1,000,000 row, which means, I wi

Quotes not being recognized in tab delimited (tsv) files

2015-06-26 Thread Chi-Lang Ngo
Hi, I'm having problem querying tab-delimited (tsv) files which has quotes. Drill doesn't seem to recognise quotes in tsv while working fine for csv files. For example, given the following files test.tsv --- foobar bar "aa" "bc" --- test.csv -- foobar,bar "aa","bc" -- I

Re: drill configuration setting - rows overwriting one another

2015-06-26 Thread Jim Scott
That certainly rectified the issue. csv, tsv and vertical all work flawlessly. If I recall properly, when I use to run query with lots of columns I never had any problems like this. It also seems to have predated all the pretty coloring options that have been added. I can't recall having run lots

Re: drill view definition

2015-06-26 Thread Vince Gonzalez
select VIEW_DEFINITION from INFORMATION_SCHEMA.VIEWS where TABLE_NAME ='your_view_name'; On Fri, Jun 26, 2015 at 8:35 AM, Jacques Nadeau wrote: > I believe that the information is available from one of the information > Schema tables. > On Jun 26, 2015 5:19 AM, "Andy Pernsteiner" > wrote: > > >

Re: drill view definition

2015-06-26 Thread Jacques Nadeau
I believe that the information is available from one of the information Schema tables. On Jun 26, 2015 5:19 AM, "Andy Pernsteiner" wrote: > In Drill, a view is just a JSON file, which will live within the workspace > where you saved it. EG, if you do: > > create or replace view dfs.workspace.myvi

Re: drill view definition

2015-06-26 Thread Andy Pernsteiner
In Drill, a view is just a JSON file, which will live within the workspace where you saved it. EG, if you do: create or replace view dfs.workspace.myview as select * from mytable; It will create a file called 'myview.view.drill' , which will look similar to: { "name" : "testview", "sql" : "S

drill view definition

2015-06-26 Thread Vinupriyaa Muthusamypillai Ananthakrishna
How can we get the Drill views definition. The sql for which the drill view is created. Thanks Vinupriyaa

drill scan all data problem

2015-06-26 Thread 陈礼剑
Hi: I have a csv file with 20,000,000 row. And create parquet file for each 1,000,000 row, which means, I will have 20 parquet files in folder "/usr/download/com/togeek/data/csv/sample", now I use drill in embedded mode to select: SELECT * FROM dfs.`/usr/download/com/togeek/data/csv/sample