Re: Strange metadata from Text Reader

Paul Rogers Mon, 24 Jun 2019 14:44:36 -0700

Hi Charles,

Latest master? Please file a JIRA with repo steps. I’ll take a look.


- Paul

Sent from my iPhone

> On Jun 24, 2019, at 11:38 AM, Charles Givre <cgi...@gmail.com> wrote:
> 
> Hello Drill Devs,
> I'm noticing some strange behavior with the newest version of Drill.  If you 
> query a CSV file, you get the following metadata:
> 
> SELECT * FROM dfs.test.`domains.csvh` LIMIT 1
> 
> {
>  "queryId": "22eee85f-c02c-5878-9735-091d18788061",
>  "columns": [
>    "domain"
>  ],
>  "rows": [
>    {
>      "domain": "thedataist.com"
>    }
>  ],
>  "metadata": [
>    "VARCHAR(0, 0)",
>    "VARCHAR(0, 0)"
>  ],
>  "queryState": "COMPLETED",
>  "attemptedAutoLimit": 0
> }
> 
> 
> There are two issues here:
> 1.  VARCHAR now has precision 
> 2.  There are twice as many columns as there should be.
> 
> Additionally, if you query a regular CSV, without the columns extracted, you 
> get the following:
> 
> "rows": [
>    {
>      "columns": "[\"ACCT_NUM\",\"PRODUCT\",\"MONTH\",\"REVENUE\"]"
>    }
>  ],
>  "metadata": [
>    "VARCHAR(0, 0)",
>    "VARCHAR(0, 0)"
>  ],
> 
> This is bizarre in that the data type is not being reported correctly, it 
> should be LIST or something like that, AND we're getting too many columns in 
> the metadata.  I'll submit a JIRA as well, but could someone please take a 
> look?
> Thanks,
> -- C
> 
> 
>

Re: Strange metadata from Text Reader

Reply via email to