Re: CTAS query timeout on connection pool

2016-04-15 Thread Abhishek Girish
Can you take a look at https://drill.apache.org/docs/s3-storage-plugin/#quering-parquet-format-files-on-s3 ? It could be an issue of connection to s3 timing out. On Fri, Apr 15, 2016 at 1:03 AM, Ashish Goel wrote: > Hi, > > I am running a CTAS query to convert JSON

Re: Drill 1.6 on MapR cluster not using extractHeader ?

2016-04-15 Thread Abhishek Girish
Hello, This is my format setting: "csv": { "type": "text", "extensions": [ "csv" ], "extractHeader": true, "delimiter": "," } I was able to extract the header and get expected results: > select * from mfs.tmp.`abcd.csv`; +++++ | A

Drill 1.6 on MapR cluster not using extractHeader ?

2016-04-15 Thread Matt
With files in the local filesystem, and an embedded drill bit from the download on drill.apache.org, I can successfully query csv data by column name with the extractHeader option on, as in SELECT customer_if FROM `file`; But in a MapR cluster (v. 5.1.0.37549.GA) with the data in MapR-FS, the

CTAS query timeout on connection pool

2016-04-15 Thread Ashish Goel
Hi, I am running a CTAS query to convert JSON data stored in S3 into parquet store back into S3. Both the input and output are S3 locations. While some of parquest files are created in S3 but not all. I receive this error message after some time - *Error: DATA_READ ERROR: Failure reading JSON