Re: Failed to Query Cassandra using Drill

2016-01-21 Thread Jim Bates
Here is a post I saw on cassandra/drill a while back. http://www.confusedcoders.com/bigdata/apache-drill/sql-on-cassandra-querying-cassandra-via-apache-drill On Thu, Jan 21, 2016 at 6:07 AM, Anup Tiwari wrote: > Hi, > > I am using Drill 1.2 and want to query

Re: Failed to Query Cassandra using Drill

2016-01-21 Thread Devender Yadav
This patch does not work with Drill 1.0 or later. Check details: http://stackoverflow.com/questions/31017755/how-to-use-apache-drill-with-cassandra Regards, Devender On Thu, Jan 21, 2016 at 7:57 PM, Jim Bates wrote: > Here is a post I saw on cassandra/drill a while

Re: CTAS plan showing single node?

2016-01-21 Thread Jason Altekruse
You can tell whether a query is parallelized by looking for exchanges in the query plan. Exchanges are used to merge work from multiple execution fragments, or to redistribute data for an operation. Execution fragments can run on different threads or different machines. The best place to find out how

Re: Failed to Query Cassandra using Drill

2016-01-21 Thread Anup Tiwari
@jim I have already followed the steps given in that post, but it's not working. On 21-Jan-2016 8:45 pm, "Devender Yadav" wrote: > This patch is not working with Drill 1.0+ version. Check details: > > >

CTAS plan showing single node?

2016-01-21 Thread Matt
I am running a CTAS from CSV files in a 4-node HDFS cluster into a Parquet file, and I note that the physical plan in the Drill UI references scans of all the CSV sources on a single node. collectl implies read and write IO on all 4 nodes - does this imply that the full cluster is used for scanning the

File size limit for CTAS?

2016-01-21 Thread Matt
Converting CSV files to Parquet with CTAS, and getting errors on some larger files: With a source file of 16.34GB (as reported in the HDFS explorer): ~~~ create table `/parquet/customer_20151017` partition by (date_tm) AS select * from `/csv/customer/customer_20151017.csv`; Error: SYSTEM

Re: File size limit for CTAS?

2016-01-21 Thread rahul challapalli
Ignoring the CTAS part, can you try running the select query and see if it completes? My suspicion is that some record/field in your large file is causing Drill to break. Also, it would be helpful if you could give more information from the drillbit.log when this error happens (Search for

Re: Changing Map schema in an embedded list

2016-01-21 Thread Andries Engelbrecht
This seems to be an interesting issue with flatten. All the data seems to be recognized as a list, and I would expect flatten to just unnest the first layer? Seems to be somewhat related to DRILL-1673 select