subject:"Using Spark SQL for temporal data"

Using Spark SQL for temporal data

2015-02-12 Thread Corey Nolet

I have a temporal data set in which I'd like to be able to query using Spark SQL. The dataset is actually in Accumulo and I've already written a CatalystScan implementation and RelationProvider[1] to register with the SQLContext so that I can apply my SQL statements. With my current

Re: Using Spark SQL for temporal data

2015-02-12 Thread Michael Armbrust

Hi Corey, I would not recommend using the CatalystScan for this. Its lower level, and not stable across releases. You should be able to do what you want with PrunedFilteredScan

Re: Using Spark SQL for temporal data

2015-02-12 Thread Michael Armbrust

I haven't been paying close attention to the JIRA tickets for PrunedFilteredScan but I noticed some weird behavior around the filters being applied when OR expressions were used in the WHERE clause. From what I was seeing, it looks like it could be possible that the start and end ranges you