How big is the data set? Is it a single file or several? And how long
does the query take?

Drill does not have to read the entire file to produce output. I suspect
most of the time is going into setting up the query, not reading the data.
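
To illustrate the general idea that a limit does not require a full read,
here is a minimal Python sketch. It is not Drill's code and says nothing
about how Drill is implemented; the names first_rows and counted, and the
synthetic input, are made up for this example. It only shows that a reader
pulling rows lazily can stop after the first few lines.

```python
import csv
from itertools import islice


def counted(lines, counter):
    """Yield lines unchanged while counting how many were actually pulled."""
    for line in lines:
        counter["read"] += 1
        yield line


def first_rows(lines, limit):
    """Return the first `limit` parsed CSV rows, pulling input lazily."""
    reader = csv.reader(lines)
    # islice stops asking the reader for rows once `limit` rows are produced.
    return list(islice(reader, limit))


# Hypothetical stand-in for a large CSV: a lazy generator of one million lines.
source = (f"{i},value{i}\n" for i in range(1_000_000))

counter = {"read": 0}
rows = first_rows(counted(source, counter), 5)

print(rows)
print("lines pulled from the source:", counter["read"])
```

If a reader like this still takes a long time to return, the cost is
somewhere other than scanning the rows, which is why the setup time is
worth checking first.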

On Mon, Mar 30, 2015 at 3:39 PM, Ganesha Muthuraman <mganesh...@outlook.com>
wrote:

> I see that for a large data set (in my case a large CSV), even a LIMIT 5
> type of SQL takes a very long time. Is it that Drill has to read through the
> entire file to produce any output even if I just want, say, the first 10
> lines of the file?
>
> Is there a way to let Drill stream the results back as it reads the CSV in
> such cases? I am trying to use Drill for data exploration purposes and
> mostly to get a peek into the data set from my data lake before running
> bigger queries/analytics on this data set.
>
> Regards,
> Ganesh
>




-- 
 Steven Phillips
 Software Engineer

 mapr.com
