Spark SQL on large number of columns

madhu phatak Tue, 19 May 2015 03:05:18 -0700

Hi,
I  am trying run spark sql aggregation on a file with 26k columns. No of
rows is very small. I am running into issue that spark is taking huge
amount of time to parse the sql and create a logical plan. Even if i have
just one row, it's taking more than 1 hour just to get pass the parsing.
Any idea how to optimize in these kind of scenarios?



Regards,
Madhukara Phatak
http://datamantra.io/

Spark SQL on large number of columns

Reply via email to