You seem to have a lot of column :-) ! 
df.count() displays the size of your data frame. 
df.columns.size() the number of columns.

Finally, I suggest you check the size of your drive and customize it 
accordingly.

Cheers,

Ardo 

Sent from my iPhone

> On 09 Apr 2016, at 19:37, bdev <buntu...@gmail.com> wrote:
> 
> I keep running out of memory on the driver when I attempt to do df.show().
> Can anyone let me know how to estimate the size of the dataframe?
> 
> Thanks!
> 
> 
> 
> --
> View this message in context: 
> http://apache-spark-user-list.1001560.n3.nabble.com/How-to-estimate-the-size-of-dataframe-using-pyspark-tp26729.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to