[ https://issues.apache.org/jira/browse/SPARK-11767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Davies Liu resolved SPARK-11767.
--------------------------------
       Resolution: Fixed
    Fix Version/s: 1.6.0

> Easy to OOM when cache large column
> -----------------------------------
>
>                 Key: SPARK-11767
>                 URL: https://issues.apache.org/jira/browse/SPARK-11767
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Davies Liu
>            Assignee: Davies Liu
>             Fix For: 1.6.0
>
>
> The default batch size (10000 rows) does not work well for a large column
> (with a serialized size of about 100k per value); it is easy to OOM when
> unrolling the rows.
> We should limit the serialized size of a batch.
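
To illustrate the problem described above: if "100k" is read as roughly 100,000 bytes per value, a batch of 10000 rows holds about 10000 × 100,000 = 1,000,000,000 bytes (close to 1 GB) for that one column before it can be flushed. The sketch below shows one way to cap a batch by serialized size as well as by row count. It is a minimal illustration and not the code from the Spark patch; `batch_rows`, `max_rows`, `max_bytes`, and the 4 MiB default are hypothetical names and values chosen for the example.

```python
from typing import Iterable, Iterator, List


def batch_rows(
    rows: Iterable[bytes],
    max_rows: int = 10_000,
    max_bytes: int = 4 * 1024 * 1024,  # illustrative cap, not Spark's value
) -> Iterator[List[bytes]]:
    """Group serialized rows into batches bounded by row count and byte size."""
    batch: List[bytes] = []
    batch_bytes = 0
    for row in rows:
        # Flush the current batch if adding this row would break either limit.
        # A non-empty check ensures an oversized single row still gets emitted
        # in a batch of its own instead of looping forever.
        if batch and (len(batch) >= max_rows or batch_bytes + len(row) > max_bytes):
            yield batch
            batch, batch_bytes = [], 0
        batch.append(row)
        batch_bytes += len(row)
    if batch:
        yield batch


if __name__ == "__main__":
    # 100 rows of 100,000 bytes each, with a 1,000,000-byte cap per batch.
    rows = [b"x" * 100_000 for _ in range(100)]
    sizes = [len(b) for b in batch_rows(rows, max_rows=10_000, max_bytes=1_000_000)]
    print(sizes)
```

With a row-count limit alone, all 100 rows in this usage example would land in a single batch; with the byte cap, memory held per batch stays bounded no matter how wide the column is.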