[ https://issues.apache.org/jira/browse/SPARK-6713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiangrui Meng updated SPARK-6713: --------------------------------- Assignee: Reza Zadeh > Iterators in columnSimilarities to allow flatMap spill > ------------------------------------------------------ > > Key: SPARK-6713 > URL: https://issues.apache.org/jira/browse/SPARK-6713 > Project: Spark > Issue Type: Improvement > Components: MLlib > Reporter: Reza Zadeh > Assignee: Reza Zadeh > Fix For: 1.4.0 > > > We should use Iterators in columnSimilarities to allow mapPartitionsWithIndex > to spill to disk. This could happen in a dense and large column - this way > Spark can spill the pairs onto disk instead of building all the pairs before > handing them to Spark. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org