[ https://issues.apache.org/jira/browse/SPARK-15061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15269315#comment-15269315 ]
holdenk commented on SPARK-15061: --------------------------------- It seems like the latest Py4J has landed, I'd be happy to take this task as I did the last Py4J upgrade as well. > Upgrade Py4J to 0.10.1 > ---------------------- > > Key: SPARK-15061 > URL: https://issues.apache.org/jira/browse/SPARK-15061 > Project: Spark > Issue Type: Improvement > Components: PySpark > Reporter: Chris Kanich > Labels: easyfix > > Py4J 0.10.1 hasn't landed yet, but it will likely cause a significant > performance improvement for PySpark and MLLib in particular. More details are > available at https://github.com/bartdag/py4j/issues/201 > The syscall overhead was likely the reason that > https://issues.apache.org/jira/browse/SPARK-6728 was reported as well - > dropping the base64 encoding will help too, but I imagine this fix will help > more. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org