[jira] [Updated] (SPARK-12402) Memory leak in pyspark

2019-05-20 Thread Hyukjin Kwon (JIRA)


 [ 
https://issues.apache.org/jira/browse/SPARK-12402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hyukjin Kwon updated SPARK-12402:
-
Labels: bulk-closed  (was: )

> Memory leak in pyspark
> --
>
> Key: SPARK-12402
> URL: https://issues.apache.org/jira/browse/SPARK-12402
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Reporter: Davies Liu
>Priority: Major
>  Labels: bulk-closed
>
> After run some SQL query in PySpark, the DataFrame are still referenced by 
> py4j Gateway, they are freed after calll `gc.collect()` in Python.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-12402) Memory leak in pyspark

2015-12-17 Thread Davies Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-12402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Davies Liu updated SPARK-12402:
---
Description: After run some SQL query in PySpark, the DataFrame are still 
referenced by py4j Gateway, they are freed after calll `gc.collect()` in 
Python.  (was: The broadcasted HashRelation is not destroyed after query 
finished (also can't be reused).)

> Memory leak in pyspark
> --
>
> Key: SPARK-12402
> URL: https://issues.apache.org/jira/browse/SPARK-12402
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Reporter: Davies Liu
>
> After run some SQL query in PySpark, the DataFrame are still referenced by 
> py4j Gateway, they are freed after calll `gc.collect()` in Python.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-12402) Memory leak in pyspark

2015-12-17 Thread Davies Liu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-12402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Davies Liu updated SPARK-12402:
---
Summary: Memory leak in pyspark  (was: Memory leak in broadcast hash join)

> Memory leak in pyspark
> --
>
> Key: SPARK-12402
> URL: https://issues.apache.org/jira/browse/SPARK-12402
> Project: Spark
>  Issue Type: Bug
>  Components: SQL
>Reporter: Davies Liu
>
> The broadcasted HashRelation is not destroyed after query finished (also 
> can't be reused).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org