[jira] [Created] (SPARK-12617) socket descriptor leak killing streaming app

2016-01-04 Thread Antony Mayi (JIRA)
Antony Mayi created SPARK-12617: --- Summary: socket descriptor leak killing streaming app Key: SPARK-12617 URL: https://issues.apache.org/jira/browse/SPARK-12617 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-12617) socket descriptor leak killing streaming app

2016-01-04 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12617: Description: There is a socket descriptor leakage in a pyspark streaming app when configured with

[jira] [Updated] (SPARK-12617) socket descriptor leak killing streaming app

2016-01-04 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12617: Attachment: bug.py > socket descriptor leak killing streaming app >

[jira] [Updated] (SPARK-12617) socket descriptor leak killing streaming app

2016-01-04 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12617: Description: There is a socket descriptor leakage in a pyspark streaming app when configured with

[jira] [Updated] (SPARK-12617) socket descriptor leak killing streaming app

2016-01-04 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12617: Description: There is a socket descriptor leakage in a pyspark streaming app when configured with

[jira] [Commented] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-29 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15073816#comment-15073816 ] Antony Mayi commented on SPARK-12511: - just found SPARK-11711 which looks to be the same issue >

[jira] [Updated] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12511: Description: Spark streaming application when configured with checkpointing is filling driver's

[jira] [Updated] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12511: Description: Spark streaming application when configured with checkpointing is filling driver's

[jira] [Created] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
Antony Mayi created SPARK-12511: --- Summary: streaming driver with checkpointing unable to finalize leading to OOM Key: SPARK-12511 URL: https://issues.apache.org/jira/browse/SPARK-12511 Project: Spark

[jira] [Updated] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12511: Description: Spark streaming application when configured with checkpointing is filling driver's

[jira] [Updated] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12511: Attachment: bug.py > streaming driver with checkpointing unable to finalize leading to OOM >

[jira] [Updated] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12511: Attachment: finalizer-spark_assembly.png finalizer-pending.png

[jira] [Updated] (SPARK-12511) streaming driver with checkpointing unable to finalize leading to OOM

2015-12-23 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-12511: Description: Spark streaming application when configured with checkpointing is filling driver's

[jira] [Commented] (SPARK-6717) Clear shuffle files after checkpointing in ALS

2015-11-30 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6717?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15032822#comment-15032822 ] Antony Mayi commented on SPARK-6717: this seems to be even bigger problem in 1.5 as the workaround

[jira] [Commented] (SPARK-11498) TreeNodeException under very special condition

2015-11-19 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15015060#comment-15015060 ] Antony Mayi commented on SPARK-11498: - can't reproduce on 1.5.2, seems to be fixed. >

[jira] [Updated] (SPARK-11498) TreeNodeException under very special condition

2015-11-04 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-11498: Description: {code:title=/tmp/bug.py} from pyspark import SparkContext from pyspark.sql import

[jira] [Created] (SPARK-11498) TreeNodeException under very special condition

2015-11-03 Thread Antony Mayi (JIRA)
Antony Mayi created SPARK-11498: --- Summary: TreeNodeException under very special condition Key: SPARK-11498 URL: https://issues.apache.org/jira/browse/SPARK-11498 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-11498) TreeNodeException under very special condition

2015-11-03 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-11498: Description: {code} from pyspark import SparkContext from pyspark.sql import SQLContext, Row sc =

[jira] [Commented] (SPARK-8708) MatrixFactorizationModel.predictAll() populates single partition only

2015-07-02 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14611555#comment-14611555 ] Antony Mayi commented on SPARK-8708: bq. Antony Mayi In your real case, how many

[jira] [Commented] (SPARK-8708) MatrixFactorizationModel.predictAll() populates single partition only

2015-07-01 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14611304#comment-14611304 ] Antony Mayi commented on SPARK-8708: I suspect this issue with all data in single

[jira] [Commented] (SPARK-8708) MatrixFactorizationModel.predictAll() populates single partition only

2015-06-30 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14608169#comment-14608169 ] Antony Mayi commented on SPARK-8708: The real case is about 13M of users, few hundreds

[jira] [Commented] (SPARK-8708) MatrixFactorizationModel.predictAll() populates single partition only

2015-06-30 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14608214#comment-14608214 ] Antony Mayi commented on SPARK-8708: ok, more detailed example showing there really

[jira] [Comment Edited] (SPARK-8708) MatrixFactorizationModel.predictAll() populates single partition only

2015-06-30 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14608169#comment-14608169 ] Antony Mayi edited comment on SPARK-8708 at 6/30/15 11:55 AM: --

[jira] [Created] (SPARK-8708) MatrixFactorizationModel.predictAll() populates single partition only

2015-06-29 Thread Antony Mayi (JIRA)
Antony Mayi created SPARK-8708: -- Summary: MatrixFactorizationModel.predictAll() populates single partition only Key: SPARK-8708 URL: https://issues.apache.org/jira/browse/SPARK-8708 Project: Spark

[jira] [Updated] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-04-01 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-6334: --- Attachment: gc.png spark-local dir not getting cleared during ALS

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-04-01 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14390771#comment-14390771 ] Antony Mayi commented on SPARK-6334: bq. btw. I see based on the sourcecode

[jira] [Reopened] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-04-01 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi reopened SPARK-6334: spark-local dir not getting cleared during ALS --

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-20 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14371122#comment-14371122 ] Antony Mayi commented on SPARK-6334: bq. 2. Use less number of blocks, even you have

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-17 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14364764#comment-14364764 ] Antony Mayi commented on SPARK-6334: users: 12.5 millions ratings: 3.3 billions rank:

[jira] [Updated] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-6334: --- Attachment: als-diskusage.png this is the disk usage pattern during ALS - 90% is when YARN kills the

[jira] [Issue Comment Deleted] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-6334: --- Comment: was deleted (was: this is the disk usage pattern during ALS - 90% is when YARN kills the

[jira] [Updated] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-6334: --- Description: when running bigger ALS training spark spills loads of temp data into the local-dir (in

[jira] [Updated] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antony Mayi updated SPARK-6334: --- Description: when running bigger ALS training spark spills loads of temp data into the local-dir (in

[jira] [Created] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
Antony Mayi created SPARK-6334: -- Summary: spark-local dir not getting cleared during ALS Key: SPARK-6334 URL: https://issues.apache.org/jira/browse/SPARK-6334 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14361715#comment-14361715 ] Antony Mayi edited comment on SPARK-6334 at 3/14/15 11:11 AM: --

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14361715#comment-14361715 ] Antony Mayi commented on SPARK-6334: bq. What are the files that are filling up the

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14361712#comment-14361712 ] Antony Mayi commented on SPARK-6334: it is 12TB combined across all nodes available to

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14361947#comment-14361947 ] Antony Mayi commented on SPARK-6334: I had to increase the partitioning up to this

[jira] [Commented] (SPARK-6334) spark-local dir not getting cleared during ALS

2015-03-14 Thread Antony Mayi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14361951#comment-14361951 ] Antony Mayi commented on SPARK-6334: btw. I see based on the sourcecode checkpointing