[jira] [Assigned] (SPARK-10531) AppId is set as AppName in status rest api

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10531: Assignee: Apache Spark > AppId is set as AppName in status rest api >

[jira] [Commented] (SPARK-10531) AppId is set as AppName in status rest api

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738314#comment-14738314 ] Apache Spark commented on SPARK-10531: -- User 'zjffdu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10531) AppId is set as AppName in status rest api

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10531: Assignee: (was: Apache Spark) > AppId is set as AppName in status rest api >

[jira] [Comment Edited] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738541#comment-14738541 ] ZhengYaofeng edited comment on SPARK-10529 at 9/10/15 10:22 AM: I find

[jira] [Assigned] (SPARK-10279) Add @since annotation to pyspark.mllib.util

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10279: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.mllib.util >

[jira] [Assigned] (SPARK-10279) Add @since annotation to pyspark.mllib.util

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10279: Assignee: Apache Spark > Add @since annotation to pyspark.mllib.util >

[jira] [Commented] (SPARK-10531) AppId is set as AppName in status rest api

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738343#comment-14738343 ] Apache Spark commented on SPARK-10531: -- User 'zjffdu' has created a pull request for this issue:

[jira] [Commented] (SPARK-10279) Add @since annotation to pyspark.mllib.util

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738344#comment-14738344 ] Apache Spark commented on SPARK-10279: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10280) Add @since annotation to pyspark.ml.classification

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10280: Assignee: Apache Spark > Add @since annotation to pyspark.ml.classification >

[jira] [Assigned] (SPARK-10280) Add @since annotation to pyspark.ml.classification

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10280: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.ml.classification >

[jira] [Commented] (SPARK-10280) Add @since annotation to pyspark.ml.classification

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738368#comment-14738368 ] Apache Spark commented on SPARK-10280: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang edited comment on SPARK-10525 at 9/10/15 9:16 AM: -- [~josephkb]

[jira] [Commented] (SPARK-10510) Add documentation for how to register a custom Kryo serializer in Spark

2015-09-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738404#comment-14738404 ] Johannes Günther commented on SPARK-10510: -- If I understand It correctly from reading the Kryo

[jira] [Comment Edited] (SPARK-10510) Add documentation for how to register a custom Kryo serializer in Spark

2015-09-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738404#comment-14738404 ] Johannes Günther edited comment on SPARK-10510 at 9/10/15 8:34 AM: --- If

[jira] [Assigned] (SPARK-10284) Add @since annotation to pyspark.ml.tuning

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10284: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.ml.tuning >

[jira] [Commented] (SPARK-10284) Add @since annotation to pyspark.ml.tuning

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738459#comment-14738459 ] Apache Spark commented on SPARK-10284: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10284) Add @since annotation to pyspark.ml.tuning

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10284: Assignee: Apache Spark > Add @since annotation to pyspark.ml.tuning >

[jira] [Commented] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738541#comment-14738541 ] ZhengYaofeng commented on SPARK-10529: -- I find that IsolatedClientLoader class contains a attribute

[jira] [Assigned] (SPARK-10278) Add @since annotation to pyspark.mllib.tree

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10278: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.mllib.tree >

[jira] [Assigned] (SPARK-10278) Add @since annotation to pyspark.mllib.tree

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10278: Assignee: Apache Spark > Add @since annotation to pyspark.mllib.tree >

[jira] [Updated] (SPARK-10528) spark-shell throws java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable.

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10528: -- Target Version/s: (was: 1.5.0) Priority: Minor (was: Major) [~beloblotskiy] have a look

[jira] [Commented] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang commented on SPARK-10525: - OK, very glad to do that. > Add Python example for

[jira] [Updated] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ZhengYaofeng updated SPARK-10529: - Description: Test code as follows: object SqlTest { def main(args: Array[String]) { def

[jira] [Assigned] (SPARK-10532) Added new option to specify "user profile" of AWS credentials in spark/spark-ec2.py

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10532: Assignee: (was: Apache Spark) > Added new option to specify "user profile" of AWS

[jira] [Commented] (SPARK-10532) Added new option to specify "user profile" of AWS credentials in spark/spark-ec2.py

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738519#comment-14738519 ] Apache Spark commented on SPARK-10532: -- User 'teramonagi' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10532) Added new option to specify "user profile" of AWS credentials in spark/spark-ec2.py

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10532: Assignee: Apache Spark > Added new option to specify "user profile" of AWS credentials in

[jira] [Updated] (SPARK-10532) Added new option to specify "user profile" of AWS credentials in spark/spark-ec2.py

2015-09-10 Thread teramonagi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] teramonagi updated SPARK-10532: --- Description: AWS users want to use "Named Profiles" sometimes. -

[jira] [Commented] (SPARK-10528) spark-shell throws java.lang.RuntimeException: The root scratch dir: /tmp/hive on HDFS should be writable.

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738303#comment-14738303 ] Sean Owen commented on SPARK-10528: --- I think the problem is that it's not executable? This is an

[jira] [Assigned] (SPARK-10281) Add @since annotation to pyspark.ml.clustering

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10281: Assignee: Apache Spark > Add @since annotation to pyspark.ml.clustering >

[jira] [Assigned] (SPARK-10281) Add @since annotation to pyspark.ml.clustering

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10281: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.ml.clustering >

[jira] [Commented] (SPARK-10281) Add @since annotation to pyspark.ml.clustering

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738386#comment-14738386 ] Apache Spark commented on SPARK-10281: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Updated] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ZhengYaofeng updated SPARK-10529: - Flags: Patch > When creating multiple HiveContext objects in one jvm, jdbc connections to >

[jira] [Updated] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ZhengYaofeng updated SPARK-10529: - Attachment: IsolatedClientLoader.scala > When creating multiple HiveContext objects in one jvm,

[jira] [Updated] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ZhengYaofeng updated SPARK-10529: - Description: Test code as follows: object SqlTest { def main(args: Array[String]) { def

[jira] [Commented] (SPARK-9899) JSON/Parquet writing on retry or speculation broken with direct output committer

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738328#comment-14738328 ] Apache Spark commented on SPARK-9899: - User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738472#comment-14738472 ] Apache Spark commented on SPARK-10285: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang edited comment on SPARK-10525 at 9/10/15 9:21 AM: -- [~josephkb]

[jira] [Comment Edited] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang edited comment on SPARK-10525 at 9/10/15 9:21 AM: -- [~josephkb]

[jira] [Assigned] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10285: Assignee: Apache Spark > Add @since annotation to pyspark.ml.util >

[jira] [Assigned] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10285: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.ml.util >

[jira] [Comment Edited] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang edited comment on SPARK-10525 at 9/10/15 9:19 AM: -- [~josephkb]

[jira] [Comment Edited] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738541#comment-14738541 ] ZhengYaofeng edited comment on SPARK-10529 at 9/10/15 10:13 AM: I find

[jira] [Updated] (SPARK-9790) [YARN] Expose in WebUI if NodeManager is the reason why executors were killed.

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9790: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) > [YARN] Expose in WebUI if

[jira] [Commented] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-09-10 Thread Marius Soutier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738390#comment-14738390 ] Marius Soutier commented on SPARK-10251: Any chance for a backport to 1.4.2? > Some internal

[jira] [Commented] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738433#comment-14738433 ] Apache Spark commented on SPARK-10283: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10283: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.ml.regression >

[jira] [Assigned] (SPARK-10283) Add @since annotation to pyspark.ml.regression

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10283: Assignee: Apache Spark > Add @since annotation to pyspark.ml.regression >

[jira] [Comment Edited] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang edited comment on SPARK-10525 at 9/10/15 9:17 AM: -- [~josephkb]

[jira] [Comment Edited] (SPARK-10525) Add Python example for VectorSlicer to user guide

2015-09-10 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738439#comment-14738439 ] Yanbo Liang edited comment on SPARK-10525 at 9/10/15 9:18 AM: -- [~josephkb]

[jira] [Created] (SPARK-10532) Added new option to specify "user profile" of AWS credentials in spark/spark-ec2.py

2015-09-10 Thread teramonagi (JIRA)
teramonagi created SPARK-10532: -- Summary: Added new option to specify "user profile" of AWS credentials in spark/spark-ec2.py Key: SPARK-10532 URL: https://issues.apache.org/jira/browse/SPARK-10532

[jira] [Comment Edited] (SPARK-10529) When creating multiple HiveContext objects in one jvm, jdbc connections to metastore cann't be released and it may cause PermGen OutOfMemoryError.

2015-09-10 Thread ZhengYaofeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738541#comment-14738541 ] ZhengYaofeng edited comment on SPARK-10529 at 9/10/15 10:12 AM: I find

[jira] [Commented] (SPARK-10278) Add @since annotation to pyspark.mllib.tree

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738282#comment-14738282 ] Apache Spark commented on SPARK-10278: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10282) Add @since annotation to pyspark.ml.recommendation

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10282: Assignee: (was: Apache Spark) > Add @since annotation to pyspark.ml.recommendation >

[jira] [Commented] (SPARK-10282) Add @since annotation to pyspark.ml.recommendation

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738394#comment-14738394 ] Apache Spark commented on SPARK-10282: -- User 'yu-iskw' has created a pull request for this issue:

[jira] [Commented] (SPARK-10251) Some internal spark classes are not registered with kryo

2015-09-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738396#comment-14738396 ] Reynold Xin commented on SPARK-10251: - I think that's possible -- do you want to submit a patch? >

[jira] [Assigned] (SPARK-10282) Add @since annotation to pyspark.ml.recommendation

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10282: Assignee: Apache Spark > Add @since annotation to pyspark.ml.recommendation >

[jira] [Commented] (SPARK-10500) sparkr.zip cannot be created if $SPARK_HOME/R/lib is unwritable

2015-09-10 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738479#comment-14738479 ] Sun Rui commented on SPARK-10500: - I also realized that SPARK-8313 has problem in Standalone mode.

[jira] [Created] (SPARK-10533) DataFrame filter is not handling float/double with Scientific Notation 'e' / 'E'

2015-09-10 Thread Rishabh Bhardwaj (JIRA)
Rishabh Bhardwaj created SPARK-10533: Summary: DataFrame filter is not handling float/double with Scientific Notation 'e' / 'E' Key: SPARK-10533 URL: https://issues.apache.org/jira/browse/SPARK-10533

[jira] [Comment Edited] (SPARK-9610) Class and instance weighting for ML

2015-09-10 Thread Nickolay Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738596#comment-14738596 ] Nickolay Yakushev edited comment on SPARK-9610 at 9/10/15 11:21 AM:

[jira] [Updated] (SPARK-10534) ORDER BY clause allows only columns that are present in SELECT statement

2015-09-10 Thread Michal Cwienczek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Cwienczek updated SPARK-10534: - Description: When invoking query SELECT EmployeeID from Employees order by

[jira] [Comment Edited] (SPARK-9610) Class and instance weighting for ML

2015-09-10 Thread Nickolay Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738596#comment-14738596 ] Nickolay Yakushev edited comment on SPARK-9610 at 9/10/15 11:22 AM:

[jira] [Assigned] (SPARK-10442) select cast('false' as boolean) returns true

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10442: Assignee: Apache Spark > select cast('false' as boolean) returns true >

[jira] [Assigned] (SPARK-10442) select cast('false' as boolean) returns true

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10442: Assignee: (was: Apache Spark) > select cast('false' as boolean) returns true >

[jira] [Commented] (SPARK-10442) select cast('false' as boolean) returns true

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738655#comment-14738655 ] Apache Spark commented on SPARK-10442: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-8582) Optimize checkpointing to avoid computing an RDD twice

2015-09-10 Thread Robert B. Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738587#comment-14738587 ] Robert B. Kim commented on SPARK-8582: -- [~andrewor14] Because of this bug, we suffer from performance

[jira] [Assigned] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10518: Assignee: Apache Spark > Update code examples in spark.ml user guide to use LIBSVM data

[jira] [Created] (SPARK-10534) ORDER BY clause allows only columns that are present in SELECT statement

2015-09-10 Thread Michal Cwienczek (JIRA)
Michal Cwienczek created SPARK-10534: Summary: ORDER BY clause allows only columns that are present in SELECT statement Key: SPARK-10534 URL: https://issues.apache.org/jira/browse/SPARK-10534

[jira] [Assigned] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10518: Assignee: (was: Apache Spark) > Update code examples in spark.ml user guide to use

[jira] [Commented] (SPARK-10518) Update code examples in spark.ml user guide to use LIBSVM data source instead of MLUtils

2015-09-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738625#comment-14738625 ] Apache Spark commented on SPARK-10518: -- User 'y-shimizu' has created a pull request for this issue:

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738648#comment-14738648 ] Sean Owen commented on SPARK-10493: --- OK, yes I see now that temp4 is count-ed. I'm out of ideas. I

[jira] [Comment Edited] (SPARK-9610) Class and instance weighting for ML

2015-09-10 Thread Nickolay Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738596#comment-14738596 ] Nickolay Yakushev edited comment on SPARK-9610 at 9/10/15 11:20 AM:

[jira] [Commented] (SPARK-9610) Class and instance weighting for ML

2015-09-10 Thread Nickolay Yakushev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738596#comment-14738596 ] Nickolay Yakushev commented on SPARK-9610: -- Sometimes an algorithm for non-weighted data may be

[jira] [Created] (SPARK-10535) Support for recommendUsersForProducts and recommendProductsForUsers in matrix factorization model for PySpark

2015-09-10 Thread Vladimir Vladimirov (JIRA)
Vladimir Vladimirov created SPARK-10535: --- Summary: Support for recommendUsersForProducts and recommendProductsForUsers in matrix factorization model for PySpark Key: SPARK-10535 URL:

[jira] [Created] (SPARK-10536) filtered POJOs replaced by other instances after collect()

2015-09-10 Thread Erik Schmiegelow (JIRA)
Erik Schmiegelow created SPARK-10536: Summary: filtered POJOs replaced by other instances after collect() Key: SPARK-10536 URL: https://issues.apache.org/jira/browse/SPARK-10536 Project: Spark

[jira] [Updated] (SPARK-10506) There exits some potential resource leak in jsonExpressions.scala

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10506: -- Priority: Minor (was: Major) > There exits some potential resource leak in jsonExpressions.scala >

[jira] [Resolved] (SPARK-7880) Silent failure if assembly jar is corrupted

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7880. -- Resolution: Fixed Assignee: Sean Owen Fix Version/s: 1.4.0 Target

[jira] [Updated] (SPARK-10294) When Parquet writer's close method throws an exception, we will call close again and trigger a NPE

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10294: -- Target Version/s: 1.5.1 (was: 1.5.0, 1.5.1) > When Parquet writer's close method throws an exception,

[jira] [Updated] (SPARK-8697) MatchIterator not serializable exception in RegexTokenizer

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8697: - Target Version/s: (was: 1.5.0) > MatchIterator not serializable exception in RegexTokenizer >

[jira] [Created] (SPARK-10537) Document LIBSVM data source options in public doc and minor improvements

2015-09-10 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10537: - Summary: Document LIBSVM data source options in public doc and minor improvements Key: SPARK-10537 URL: https://issues.apache.org/jira/browse/SPARK-10537 Project:

[jira] [Updated] (SPARK-10535) Support for recommendUsersForProducts and recommendProductsForUsers in matrix factorization model for PySpark

2015-09-10 Thread Vladimir Vladimirov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vladimir Vladimirov updated SPARK-10535: Affects Version/s: 1.5.0 > Support for recommendUsersForProducts and

[jira] [Updated] (SPARK-10534) ORDER BY clause allows only columns that are present in SELECT statement

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10534: -- Component/s: SQL > ORDER BY clause allows only columns that are present in SELECT statement >

[jira] [Updated] (SPARK-6484) Ganglia metrics xml reporter doesn't escape correctly

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6484: - Target Version/s: 1.5.1 (was: 1.5.0) > Ganglia metrics xml reporter doesn't escape correctly >

[jira] [Updated] (SPARK-10337) Views are broken

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10337: -- Target Version/s: 1.5.1 (was: 1.5.0) > Views are broken > > > Key:

[jira] [Updated] (SPARK-6701) Flaky test: o.a.s.deploy.yarn.YarnClusterSuite Python application

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6701: - Target Version/s: 1.5.1 (was: 1.5.0) > Flaky test: o.a.s.deploy.yarn.YarnClusterSuite Python application

[jira] [Updated] (SPARK-10310) [Spark SQL] All result records will be popluated into ONE line during the script transform due to missing the correct line/filed delimiter

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10310: -- Target Version/s: 1.5.1 (was: 1.5.0) > [Spark SQL] All result records will be popluated into ONE line

[jira] [Updated] (SPARK-7420) Flaky test: o.a.s.streaming.JobGeneratorSuite "Do not clear received block data too soon"

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7420: - Target Version/s: 1.5.1 (was: 1.5.0) > Flaky test: o.a.s.streaming.JobGeneratorSuite "Do not clear

[jira] [Commented] (SPARK-10538) java.lang.NegativeArraySizeException during join

2015-09-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738921#comment-14738921 ] Maciej Bryński commented on SPARK-10538: There is similar problem behaviour a few joins before,

[jira] [Resolved] (SPARK-10390) Py4JJavaError java.lang.NoSuchMethodError: com.google.common.base.Stopwatch.elapsedMillis()J

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10390. --- Resolution: Not A Problem FWIW I ran a recent build of Spark successfully with ipython. It was

[jira] [Resolved] (SPARK-10418) pyspark issue with nested array types

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10418. --- Resolution: Cannot Reproduce For now, resolving as cannot reproduce, until there's a response >

[jira] [Commented] (SPARK-10536) filtered POJOs replaced by other instances after collect()

2015-09-10 Thread Erik Schmiegelow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738917#comment-14738917 ] Erik Schmiegelow commented on SPARK-10536: -- Hi Sean, thanks for your feedback - your assumption

[jira] [Updated] (SPARK-10536) filtered POJOs replaced by other instances after collect()

2015-09-10 Thread Erik Schmiegelow (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Schmiegelow updated SPARK-10536: - Description: I've encountered a very strange phenomenon with collect() in a simplistic

[jira] [Updated] (SPARK-9730) Sort Merge Join for Full Outer Join

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-9730: - Assignee: Davies Liu > Sort Merge Join for Full Outer Join > --- > >

[jira] [Updated] (SPARK-10527) evaluate debugString only when log level is debug

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10527: -- Component/s: Spark Core > evaluate debugString only when log level is debug >

[jira] [Updated] (SPARK-10461) make sure `input.primitive` is always variable name not code at GenerateUnsafeProjection

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10461: -- Assignee: Wenchen Fan > make sure `input.primitive` is always variable name not code at >

[jira] [Updated] (SPARK-10517) Console "Output" field is empty when using DataFrameWriter.json

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10517: -- Component/s: Web UI > Console "Output" field is empty when using DataFrameWriter.json >

[jira] [Updated] (SPARK-10044) AnalysisException in resolving reference for sorting with aggregation

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-10044: -- Target Version/s: (was: 1.5.0) > AnalysisException in resolving reference for sorting with

[jira] [Updated] (SPARK-7880) Silent failure if assembly jar is corrupted

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7880: - Priority: Minor (was: Major) Fix Version/s: (was: 1.4.0) 1.5.0 I

[jira] [Updated] (SPARK-10538) java.lang.NegativeArraySizeException during join

2015-09-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-10538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-10538: --- Description: Hi, I've got a problem during joining tables. (in my example 20 of them) I can

[jira] [Commented] (SPARK-10536) filtered POJOs replaced by other instances after collect()

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738832#comment-14738832 ] Sean Owen commented on SPARK-10536: --- My snap guess is that this is the problem: {{f._1.datum()}} You

[jira] [Commented] (SPARK-10493) reduceByKey not returning distinct results

2015-09-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14738829#comment-14738829 ] Sean Owen commented on SPARK-10493: --- Maybe union() tides you over; CDH 5.5 = Spark 1.5 is coming in ~2

  1   2   3   >