[jira] [Assigned] (SPARK-14844) KMeansModel in spark.ml should allow to change featureCol and predictionCol

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14844: Assignee: Apache Spark > KMeansModel in spark.ml should allow to change featureCol and

[jira] [Commented] (SPARK-14844) KMeansModel in spark.ml should allow to change featureCol and predictionCol

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253764#comment-15253764 ] Apache Spark commented on SPARK-14844: -- User 'dominik-jastrzebski' has created a pull request for

[jira] [Created] (SPARK-14847) ML/MLlib breaking changes between 1.6 & 2.0

2016-04-22 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-14847: --- Summary: ML/MLlib breaking changes between 1.6 & 2.0 Key: SPARK-14847 URL: https://issues.apache.org/jira/browse/SPARK-14847 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-04-22 Thread Yuri Saito (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253760#comment-15253760 ] Yuri Saito edited comment on SPARK-11227 at 4/22/16 11:17 AM: --

[jira] [Created] (SPARK-14846) Driver process fails to terminate when graceful shutdown is used

2016-04-22 Thread Mattias Aspholm (JIRA)
Mattias Aspholm created SPARK-14846: --- Summary: Driver process fails to terminate when graceful shutdown is used Key: SPARK-14846 URL: https://issues.apache.org/jira/browse/SPARK-14846 Project:

[jira] [Commented] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-04-22 Thread Yuri Saito (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253760#comment-15253760 ] Yuri Saito commented on SPARK-11227: [~valgrind_girl]: Have you run spark-submit and your jar with

[jira] [Comment Edited] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-04-22 Thread Yuri Saito (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253760#comment-15253760 ] Yuri Saito edited comment on SPARK-11227 at 4/22/16 11:15 AM: --

[jira] [Comment Edited] (SPARK-11227) Spark1.5+ HDFS HA mode throw java.net.UnknownHostException: nameservice1

2016-04-22 Thread Yuri Saito (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253760#comment-15253760 ] Yuri Saito edited comment on SPARK-11227 at 4/22/16 11:16 AM: --

[jira] [Commented] (SPARK-11559) Make `runs` no effect in k-means

2016-04-22 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253755#comment-15253755 ] Yanbo Liang commented on SPARK-11559: - Sure, sent https://github.com/apache/spark/pull/12608 to

[jira] [Commented] (SPARK-14737) Kafka Brokers are down - spark stream should retry

2016-04-22 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253751#comment-15253751 ] Sean Owen commented on SPARK-14737: --- I think it's most sensible to fail the application. It can't

[jira] [Commented] (SPARK-11559) Make `runs` no effect in k-means

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253745#comment-15253745 ] Apache Spark commented on SPARK-11559: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Updated] (SPARK-14609) LOAD DATA

2016-04-22 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-14609: Assignee: Liang-Chi Hsieh > LOAD DATA > - > > Key: SPARK-14609 >

[jira] [Assigned] (SPARK-14806) Alias original Hive options in Spark SQL conf

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14806: Assignee: Apache Spark > Alias original Hive options in Spark SQL conf >

[jira] [Assigned] (SPARK-14806) Alias original Hive options in Spark SQL conf

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14806: Assignee: (was: Apache Spark) > Alias original Hive options in Spark SQL conf >

[jira] [Commented] (SPARK-14806) Alias original Hive options in Spark SQL conf

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253714#comment-15253714 ] Apache Spark commented on SPARK-14806: -- User 'bomeng' has created a pull request for this issue:

[jira] [Resolved] (SPARK-14609) LOAD DATA

2016-04-22 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-14609. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12412

[jira] [Created] (SPARK-14845) spark.files in properties file is not distributed to driver in yarn-cluster mode

2016-04-22 Thread Jeff Zhang (JIRA)
Jeff Zhang created SPARK-14845: -- Summary: spark.files in properties file is not distributed to driver in yarn-cluster mode Key: SPARK-14845 URL: https://issues.apache.org/jira/browse/SPARK-14845

[jira] [Commented] (SPARK-14845) spark.files in properties file is not distributed to driver in yarn-cluster mode

2016-04-22 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253673#comment-15253673 ] Jeff Zhang commented on SPARK-14845: working on it. > spark.files in properties file is not

[jira] [Created] (SPARK-14844) KMeansModel in spark.ml should allow to change featureCol and predictionCol

2016-04-22 Thread JIRA
Dominik Jastrzębski created SPARK-14844: --- Summary: KMeansModel in spark.ml should allow to change featureCol and predictionCol Key: SPARK-14844 URL: https://issues.apache.org/jira/browse/SPARK-14844

[jira] [Updated] (SPARK-14843) Error while encoding: java.lang.ClassCastException with LibSVMRelation

2016-04-22 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-14843: --- Component/s: SQL > Error while encoding: java.lang.ClassCastException with LibSVMRelation >

[jira] [Created] (SPARK-14843) Error while encoding: java.lang.ClassCastException with LibSVMRelation

2016-04-22 Thread Nick Pentreath (JIRA)
Nick Pentreath created SPARK-14843: -- Summary: Error while encoding: java.lang.ClassCastException with LibSVMRelation Key: SPARK-14843 URL: https://issues.apache.org/jira/browse/SPARK-14843 Project:

[jira] [Commented] (SPARK-14820) Reduce shuffle data by pushing filter toward storage

2016-04-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253608#comment-15253608 ] Takeshi Yamamuro commented on SPARK-14820: -- Seems `Optimizer#PushPredicateThroughJoin` handles

[jira] [Comment Edited] (SPARK-14820) Reduce shuffle data by pushing filter toward storage

2016-04-22 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253608#comment-15253608 ] Takeshi Yamamuro edited comment on SPARK-14820 at 4/22/16 9:24 AM: ---

[jira] [Resolved] (SPARK-14826) Remove HiveQueryExecution

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14826. - Resolution: Fixed Assignee: Reynold Xin Fix Version/s: 2.0.0 > Remove

[jira] [Commented] (SPARK-14489) RegressionEvaluator returns NaN for ALS in Spark ml

2016-04-22 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253536#comment-15253536 ] Nick Pentreath commented on SPARK-14489: Is naive sampling not an option then for the

[jira] [Commented] (SPARK-12524) Group by key in a pairrdd without any shuffle

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253531#comment-15253531 ] Apache Spark commented on SPARK-12524: -- User 'seayi' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-14658) when executor lost DagScheduer may submit one stage twice even if the first running taskset for this stage is not finished

2016-04-22 Thread yixiaohua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253520#comment-15253520 ] yixiaohua edited comment on SPARK-14658 at 4/22/16 8:18 AM: Owen thanks for

[jira] [Commented] (SPARK-14658) when executor lost DagScheduer may submit one stage twice even if the first running taskset for this stage is not finished

2016-04-22 Thread yixiaohua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253520#comment-15253520 ] yixiaohua commented on SPARK-14658: --- Owen thanks for your attention ,but i think it is not the

[jira] [Commented] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-04-22 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253518#comment-15253518 ] Sun Rui commented on SPARK-13178: - This is fixed as the SparkR unit tests can pass after removing the

[jira] [Commented] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253517#comment-15253517 ] Apache Spark commented on SPARK-13178: -- User 'sun-rui' has created a pull request for this issue:

[jira] [Assigned] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13178: Assignee: (was: Apache Spark) > RRDD faces with concurrency issue in case of

[jira] [Assigned] (SPARK-13178) RRDD faces with concurrency issue in case of rdd.zip(rdd).count()

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-13178: Assignee: Apache Spark > RRDD faces with concurrency issue in case of

[jira] [Commented] (SPARK-14812) ML 2.0 QA: API: Experimental, DeveloperApi, final, sealed audit

2016-04-22 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253508#comment-15253508 ] Nick Pentreath commented on SPARK-14812: I would like to keep ALS experimental until SPARK-13857

[jira] [Commented] (SPARK-10001) Allow Ctrl-C in spark-shell to kill running job

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253486#comment-15253486 ] Apache Spark commented on SPARK-10001: -- User 'rxin' has created a pull request for this issue:

[jira] [Commented] (SPARK-12660) Rewrite except using anti-join

2016-04-22 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253480#comment-15253480 ] Herman van Hovell commented on SPARK-12660: --- Yeah this can be done now. > Rewrite except using

[jira] [Updated] (SPARK-14838) Implement statistics in SerializeFromObject to avoid failure when estimating sizeInBytes for ObjectType

2016-04-22 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-14838: Summary: Implement statistics in SerializeFromObject to avoid failure when estimating

[jira] [Commented] (SPARK-14706) Python ML persistence integration test

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253470#comment-15253470 ] Apache Spark commented on SPARK-14706: -- User 'yinxusen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14706) Python ML persistence integration test

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14706: Assignee: (was: Apache Spark) > Python ML persistence integration test >

[jira] [Assigned] (SPARK-14706) Python ML persistence integration test

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14706: Assignee: Apache Spark > Python ML persistence integration test >

[jira] [Updated] (SPARK-14806) Alias original Hive options in Spark SQL conf

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14806: Description: There are couple options we should alias: spark.sql.variable.substitute and

[jira] [Commented] (SPARK-14594) Improve error messages for RDD API

2016-04-22 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253462#comment-15253462 ] Marco Gaido commented on SPARK-14594: - Yes, it works with few data. But if you put a lot of data

[jira] [Commented] (SPARK-14842) Implement view creation in sql/core

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253450#comment-15253450 ] Apache Spark commented on SPARK-14842: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14842) Implement view creation in sql/core

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14842: Assignee: Reynold Xin (was: Apache Spark) > Implement view creation in sql/core >

[jira] [Assigned] (SPARK-14842) Implement view creation in sql/core

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14842: Assignee: Apache Spark (was: Reynold Xin) > Implement view creation in sql/core >

[jira] [Created] (SPARK-14842) Implement view creation in sql/core

2016-04-22 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14842: --- Summary: Implement view creation in sql/core Key: SPARK-14842 URL: https://issues.apache.org/jira/browse/SPARK-14842 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-12660) Rewrite except using anti-join

2016-04-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253422#comment-15253422 ] Xiao Li commented on SPARK-12660: - Thanks! > Rewrite except using anti-join >

[jira] [Assigned] (SPARK-14841) Move SQLBuilder into sql/core

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14841: Assignee: Apache Spark (was: Reynold Xin) > Move SQLBuilder into sql/core >

[jira] [Commented] (SPARK-14841) Move SQLBuilder into sql/core

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253421#comment-15253421 ] Apache Spark commented on SPARK-14841: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14841) Move SQLBuilder into sql/core

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14841: Assignee: Reynold Xin (was: Apache Spark) > Move SQLBuilder into sql/core >

[jira] [Created] (SPARK-14841) Move SQLBuilder into sql/core

2016-04-22 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14841: --- Summary: Move SQLBuilder into sql/core Key: SPARK-14841 URL: https://issues.apache.org/jira/browse/SPARK-14841 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14525: Assignee: Apache Spark > DataFrameWriter's save method should delegate to jdbc for jdbc

[jira] [Assigned] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14525: Assignee: (was: Apache Spark) > DataFrameWriter's save method should delegate to jdbc

[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253406#comment-15253406 ] Apache Spark commented on SPARK-14525: -- User 'JustinPihony' has created a pull request for this

<    1   2   3