[jira] [Commented] (SPARK-14856) Returning batch unexpected from wide table

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254592#comment-15254592 ] Apache Spark commented on SPARK-14856: -- User 'davies' has created a pull request for this issue:

[jira] [Resolved] (SPARK-14796) Add spark.sql.optimizer.inSetConversionThreshold config option

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14796. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.0.0 > Add

[jira] [Resolved] (SPARK-12543) Support subquery in select/where/having

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-12543. Resolution: Fixed Assignee: Herman van Hovell (was: Davies Liu) Fix Version/s:

[jira] [Commented] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-22 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254641#comment-15254641 ] Felix Cheung commented on SPARK-14831: -- 2. +1 read.spark.model and write.spark.model might be more

[jira] [Commented] (SPARK-14489) RegressionEvaluator returns NaN for ALS in Spark ml

2016-04-22 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254680#comment-15254680 ] Seth Hendrickson commented on SPARK-14489: -- This is an interesting idea. I would say that under

[jira] [Commented] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-22 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254709#comment-15254709 ] Herman van Hovell commented on SPARK-14773: --- Sure > Enable the tests in HiveCompatibilitySuite

[jira] [Issue Comment Deleted] (SPARK-14797) Spark SQL should not hardcode dependency on spark-sketch_2.11

2016-04-22 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated SPARK-14797: -- Comment: was deleted (was: [~joshrosen], I can no longer build using maven after this commit. I'm

[jira] [Assigned] (SPARK-12148) SparkR: rename DataFrame to SparkDataFrame

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12148: Assignee: Apache Spark > SparkR: rename DataFrame to SparkDataFrame >

[jira] [Commented] (SPARK-11057) SQL: corr and cov for many columns

2016-04-22 Thread Sajjad Bey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254778#comment-15254778 ] Sajjad Bey commented on SPARK-11057: I have been trying to use correlation on a matrix with many

[jira] [Assigned] (SPARK-14615) Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14615: Assignee: DB Tsai (was: Apache Spark) > Use the new ML Vector and Matrix in the ML

[jira] [Assigned] (SPARK-14615) Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14615: Assignee: Apache Spark (was: DB Tsai) > Use the new ML Vector and Matrix in the ML

[jira] [Commented] (SPARK-14615) Use the new ML Vector and Matrix in the ML pipeline based algorithms

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254901#comment-15254901 ] Apache Spark commented on SPARK-14615: -- User 'dbtsai' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-14846) Driver process fails to terminate when graceful shutdown is used

2016-04-22 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254628#comment-15254628 ] Shixiong Zhu edited comment on SPARK-14846 at 4/22/16 8:39 PM: ---

[jira] [Commented] (SPARK-14846) Driver process fails to terminate when graceful shutdown is used

2016-04-22 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254628#comment-15254628 ] Shixiong Zhu commented on SPARK-14846: -- `awaitTermination` doesn't need to wait at least one hour.

[jira] [Commented] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254666#comment-15254666 ] Apache Spark commented on SPARK-14859: -- User 'njwhite' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14859: Assignee: (was: Apache Spark) > [PYSPARK] Make Lambda Serializer Configurable >

[jira] [Assigned] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14859: Assignee: Apache Spark > [PYSPARK] Make Lambda Serializer Configurable >

[jira] [Commented] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable

2016-04-22 Thread Nick White (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254665#comment-15254665 ] Nick White commented on SPARK-14859: I've got a PR for this here:

[jira] [Assigned] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-14785: -- Assignee: Davies Liu > Support correlated scalar subquery >

[jira] [Updated] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14773: --- Assignee: Herman van Hovell > Enable the tests in HiveCompatibilitySuite for subquery >

[jira] [Commented] (SPARK-14773) Enable the tests in HiveCompatibilitySuite for subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254622#comment-15254622 ] Davies Liu commented on SPARK-14773: [~hvanhovell] Could you take this? > Enable the tests in

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Description: For example: {code} SELECT a from t where b > (select avg(c) from t2 where t.id =

[jira] [Commented] (SPARK-14858) Push predicates with subquery

2016-04-22 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254708#comment-15254708 ] Herman van Hovell commented on SPARK-14858: --- You want me to take this one? > Push predicates

[jira] [Commented] (SPARK-14858) Push predicates with subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254744#comment-15254744 ] Davies Liu commented on SPARK-14858: I created a few JIRA related to subquery, it will be great if

[jira] [Commented] (SPARK-14858) Push predicates with subquery

2016-04-22 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254759#comment-15254759 ] Herman van Hovell commented on SPARK-14858: --- Sure I'll take a stab at them; would to great

[jira] [Commented] (SPARK-14594) Improve error messages for RDD API

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254775#comment-15254775 ] Apache Spark commented on SPARK-14594: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-14594) Improve error messages for RDD API

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14594: Assignee: Apache Spark > Improve error messages for RDD API >

[jira] [Assigned] (SPARK-14594) Improve error messages for RDD API

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14594: Assignee: (was: Apache Spark) > Improve error messages for RDD API >

[jira] [Created] (SPARK-14863) Cache TreeNode's hashCode

2016-04-22 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-14863: -- Summary: Cache TreeNode's hashCode Key: SPARK-14863 URL: https://issues.apache.org/jira/browse/SPARK-14863 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-14834) Force adding doc for new api in pyspark with @since annotation

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254639#comment-15254639 ] Reynold Xin commented on SPARK-14834: - What is this ticket about? > Force adding doc for new api in

[jira] [Commented] (SPARK-12148) SparkR: rename DataFrame to SparkDataFrame

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254762#comment-15254762 ] Apache Spark commented on SPARK-12148: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-12148) SparkR: rename DataFrame to SparkDataFrame

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12148: Assignee: (was: Apache Spark) > SparkR: rename DataFrame to SparkDataFrame >

[jira] [Assigned] (SPARK-14860) Fix flaky test: o.a.s.sql.util.ContinuousQueryListenerSuite "event ordering"

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14860: Assignee: Apache Spark (was: Shixiong Zhu) > Fix flaky test:

[jira] [Commented] (SPARK-14860) Fix flaky test: o.a.s.sql.util.ContinuousQueryListenerSuite "event ordering"

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254808#comment-15254808 ] Apache Spark commented on SPARK-14860: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14860) Fix flaky test: o.a.s.sql.util.ContinuousQueryListenerSuite "event ordering"

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14860: Assignee: Shixiong Zhu (was: Apache Spark) > Fix flaky test:

[jira] [Assigned] (SPARK-14861) Replace internal usages of SQLContext with SparkSession

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14861: Assignee: Andrew Or (was: Apache Spark) > Replace internal usages of SQLContext with

[jira] [Assigned] (SPARK-14861) Replace internal usages of SQLContext with SparkSession

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14861: Assignee: Apache Spark (was: Andrew Or) > Replace internal usages of SQLContext with

[jira] [Commented] (SPARK-14861) Replace internal usages of SQLContext with SparkSession

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254828#comment-15254828 ] Apache Spark commented on SPARK-14861: -- User 'andrewor14' has created a pull request for this issue:

[jira] [Resolved] (SPARK-14701) checkpointWriter is stopped before eventLoop. Hence rejectedExecution exception is coming in StreamingContext.stop

2016-04-22 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-14701. -- Resolution: Fixed Assignee: Liwei Lin Fix Version/s: 2.0.0 > checkpointWriter

[jira] [Created] (SPARK-14862) Tree and ensemble classification: do not require label metadata

2016-04-22 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-14862: - Summary: Tree and ensemble classification: do not require label metadata Key: SPARK-14862 URL: https://issues.apache.org/jira/browse/SPARK-14862 Project:

[jira] [Assigned] (SPARK-14863) Cache TreeNode's hashCode

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14863: Assignee: Josh Rosen (was: Apache Spark) > Cache TreeNode's hashCode >

[jira] [Assigned] (SPARK-14863) Cache TreeNode's hashCode

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14863: Assignee: Apache Spark (was: Josh Rosen) > Cache TreeNode's hashCode >

[jira] [Commented] (SPARK-14863) Cache TreeNode's hashCode

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14863?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254878#comment-15254878 ] Apache Spark commented on SPARK-14863: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Created] (SPARK-14858) Push predicates with subquery

2016-04-22 Thread Davies Liu (JIRA)
Davies Liu created SPARK-14858: -- Summary: Push predicates with subquery Key: SPARK-14858 URL: https://issues.apache.org/jira/browse/SPARK-14858 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-14594) Improve error messages for RDD API

2016-04-22 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254631#comment-15254631 ] Felix Cheung commented on SPARK-14594: -- I see - it is likely then the JVM process died running out

[jira] [Comment Edited] (SPARK-14831) Make ML APIs in SparkR consistent

2016-04-22 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254641#comment-15254641 ] Felix Cheung edited comment on SPARK-14831 at 4/22/16 8:45 PM: --- 2. +1

[jira] [Created] (SPARK-14859) [PYSPARK] Make Lambda Serializer Configurable

2016-04-22 Thread Nick White (JIRA)
Nick White created SPARK-14859: -- Summary: [PYSPARK] Make Lambda Serializer Configurable Key: SPARK-14859 URL: https://issues.apache.org/jira/browse/SPARK-14859 Project: Spark Issue Type:

[jira] [Updated] (SPARK-14785) Support correlated scalar subquery

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-14785: --- Description: For example: SELECT a from t where b > (select avg(c) from t2 where t.id = t2.id) it

[jira] [Commented] (SPARK-14797) Spark SQL should not hardcode dependency on spark-sketch_2.11

2016-04-22 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254746#comment-15254746 ] Ryan Blue commented on SPARK-14797: --- [~joshrosen], I can no longer build using maven after this commit.

[jira] [Created] (SPARK-14861) Replace internal usages of SQLContext with SparkSession

2016-04-22 Thread Andrew Or (JIRA)
Andrew Or created SPARK-14861: - Summary: Replace internal usages of SQLContext with SparkSession Key: SPARK-14861 URL: https://issues.apache.org/jira/browse/SPARK-14861 Project: Spark Issue

[jira] [Assigned] (SPARK-14856) Returning batch unexpected from wide table

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14856: Assignee: Apache Spark (was: Davies Liu) > Returning batch unexpected from wide table >

[jira] [Assigned] (SPARK-14856) Returning batch unexpected from wide table

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14856: Assignee: Davies Liu (was: Apache Spark) > Returning batch unexpected from wide table >

[jira] [Closed] (SPARK-14347) Require Java 8 for Spark 2.x

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-14347. --- Resolution: Later > Require Java 8 for Spark 2.x > > >

[jira] [Assigned] (SPARK-14852) Update GeneralizedLinearRegressionSummary API

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14852: Assignee: Apache Spark (was: Joseph K. Bradley) > Update

[jira] [Commented] (SPARK-14852) Update GeneralizedLinearRegressionSummary API

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254813#comment-15254813 ] Apache Spark commented on SPARK-14852: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14852) Update GeneralizedLinearRegressionSummary API

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14852: Assignee: Joseph K. Bradley (was: Apache Spark) > Update

[jira] [Assigned] (SPARK-14862) Tree and ensemble classification: do not require label metadata

2016-04-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-14862: - Assignee: Joseph K. Bradley > Tree and ensemble classification: do not require

[jira] [Updated] (SPARK-14862) Tree and ensemble classification: do not require label metadata

2016-04-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14862: -- Description: spark.ml DecisionTreeClassifier, RandomForestClassifier, and

[jira] [Created] (SPARK-14865) When creating a view, we should verify the generated SQL string

2016-04-22 Thread Yin Huai (JIRA)
Yin Huai created SPARK-14865: Summary: When creating a view, we should verify the generated SQL string Key: SPARK-14865 URL: https://issues.apache.org/jira/browse/SPARK-14865 Project: Spark

[jira] [Assigned] (SPARK-14800) Dealing with null as a value in options for each internal data source

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14800: Assignee: Apache Spark > Dealing with null as a value in options for each internal data

[jira] [Commented] (SPARK-14800) Dealing with null as a value in options for each internal data source

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14800?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255099#comment-15255099 ] Apache Spark commented on SPARK-14800: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-14866) Break SQLQuerySuite out into smaller test suites

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255117#comment-15255117 ] Apache Spark commented on SPARK-14866: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14866) Break SQLQuerySuite out into smaller test suites

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14866: Assignee: Apache Spark (was: Reynold Xin) > Break SQLQuerySuite out into smaller test

[jira] [Assigned] (SPARK-14866) Break SQLQuerySuite out into smaller test suites

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14866: Assignee: Reynold Xin (was: Apache Spark) > Break SQLQuerySuite out into smaller test

[jira] [Created] (SPARK-14866) Break SQLQuerySuite out into smaller test suites

2016-04-22 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-14866: --- Summary: Break SQLQuerySuite out into smaller test suites Key: SPARK-14866 URL: https://issues.apache.org/jira/browse/SPARK-14866 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14694) Thrift Server + Hive Metastore + Kerberos doesn't work

2016-04-22 Thread zhangguancheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255036#comment-15255036 ] zhangguancheng commented on SPARK-14694: Content of hive-site.xml: {quote}

[jira] [Assigned] (SPARK-14800) Dealing with null as a value in options for each internal data source

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14800: Assignee: (was: Apache Spark) > Dealing with null as a value in options for each

[jira] [Created] (SPARK-14867) Make `build/mvn` to use the downloaded maven if it exist.

2016-04-22 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-14867: - Summary: Make `build/mvn` to use the downloaded maven if it exist. Key: SPARK-14867 URL: https://issues.apache.org/jira/browse/SPARK-14867 Project: Spark

[jira] [Assigned] (SPARK-14867) Make `build/mvn` to use the downloaded maven if it exist.

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14867: Assignee: Apache Spark > Make `build/mvn` to use the downloaded maven if it exist. >

[jira] [Assigned] (SPARK-14867) Make `build/mvn` to use the downloaded maven if it exist.

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14867: Assignee: (was: Apache Spark) > Make `build/mvn` to use the downloaded maven if it

[jira] [Commented] (SPARK-14867) Make `build/mvn` to use the downloaded maven if it exist.

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255144#comment-15255144 ] Apache Spark commented on SPARK-14867: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Resolved] (SPARK-14863) Cache TreeNode's hashCode

2016-04-22 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-14863. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12626

[jira] [Resolved] (SPARK-14855) Add "Exec" suffix to all physical operators

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14855. - Resolution: Fixed Fix Version/s: 2.0.0 > Add "Exec" suffix to all physical operators >

[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-22 Thread Justin Pihony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255130#comment-15255130 ] Justin Pihony commented on SPARK-14525: --- To address any concerns about taking Properties to a

[jira] [Resolved] (SPARK-14832) Refactor DataSource to ensure schema is inferred only once when creating a file stream

2016-04-22 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-14832. -- Resolution: Fixed Fix Version/s: 2.0.0 > Refactor DataSource to ensure schema is

[jira] [Commented] (SPARK-14654) New accumulator API

2016-04-22 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255123#comment-15255123 ] holdenk commented on SPARK-14654: - Giving it a bit of thought on the flight, is there a reason why we

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2016-04-22 Thread Ben McCann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255122#comment-15255122 ] Ben McCann commented on SPARK-7008: --- I've found a number of implementations:

[jira] [Closed] (SPARK-13891) Issue an exception when hitting max iteration limit in testing

2016-04-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li closed SPARK-13891. --- Resolution: Duplicate > Issue an exception when hitting max iteration limit in testing >

[jira] [Created] (SPARK-14868) Enable NewLineAtEofChecker in checkstyle and fix lint-java errors

2016-04-22 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-14868: - Summary: Enable NewLineAtEofChecker in checkstyle and fix lint-java errors Key: SPARK-14868 URL: https://issues.apache.org/jira/browse/SPARK-14868 Project: Spark

[jira] [Created] (SPARK-14864) [MLLIB] Implement Doc2Vec

2016-04-22 Thread Peter Mountanos (JIRA)
Peter Mountanos created SPARK-14864: --- Summary: [MLLIB] Implement Doc2Vec Key: SPARK-14864 URL: https://issues.apache.org/jira/browse/SPARK-14864 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-14817) ML 2.0 QA: Programming guide update and migration guide

2016-04-22 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254974#comment-15254974 ] zhengruifeng commented on SPARK-14817: -- count me in too ☺ > ML 2.0 QA: Programming guide update and

[jira] [Commented] (SPARK-14654) New accumulator API

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255159#comment-15255159 ] Reynold Xin commented on SPARK-14654: - That only works for the built-in ones and can't work if there

[jira] [Resolved] (SPARK-14866) Break SQLQuerySuite out into smaller test suites

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14866. - Resolution: Fixed Fix Version/s: 2.0.0 > Break SQLQuerySuite out into smaller test suites

[jira] [Resolved] (SPARK-14551) Reduce number of NameNode calls in OrcRelation with FileSourceStrategy mode

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14551. - Resolution: Fixed Assignee: Rajesh Balamohan Fix Version/s: 2.0.0 > Reduce

[jira] [Assigned] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14757: Assignee: (was: Apache Spark) > Incorrect behavior of Join operation in Spqrk SQL

[jira] [Commented] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254939#comment-15254939 ] Apache Spark commented on SPARK-14757: -- User 'arashn' has created a pull request for this issue:

[jira] [Assigned] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-22 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14757: Assignee: Apache Spark > Incorrect behavior of Join operation in Spqrk SQL JOIN : "false"

[jira] [Comment Edited] (SPARK-14694) Thrift Server + Hive Metastore + Kerberos doesn't work

2016-04-22 Thread zhangguancheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255036#comment-15255036 ] zhangguancheng edited comment on SPARK-14694 at 4/23/16 2:08 AM: - Content

[jira] [Resolved] (SPARK-14842) Implement view creation in sql/core

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14842. - Resolution: Fixed Fix Version/s: 2.0.0 > Implement view creation in sql/core >

[jira] [Commented] (SPARK-14525) DataFrameWriter's save method should delegate to jdbc for jdbc datasource

2016-04-22 Thread Justin Pihony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255136#comment-15255136 ] Justin Pihony commented on SPARK-14525: --- If I am to update the jdbc.DefaultSource to be a

[jira] [Comment Edited] (SPARK-14865) When creating a view, we should verify the generated SQL string

2016-04-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255137#comment-15255137 ] Xiao Li edited comment on SPARK-14865 at 4/23/16 4:54 AM: -- If nobody starts it,

[jira] [Commented] (SPARK-14865) When creating a view, we should verify the generated SQL string

2016-04-22 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15255137#comment-15255137 ] Xiao Li commented on SPARK-14865: - If nobody starts it, I can take it? We just verify if the SQL string

[jira] [Resolved] (SPARK-14582) Increase the parallelism for small tables

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-14582. - Resolution: Fixed Fix Version/s: 2.0.0 > Increase the parallelism for small tables >

[jira] [Commented] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-22 Thread Arash Nabili (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254949#comment-15254949 ] Arash Nabili commented on SPARK-14757: -- I have submitted a PR to the master branch which should fix

[jira] [Resolved] (SPARK-14807) Create a compatibility module

2016-04-22 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-14807. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12580

[jira] [Updated] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14757: Target Version/s: 1.6.2, 2.0.0 (was: 1.6.1, 2.0.0) > Incorrect behavior of Join operation in

[jira] [Updated] (SPARK-14757) Incorrect behavior of Join operation in Spqrk SQL JOIN : "false" in the left table is joined to "null" on the right table

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14757: Target Version/s: 1.6.1, 2.0.0 > Incorrect behavior of Join operation in Spqrk SQL JOIN : "false"

[jira] [Resolved] (SPARK-14856) Returning batch unexpected from wide table

2016-04-22 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-14856. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12619

[jira] [Updated] (SPARK-14806) Alias original Hive options in Spark SQL conf

2016-04-22 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-14806: Description: There are couple options we should alias: spark.sql.variable.substitute and

[jira] [Commented] (SPARK-14658) when executor lost DagScheduer may submit one stage twice even if the first running taskset for this stage is not finished

2016-04-22 Thread yixiaohua (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253520#comment-15253520 ] yixiaohua commented on SPARK-14658: --- Owen thanks for your attention ,but i think it is not the

  1   2   3   >