[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744519#comment-15744519 ] Shivaram Venkataraman commented on SPARK-18823: --- Ah I see your point - `wit

[jira] [Resolved] (SPARK-18804) Join doesn't work in Spark on Bigger tables

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18804. --- Resolution: Not A Problem (Please don't reopen if the discussion has not meaningfully changed. JIRA i

[jira] [Closed] (SPARK-18804) Join doesn't work in Spark on Bigger tables

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-18804. - > Join doesn't work in Spark on Bigger tables > --- > >

[jira] [Commented] (SPARK-18835) Do not expose shaded types in JavaTypeInference API

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744600#comment-15744600 ] Sean Owen commented on SPARK-18835: --- (Ideally we should fix this for 2.1.0 because it c

[jira] [Updated] (SPARK-18837) It will not hidden if job or stage description too long

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18837: -- Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) I agree that at least this need

[jira] [Updated] (SPARK-18837) Very long stage descriptions do not wrap in the UI

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18837: -- Summary: Very long stage descriptions do not wrap in the UI (was: It will not hidden if job or stage d

[jira] [Assigned] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18840: Assignee: Apache Spark > HDFSCredentialProvider throws exception in non-HDFS security envi

[jira] [Assigned] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18840: Assignee: (was: Apache Spark) > HDFSCredentialProvider throws exception in non-HDFS se

[jira] [Commented] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744670#comment-15744670 ] Apache Spark commented on SPARK-18840: -- User 'jerryshao' has created a pull request

[jira] [Comment Edited] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744370#comment-15744370 ] Saisai Shao edited comment on SPARK-18840 at 12/13/16 9:30 AM:

[jira] [Commented] (SPARK-18650) race condition in FileScanRDD.scala

2016-12-13 Thread Soumabrata Chakraborty (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744695#comment-15744695 ] Soumabrata Chakraborty commented on SPARK-18650: I am facing the same iss

[jira] [Comment Edited] (SPARK-18650) race condition in FileScanRDD.scala

2016-12-13 Thread Soumabrata Chakraborty (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744695#comment-15744695 ] Soumabrata Chakraborty edited comment on SPARK-18650 at 12/13/16 9:43 AM: -

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-13 Thread Vicente Masip (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744740#comment-15744740 ] Vicente Masip commented on SPARK-18823: --- Yes. I've been able to do it with your sug

[jira] [Updated] (SPARK-18752) "isSrcLocal" parameter to Hive loadTable / loadPartition should come from user

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18752: -- Assignee: Marcelo Vanzin > "isSrcLocal" parameter to Hive loadTable / loadPartition should come from us

[jira] [Updated] (SPARK-18796) StreamingQueryManager should not hold a lock when starting a query

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18796: -- Assignee: Shixiong Zhu > StreamingQueryManager should not hold a lock when starting a query > -

[jira] [Updated] (SPARK-18797) Update spark.logit in sparkr-vignettes

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18797: -- Assignee: Miao Wang > Update spark.logit in sparkr-vignettes > -- >

[jira] [Updated] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18642: -- Assignee: Dongjoon Hyun > Spark SQL: Catalyst is scanning undesired columns > -

[jira] [Commented] (SPARK-17890) scala.ScalaReflectionException

2016-12-13 Thread Arkadiusz Komarzewski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15744964#comment-15744964 ] Arkadiusz Komarzewski commented on SPARK-17890: --- I hit this error recently,

[jira] [Created] (SPARK-18841) PushProjectionThroughUnion exception when there are same column

2016-12-13 Thread Song Jun (JIRA)
Song Jun created SPARK-18841: Summary: PushProjectionThroughUnion exception when there are same column Key: SPARK-18841 URL: https://issues.apache.org/jira/browse/SPARK-18841 Project: Spark Issu

[jira] [Created] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-13 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-18842: Summary: De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows Key: SPARK-18842 URL: https://issues.apache.org/jira/

[jira] [Commented] (SPARK-18609) [SQL] column mixup with CROSS JOIN

2016-12-13 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745309#comment-15745309 ] Song Jun commented on SPARK-18609: -- open another jira SPARK-18841 > [SQL] column mixup

[jira] [Commented] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745319#comment-15745319 ] Apache Spark commented on SPARK-18842: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18842: Assignee: (was: Apache Spark) > De-duplicate paths in classpaths in processes for loca

[jira] [Assigned] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18842: Assignee: Apache Spark > De-duplicate paths in classpaths in processes for local-cluster m

[jira] [Updated] (SPARK-18841) PushProjectionThroughUnion exception when there are same column

2016-12-13 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18841: -- Description: {noformat} DROP TABLE IF EXISTS p1 ; DROP TABLE IF EXISTS p2 ; DROP TABLE

[jira] [Assigned] (SPARK-18841) PushProjectionThroughUnion exception when there are same column

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18841: Assignee: Apache Spark > PushProjectionThroughUnion exception when there are same column >

[jira] [Commented] (SPARK-18841) PushProjectionThroughUnion exception when there are same column

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745339#comment-15745339 ] Apache Spark commented on SPARK-18841: -- User 'windpiger' has created a pull request

[jira] [Assigned] (SPARK-18841) PushProjectionThroughUnion exception when there are same column

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18841: Assignee: (was: Apache Spark) > PushProjectionThroughUnion exception when there are sa

[jira] [Commented] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745350#comment-15745350 ] Hyukjin Kwon commented on SPARK-18842: -- This completes the parent task and I can pro

[jira] [Commented] (SPARK-3012) Standardized Distance Functions between two Vectors for MLlib

2016-12-13 Thread Dhaval Modi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745388#comment-15745388 ] Dhaval Modi commented on SPARK-3012: I have implemented Mahalanobis Distance in Spark

[jira] [Updated] (SPARK-18842) De-duplicate paths in classpaths in processes for local-cluster mode to work around the length limitation on Windows

2016-12-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-18842: - Priority: Major (was: Minor) > De-duplicate paths in classpaths in processes for local-cluster m

[jira] [Commented] (SPARK-18823) Assignation by column name variable not available or bug?

2016-12-13 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745633#comment-15745633 ] Shivaram Venkataraman commented on SPARK-18823: --- Thanks [~masip85] for veri

[jira] [Resolved] (SPARK-18675) CTAS for hive serde table should work for all hive versions

2016-12-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-18675. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16104 [https://github.com/

[jira] [Resolved] (SPARK-13747) Concurrent execution in SQL doesn't work with Scala ForkJoinPool

2016-12-13 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-13747. -- Resolution: Fixed Fix Version/s: (was: 2.0.2) (was: 2.1.0)

[jira] [Resolved] (SPARK-18835) Do not expose shaded types in JavaTypeInference API

2016-12-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-18835. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.1.1 > Do not ex

[jira] [Created] (SPARK-18843) Fix timeout in awaitResultInForkJoinSafely

2016-12-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18843: Summary: Fix timeout in awaitResultInForkJoinSafely Key: SPARK-18843 URL: https://issues.apache.org/jira/browse/SPARK-18843 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18786) pySpark SQLContext.getOrCreate(sc) take stopped sparkContext

2016-12-13 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745867#comment-15745867 ] Bryan Cutler commented on SPARK-18786: -- The problem is that {{SQLContext.getOrCreate

[jira] [Assigned] (SPARK-18843) Fix timeout in awaitResultInForkJoinSafely

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18843: Assignee: Apache Spark (was: Shixiong Zhu) > Fix timeout in awaitResultInForkJoinSafely >

[jira] [Commented] (SPARK-18843) Fix timeout in awaitResultInForkJoinSafely

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745871#comment-15745871 ] Apache Spark commented on SPARK-18843: -- User 'zsxwing' has created a pull request fo

[jira] [Assigned] (SPARK-18843) Fix timeout in awaitResultInForkJoinSafely

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18843: Assignee: Shixiong Zhu (was: Apache Spark) > Fix timeout in awaitResultInForkJoinSafely >

[jira] [Updated] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-18840: --- Assignee: Saisai Shao > HDFSCredentialProvider throws exception in non-HDFS security environm

[jira] [Updated] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-18840: --- Fix Version/s: 2.2.0 2.1.1 > HDFSCredentialProvider throws exception in no

[jira] [Updated] (SPARK-18835) Do not expose shaded types in JavaTypeInference API

2016-12-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-18835: --- Fix Version/s: 2.2.0 > Do not expose shaded types in JavaTypeInference API >

[jira] [Commented] (SPARK-18840) HDFSCredentialProvider throws exception in non-HDFS security environment

2016-12-13 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745879#comment-15745879 ] Marcelo Vanzin commented on SPARK-18840: [~jerryshao] are you planning to fix thi

[jira] [Commented] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2016-12-13 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15745944#comment-15745944 ] Matt Cheah commented on SPARK-18278: [~rxin] - thanks for thinking about this! The c

[jira] [Created] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-13 Thread Zak Patterson (JIRA)
Zak Patterson created SPARK-18844: - Summary: Add more binary classification metrics to BinaryClassificationMetrics Key: SPARK-18844 URL: https://issues.apache.org/jira/browse/SPARK-18844 Project: Spar

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-13 Thread Zak Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zak Patterson updated SPARK-18844: -- Remaining Estimate: 5h (was: 5m) Original Estimate: 5h (was: 5m) > Add more binary class

[jira] [Updated] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-13 Thread Zak Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zak Patterson updated SPARK-18844: -- Description: BinaryClassificationMetrics only implements Precision (positive predictive value)

[jira] [Commented] (SPARK-18281) toLocalIterator yields time out error on pyspark2

2016-12-13 Thread Mike Dusenberry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746002#comment-15746002 ] Mike Dusenberry commented on SPARK-18281: - [~viirya] Thanks for taking on this bu

[jira] [Commented] (SPARK-18281) toLocalIterator yields time out error on pyspark2

2016-12-13 Thread Mike Dusenberry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746023#comment-15746023 ] Mike Dusenberry commented on SPARK-18281: - Here's another interesting finding. T

[jira] [Comment Edited] (SPARK-18281) toLocalIterator yields time out error on pyspark2

2016-12-13 Thread Mike Dusenberry (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746023#comment-15746023 ] Mike Dusenberry edited comment on SPARK-18281 at 12/13/16 7:56 PM:

[jira] [Commented] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746212#comment-15746212 ] Sean Owen commented on SPARK-18844: --- Yeah I think we discussed something like this befo

[jira] [Commented] (SPARK-18838) High latency of event processing for large jobs

2016-12-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746217#comment-15746217 ] Shixiong Zhu commented on SPARK-18838: -- [~sitalke...@gmail.com] Instead of your prop

[jira] [Commented] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2016-12-13 Thread Zak Patterson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746293#comment-15746293 ] Zak Patterson commented on SPARK-18844: --- I'm not familiar with the python API much,

[jira] [Created] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-13 Thread Andrew Ray (JIRA)
Andrew Ray created SPARK-18845: -- Summary: PageRank has incorrect initialization value that leads to slow convergence Key: SPARK-18845 URL: https://issues.apache.org/jira/browse/SPARK-18845 Project: Spark

[jira] [Created] (SPARK-18846) Fix flakiness in SchedulerIntegrationSuite

2016-12-13 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-18846: Summary: Fix flakiness in SchedulerIntegrationSuite Key: SPARK-18846 URL: https://issues.apache.org/jira/browse/SPARK-18846 Project: Spark Issue Type: Test

[jira] [Commented] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746316#comment-15746316 ] Sean Owen commented on SPARK-18845: --- See https://issues.apache.org/jira/browse/SPARK-70

[jira] [Resolved] (SPARK-18715) Fix wrong AIC calculation in Binomial GLM

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18715. --- Resolution: Fixed Issue resolved by pull request 16149 [https://github.com/apache/spark/pull/16149]

[jira] [Updated] (SPARK-18715) Fix wrong AIC calculation in Binomial GLM

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18715: -- Assignee: Wayne Zhang Priority: Major (was: Critical) > Fix wrong AIC calculation in Binomial GLM

[jira] [Resolved] (SPARK-18471) In treeAggregate, generate (big) zeros instead of sending them.

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18471. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16037 [https://github.co

[jira] [Updated] (SPARK-18471) In treeAggregate, generate (big) zeros instead of sending them.

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18471: -- Assignee: Anthony Truchet > In treeAggregate, generate (big) zeros instead of sending them. > -

[jira] [Created] (SPARK-18847) PageRank gives incorrect results for graphs with sinks

2016-12-13 Thread Andrew Ray (JIRA)
Andrew Ray created SPARK-18847: -- Summary: PageRank gives incorrect results for graphs with sinks Key: SPARK-18847 URL: https://issues.apache.org/jira/browse/SPARK-18847 Project: Spark Issue Type

[jira] [Assigned] (SPARK-18846) Fix flakiness in SchedulerIntegrationSuite

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18846: Assignee: Apache Spark (was: Imran Rashid) > Fix flakiness in SchedulerIntegrationSuite >

[jira] [Assigned] (SPARK-18846) Fix flakiness in SchedulerIntegrationSuite

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18846: Assignee: Imran Rashid (was: Apache Spark) > Fix flakiness in SchedulerIntegrationSuite >

[jira] [Commented] (SPARK-18846) Fix flakiness in SchedulerIntegrationSuite

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746326#comment-15746326 ] Apache Spark commented on SPARK-18846: -- User 'squito' has created a pull request for

[jira] [Created] (SPARK-18848) PageRank gives incorrect results for graphs with sinks

2016-12-13 Thread Andrew Ray (JIRA)
Andrew Ray created SPARK-18848: -- Summary: PageRank gives incorrect results for graphs with sinks Key: SPARK-18848 URL: https://issues.apache.org/jira/browse/SPARK-18848 Project: Spark Issue Type

[jira] [Commented] (SPARK-18847) PageRank gives incorrect results for graphs with sinks

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746333#comment-15746333 ] Sean Owen commented on SPARK-18847: --- Before you open more can you review old JIRAs abou

[jira] [Resolved] (SPARK-18848) PageRank gives incorrect results for graphs with sinks

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18848. --- Resolution: Duplicate > PageRank gives incorrect results for graphs with sinks >

[jira] [Resolved] (SPARK-18816) executor page fails to show log links if executors are added after an app is launched

2016-12-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18816. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Issue resolved by pull request

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746350#comment-15746350 ] Joseph K. Bradley commented on SPARK-4591: -- Good point. It should be. I'll add

[jira] [Comment Edited] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746350#comment-15746350 ] Joseph K. Bradley edited comment on SPARK-4591 at 12/13/16 9:41 PM:

[jira] [Commented] (SPARK-18676) Spark 2.x query plan data size estimation can crash join queries versus 1.x

2016-12-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746370#comment-15746370 ] Davies Liu commented on SPARK-18676: I had a working prototype, but in introduce some

[jira] [Comment Edited] (SPARK-18676) Spark 2.x query plan data size estimation can crash join queries versus 1.x

2016-12-13 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746370#comment-15746370 ] Davies Liu edited comment on SPARK-18676 at 12/13/16 9:47 PM: -

[jira] [Assigned] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18845: Assignee: Apache Spark > PageRank has incorrect initialization value that leads to slow co

[jira] [Assigned] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18845: Assignee: (was: Apache Spark) > PageRank has incorrect initialization value that leads

[jira] [Commented] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746375#comment-15746375 ] Apache Spark commented on SPARK-18845: -- User 'aray' has created a pull request for t

[jira] [Commented] (SPARK-18676) Spark 2.x query plan data size estimation can crash join queries versus 1.x

2016-12-13 Thread Michael Allman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746382#comment-15746382 ] Michael Allman commented on SPARK-18676: Yeah, I was wondering how that would wor

[jira] [Commented] (SPARK-18845) PageRank has incorrect initialization value that leads to slow convergence

2016-12-13 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746385#comment-15746385 ] Andrew Ray commented on SPARK-18845: [~srowen] No that's a different thing just wheth

[jira] [Updated] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Description: This is an umbrella JIRA for porting spark.mllib implementations to use the

[jira] [Commented] (SPARK-18847) PageRank gives incorrect results for graphs with sinks

2016-12-13 Thread Andrew Ray (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746395#comment-15746395 ] Andrew Ray commented on SPARK-18847: I have and have not found any relevant. I'm curr

[jira] [Created] (SPARK-18849) Vignettes final checks for Spark 2.1

2016-12-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-18849: - Summary: Vignettes final checks for Spark 2.1 Key: SPARK-18849 URL: https://issues.apache.org/jira/browse/SPARK-18849 Project: Spark Issue Type: Documentat

[jira] [Updated] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Description: This is an umbrella JIRA for porting spark.mllib implementations to use the

[jira] [Created] (SPARK-18850) Make StreamExecution serializable

2016-12-13 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-18850: Summary: Make StreamExecution serializable Key: SPARK-18850 URL: https://issues.apache.org/jira/browse/SPARK-18850 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-18676) Spark 2.x query plan data size estimation can crash join queries versus 1.x

2016-12-13 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746414#comment-15746414 ] Reynold Xin commented on SPARK-18676: - That's the other option I was considering. It'

[jira] [Created] (SPARK-18851) DataSet limit.distinct Results in NPE in Codegen

2016-12-13 Thread Russell Spitzer (JIRA)
Russell Spitzer created SPARK-18851: --- Summary: DataSet limit.distinct Results in NPE in Codegen Key: SPARK-18851 URL: https://issues.apache.org/jira/browse/SPARK-18851 Project: Spark Issue

[jira] [Updated] (SPARK-18851) DataSet Limit into Aggregate Results in NPE in Codegen

2016-12-13 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Russell Spitzer updated SPARK-18851: Summary: DataSet Limit into Aggregate Results in NPE in Codegen (was: DataSet limit.distin

[jira] [Resolved] (SPARK-18843) Fix timeout in awaitResultInForkJoinSafely

2016-12-13 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-18843. -- Resolution: Fixed Fix Version/s: 2.1.1 2.0.3 > Fix timeout in awaitRe

[jira] [Updated] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Description: This is an umbrella JIRA for porting spark.mllib implementations to use the

[jira] [Assigned] (SPARK-18850) Make StreamExecution serializable

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18850: Assignee: Shixiong Zhu (was: Apache Spark) > Make StreamExecution serializable >

[jira] [Assigned] (SPARK-18850) Make StreamExecution serializable

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18850: Assignee: Apache Spark (was: Shixiong Zhu) > Make StreamExecution serializable >

[jira] [Commented] (SPARK-18850) Make StreamExecution serializable

2016-12-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746436#comment-15746436 ] Apache Spark commented on SPARK-18850: -- User 'zsxwing' has created a pull request fo

[jira] [Resolved] (SPARK-18834) Expose event time time stats through StreamingQueryProgress

2016-12-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-18834. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 16258 [https://g

[jira] [Updated] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4591: - Description: This is an umbrella JIRA for porting spark.mllib implementations to use the

[jira] [Commented] (SPARK-14709) spark.ml API for linear SVM

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746462#comment-15746462 ] Joseph K. Bradley commented on SPARK-14709: --- Marking myself as shepherd per the

[jira] [Updated] (SPARK-14709) spark.ml API for linear SVM

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14709: -- Shepherd: Joseph K. Bradley Target Version/s: 2.2.0 > spark.ml API for line

[jira] [Updated] (SPARK-18851) DataSet Limit into Aggregate Results in NPE in Codegen

2016-12-13 Thread Russell Spitzer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Russell Spitzer updated SPARK-18851: Labels: regresion (was: ) > DataSet Limit into Aggregate Results in NPE in Codegen > -

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2016-12-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15746471#comment-15746471 ] Joseph K. Bradley commented on SPARK-4591: -- I just updated this a bit. I did not
