[jira] [Resolved] (SPARK-18617) Close "kryo auto pick" feature for Spark Streaming

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18617. - Resolution: Fixed Assignee: Genmao Yu Fix Version/s: 2.1.0 > Close "kryo auto

[jira] [Resolved] (SPARK-18622) Missing Reference in Multi Union Clauses Cause by TypeCoercion

2016-11-29 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-18622. - Resolution: Fixed Assignee: Herman van Hovell Fix Version/s: 2.1.0 > Missing

[jira] [Resolved] (SPARK-17680) Unicode Character Support for Column Names and Comments

2016-11-29 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-17680. - Resolution: Fixed > Unicode Character Support for Column Names and Comments >

[jira] [Updated] (SPARK-18643) SparkR hangs at session start when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18643: - Description: 1) Install SparkR from source package, i.e. R CMD INSTALL SparkR_2.1.0.tar.gz 2)

[jira] [Commented] (SPARK-17934) Support percentile scale in ml.feature

2016-11-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707710#comment-15707710 ] yuhao yang commented on SPARK-17934: We can probably implement something like Robust Scaler in

[jira] [Commented] (SPARK-18643) SparkR hangs at session start when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707704#comment-15707704 ] Felix Cheung commented on SPARK-18643: -- A workaround is to start as sparkR.session(master="local") -

[jira] [Assigned] (SPARK-18643) SparkR hangs at session start when installed as a package without SPARK_HOME set

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18643: Assignee: Apache Spark (was: Felix Cheung) > SparkR hangs at session start when

[jira] [Assigned] (SPARK-18643) SparkR hangs at session start when installed as a package without SPARK_HOME set

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18643: Assignee: Felix Cheung (was: Apache Spark) > SparkR hangs at session start when

[jira] [Commented] (SPARK-18643) SparkR hangs at session start when installed as a package without SPARK_HOME set

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707701#comment-15707701 ] Apache Spark commented on SPARK-18643: -- User 'felixcheung' has created a pull request for this

[jira] [Commented] (SPARK-16848) Make jdbc() and read.format("jdbc") consistently throwing exception for user-specified schema

2016-11-29 Thread Pramod Anarase (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707699#comment-15707699 ] Pramod Anarase commented on SPARK-16848: +1 > Make jdbc() and read.format("jdbc") consistently

[jira] [Updated] (SPARK-18643) SparkR hangs at session start when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18643: - Summary: SparkR hangs at session start when installed as a package without SPARK_HOME set (was:

[jira] [Updated] (SPARK-18643) SparkR hangs when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18643: - Description: 1) Install SparkR from source package, i.e. R CMD INSTALL SparkR_2.1.0.tar.gz 2)

[jira] [Updated] (SPARK-18643) SparkR hangs when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18643: - Description: 1) Install SparkR from source package, i.e. R CMD INSTALL SparkR_2.1.0.tar.gz 2)

[jira] [Commented] (SPARK-18324) ML, Graph 2.1 QA: Programming guide update and migration guide

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707675#comment-15707675 ] Apache Spark commented on SPARK-18324: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Created] (SPARK-18643) SparkR hangs when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18643: Summary: SparkR hangs when installed as a package without SPARK_HOME set Key: SPARK-18643 URL: https://issues.apache.org/jira/browse/SPARK-18643 Project: Spark

[jira] [Commented] (SPARK-18643) SparkR hangs when installed as a package without SPARK_HOME set

2016-11-29 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707669#comment-15707669 ] Felix Cheung commented on SPARK-18643: -- Related PR: https://github.com/apache/spark/pull/15888 >

[jira] [Resolved] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-11-29 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17692. - Resolution: Fixed Fix Version/s: 2.1.0 > Document ML/MLlib behavior changes in Spark 2.1

[jira] [Commented] (SPARK-17692) Document ML/MLlib behavior changes in Spark 2.1

2016-11-29 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707665#comment-15707665 ] Yanbo Liang commented on SPARK-17692: - All behavior changes have been documented in the PR of

[jira] [Commented] (SPARK-18608) Spark ML algorithms that check RDD cache level for internal caching double-cache data

2016-11-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707663#comment-15707663 ] yuhao yang commented on SPARK-18608: Agreed. We can just add an extra parameter handlePersistence:

[jira] [Commented] (SPARK-18374) Incorrect words in StopWords/english.txt

2016-11-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707659#comment-15707659 ] yuhao yang commented on SPARK-18374: Yes. Currently we're discussing if we should put "wouldn't"

[jira] [Updated] (SPARK-18278) Support native submission of spark jobs to a kubernetes cluster

2016-11-29 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-18278: --- Attachment: SPARK-18278 - Spark on Kubernetes Design Proposal.pdf I attached a proposal outlining a

[jira] [Commented] (SPARK-18374) Incorrect words in StopWords/english.txt

2016-11-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707642#comment-15707642 ] Xiangrui Meng commented on SPARK-18374: --- See the discussion here:

[jira] [Commented] (SPARK-17680) Unicode Character Support for Column Names and Comments

2016-11-29 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707629#comment-15707629 ] Kazuaki Ishizaki commented on SPARK-17680: -- Sorry, it is my mistake. > Unicode Character

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while Sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Description: I've traced into source code, and it seems that of Sentry not set when spark sql started a

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while Sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Priority: Major (was: Minor) > Show databases NullPointerException while Sentry turned on >

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while Sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Priority: Minor (was: Major) > Show databases NullPointerException while Sentry turned on >

[jira] [Updated] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-11-29 Thread Mohit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mohit updated SPARK-18642: -- Description: When doing a left-join between two tables, say A and B, Catalyst has information about the

[jira] [Updated] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-11-29 Thread Mohit (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mohit updated SPARK-18642: -- Description: When doing a left-join between two tables, say A and B, Catalyst has information about the

[jira] [Created] (SPARK-18642) Spark SQL: Catalyst is scanning undesired columns

2016-11-29 Thread Mohit (JIRA)
Mohit created SPARK-18642: - Summary: Spark SQL: Catalyst is scanning undesired columns Key: SPARK-18642 URL: https://issues.apache.org/jira/browse/SPARK-18642 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-17732) ALTER TABLE DROP PARTITION should support comparators

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707594#comment-15707594 ] Apache Spark commented on SPARK-17732: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-18374) Incorrect words in StopWords/english.txt

2016-11-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707560#comment-15707560 ] yuhao yang commented on SPARK-18374: cc [~mengxr] to see if he recalls any specific reason. >

[jira] [Commented] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707542#comment-15707542 ] yuhao yang commented on SPARK-18531: [~tuxdna] Does it work for you? > Apache Spark FPGrowth

[jira] [Resolved] (SPARK-15819) Add KMeanSummary in KMeans of PySpark

2016-11-29 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-15819. - Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 > Add KMeanSummary in

[jira] [Resolved] (SPARK-18145) Update documentation for hive partition management in 2.1

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18145. - Resolution: Fixed Assignee: Eric Liang Fix Version/s: 2.1.0 > Update

[jira] [Resolved] (SPARK-17861) Store data source partitions in metastore and push partition pruning into metastore

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17861. - Resolution: Fixed Fix Version/s: 2.1.0 > Store data source partitions in metastore and

[jira] [Resolved] (SPARK-18632) AggregateFunction should not ImplicitCastInputTypes

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18632. - Resolution: Fixed Fix Version/s: 2.2.0 > AggregateFunction should not

[jira] [Commented] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-11-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707435#comment-15707435 ] holdenk commented on SPARK-15369: - So I'm probably going to be busy until after the 2.1 release (also

[jira] [Comment Edited] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-11-29 Thread Marius Van Niekerk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707428#comment-15707428 ] Marius Van Niekerk edited comment on SPARK-15369 at 11/30/16 3:49 AM:

[jira] [Commented] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-11-29 Thread Marius Van Niekerk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707428#comment-15707428 ] Marius Van Niekerk commented on SPARK-15369: Oh yeah, once we have a pip installable spark it

[jira] [Commented] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-11-29 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707421#comment-15707421 ] holdenk commented on SPARK-15369: - That looks like a great start :) Probably the packaging is going to be

[jira] [Commented] (SPARK-15369) Investigate selectively using Jython for parts of PySpark

2016-11-29 Thread Marius Van Niekerk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707411#comment-15707411 ] Marius Van Niekerk commented on SPARK-15369: I'm in the process of an initial stab at turning

[jira] [Commented] (SPARK-18516) Separate instantaneous state from progress performance statistics

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707399#comment-15707399 ] Apache Spark commented on SPARK-18516: -- User 'tdas' has created a pull request for this issue:

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while Sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Summary: Show databases NullPointerException while Sentry turned on (was: Show databases

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Description: I've traced into source code, and it seems that of Sentry not set when spark sql started a

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Affects Version/s: (was: 2.0.1) 2.0.0 > Show databases NullPointerException

[jira] [Updated] (SPARK-18641) Show databases NullPointerException while sentry turned on

2016-11-29 Thread zhangqw (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhangqw updated SPARK-18641: Description: I've traced into source code, and it seems that of Sentry not set when spark sql started a

[jira] [Created] (SPARK-18641) Show databases NullPointerException while sentry turned on

2016-11-29 Thread zhangqw (JIRA)
zhangqw created SPARK-18641: --- Summary: Show databases NullPointerException while sentry turned on Key: SPARK-18641 URL: https://issues.apache.org/jira/browse/SPARK-18641 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18613) spark.ml LDA classes should not expose spark.mllib in APIs

2016-11-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707320#comment-15707320 ] Joseph K. Bradley commented on SPARK-18613: --- I can after 2.1 QA, but feel free to go ahead if

[jira] [Resolved] (SPARK-18319) ML, Graph 2.1 QA: API: Experimental, DeveloperApi, final, sealed audit

2016-11-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-18319. --- Resolution: Fixed Fix Version/s: 2.2.0 2.1.1 Issue

[jira] [Assigned] (SPARK-18145) Update documentation for hive partition management in 2.1

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18145: Assignee: (was: Apache Spark) > Update documentation for hive partition management in

[jira] [Assigned] (SPARK-18145) Update documentation for hive partition management in 2.1

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18145: Assignee: Apache Spark > Update documentation for hive partition management in 2.1 >

[jira] [Commented] (SPARK-18145) Update documentation for hive partition management in 2.1

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707288#comment-15707288 ] Apache Spark commented on SPARK-18145: -- User 'ericl' has created a pull request for this issue:

[jira] [Commented] (SPARK-18640) Fix minor synchronization issue in TaskSchedulerImpl.runningTasksByExecutors

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707265#comment-15707265 ] Apache Spark commented on SPARK-18640: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Created] (SPARK-18640) Fix minor synchronization issue in TaskSchedulerImpl.runningTasksByExecutors

2016-11-29 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-18640: -- Summary: Fix minor synchronization issue in TaskSchedulerImpl.runningTasksByExecutors Key: SPARK-18640 URL: https://issues.apache.org/jira/browse/SPARK-18640 Project:

[jira] [Assigned] (SPARK-18640) Fix minor synchronization issue in TaskSchedulerImpl.runningTasksByExecutors

2016-11-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-18640: -- Assignee: Josh Rosen > Fix minor synchronization issue in

[jira] [Assigned] (SPARK-18639) Build only a single pip package

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18639: Assignee: Reynold Xin (was: Apache Spark) > Build only a single pip package >

[jira] [Commented] (SPARK-18639) Build only a single pip package

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707258#comment-15707258 ] Apache Spark commented on SPARK-18639: -- User 'rxin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18639) Build only a single pip package

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18639: Assignee: Apache Spark (was: Reynold Xin) > Build only a single pip package >

[jira] [Created] (SPARK-18639) Build only a single pip package

2016-11-29 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-18639: --- Summary: Build only a single pip package Key: SPARK-18639 URL: https://issues.apache.org/jira/browse/SPARK-18639 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-18635) Partition name/values not escaped correctly in some cases

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18635: Assignee: Apache Spark > Partition name/values not escaped correctly in some cases >

[jira] [Assigned] (SPARK-18635) Partition name/values not escaped correctly in some cases

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18635: Assignee: (was: Apache Spark) > Partition name/values not escaped correctly in some

[jira] [Commented] (SPARK-18635) Partition name/values not escaped correctly in some cases

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707211#comment-15707211 ] Apache Spark commented on SPARK-18635: -- User 'ericl' has created a pull request for this issue:

[jira] [Commented] (SPARK-14437) Spark using Netty RPC gets wrong address in some setups

2016-11-29 Thread Alex Jiang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707201#comment-15707201 ] Alex Jiang commented on SPARK-14437: [~hogeland] Did you get your issue resolved in 2.0.0? We are

[jira] [Commented] (SPARK-18502) Spark does not handle columns that contain backquote (`)

2016-11-29 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707196#comment-15707196 ] Takeshi Yamamuro commented on SPARK-18502: -- Currently, AFAIK no. However, the SQL standard

[jira] [Resolved] (SPARK-18516) Separate instantaneous state from progress performance statistics

2016-11-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-18516. -- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15954

[jira] [Commented] (SPARK-18553) Executor loss may cause TaskSetManager to be leaked

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707105#comment-15707105 ] Apache Spark commented on SPARK-18553: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Commented] (SPARK-18638) Upgrade sbt to 0.13.13

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15707063#comment-15707063 ] Apache Spark commented on SPARK-18638: -- User 'weiqingy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18638) Upgrade sbt to 0.13.13

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18638: Assignee: (was: Apache Spark) > Upgrade sbt to 0.13.13 > -- > >

[jira] [Assigned] (SPARK-18638) Upgrade sbt to 0.13.13

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18638: Assignee: Apache Spark > Upgrade sbt to 0.13.13 > -- > >

[jira] [Updated] (SPARK-18553) Executor loss may cause TaskSetManager to be leaked

2016-11-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-18553: --- Fix Version/s: 2.2.0 2.1.0 > Executor loss may cause TaskSetManager to be leaked

[jira] [Created] (SPARK-18638) Upgrade sbt to 0.13.13

2016-11-29 Thread Weiqing Yang (JIRA)
Weiqing Yang created SPARK-18638: Summary: Upgrade sbt to 0.13.13 Key: SPARK-18638 URL: https://issues.apache.org/jira/browse/SPARK-18638 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706961#comment-15706961 ] Zhan Zhang commented on SPARK-18637: [~hvanhovell] It is an annotation. /** * UDFType annotations

[jira] [Comment Edited] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706961#comment-15706961 ] Zhan Zhang edited comment on SPARK-18637 at 11/29/16 11:52 PM: ---

[jira] [Assigned] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18637: Assignee: Apache Spark > Stateful UDF should be considered as nondeterministic >

[jira] [Updated] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhan Zhang updated SPARK-18637: --- Component/s: SQL > Stateful UDF should be considered as nondeterministic >

[jira] [Commented] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706935#comment-15706935 ] Apache Spark commented on SPARK-18637: -- User 'zhzhan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18637: Assignee: (was: Apache Spark) > Stateful UDF should be considered as nondeterministic

[jira] [Comment Edited] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706928#comment-15706928 ] Herman van Hovell edited comment on SPARK-18637 at 11/29/16 11:35 PM:

[jira] [Commented] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706928#comment-15706928 ] Herman van Hovell commented on SPARK-18637: --- {{UDFType}} is a Hive construct, right? > Stateful

[jira] [Resolved] (SPARK-18614) Incorrect predicate pushdown from ExistenceJoin

2016-11-29 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18614. --- Resolution: Fixed Assignee: Nattavut Sutyanyong Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Zhan Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706905#comment-15706905 ] Zhan Zhang commented on SPARK-18637: Here are the comments from UDFType /** * If a UDF stores

[jira] [Created] (SPARK-18637) Stateful UDF should be considered as nondeterministic

2016-11-29 Thread Zhan Zhang (JIRA)
Zhan Zhang created SPARK-18637: -- Summary: Stateful UDF should be considered as nondeterministic Key: SPARK-18637 URL: https://issues.apache.org/jira/browse/SPARK-18637 Project: Spark Issue

[jira] [Resolved] (SPARK-18631) Avoid making data skew worse in ExchangeCoordinator

2016-11-29 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-18631. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16065

[jira] [Created] (SPARK-18636) UnsafeShuffleWriter and DiskBlockObjectWriter do not consider encryption / compression in metrics

2016-11-29 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-18636: -- Summary: UnsafeShuffleWriter and DiskBlockObjectWriter do not consider encryption / compression in metrics Key: SPARK-18636 URL:

[jira] [Commented] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2016-11-29 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706814#comment-15706814 ] Cody Koeninger commented on SPARK-18475: Glad you agree it shouldn't be enabled by default. If

[jira] [Commented] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2016-11-29 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706788#comment-15706788 ] Burak Yavuz commented on SPARK-18475: - I'd be happy to share performance results. You're right, I

[jira] [Commented] (SPARK-16551) Accumulator Examples should demonstrate different use case from UDAFs

2016-11-29 Thread Ruiming Zhou (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706764#comment-15706764 ] Ruiming Zhou commented on SPARK-16551: -- I can look at this issue. > Accumulator Examples should

[jira] [Commented] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2016-11-29 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706758#comment-15706758 ] Cody Koeninger commented on SPARK-18475: Burak hasn't empirically shown that it is of benefit for

[jira] [Reopened] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2016-11-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reopened SPARK-18475: -- > Be able to provide higher parallelization for StructuredStreaming Kafka Source >

[jira] [Commented] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2016-11-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706692#comment-15706692 ] Michael Armbrust commented on SPARK-18475: -- I think that this suggestion was closed prematurely.

[jira] [Commented] (SPARK-17897) not isnotnull is converted to the always false condition isnotnull && not isnotnull

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15706606#comment-15706606 ] Apache Spark commented on SPARK-17897: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Updated] (SPARK-18635) Partition name/values not escaped correctly in some cases

2016-11-29 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18635: --- Target Version/s: 2.1.0 Priority: Critical (was: Major) > Partition name/values not

[jira] [Created] (SPARK-18635) Partition name/values not escaped correctly in some cases

2016-11-29 Thread Eric Liang (JIRA)
Eric Liang created SPARK-18635: -- Summary: Partition name/values not escaped correctly in some cases Key: SPARK-18635 URL: https://issues.apache.org/jira/browse/SPARK-18635 Project: Spark Issue

[jira] [Updated] (SPARK-18545) Verify number of hive client RPCs in PartitionedTablePerfStatsSuite

2016-11-29 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18545: --- Issue Type: Sub-task (was: Test) Parent: SPARK-17861 > Verify number of hive client RPCs in

[jira] [Updated] (SPARK-18507) Major performance regression in SHOW PARTITIONS on partitioned Hive tables

2016-11-29 Thread Eric Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Liang updated SPARK-18507: --- Issue Type: Sub-task (was: Bug) Parent: SPARK-17861 > Major performance regression in SHOW

[jira] [Updated] (SPARK-18429) SQL aggregate function for CountMinSketch

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18429: Issue Type: Sub-task (was: New Feature) Parent: SPARK-16026 > SQL aggregate function for

[jira] [Resolved] (SPARK-18429) SQL aggregate function for CountMinSketch

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18429. - Resolution: Fixed Assignee: Zhenhua Wang Fix Version/s: 2.2.0 > SQL aggregate

[jira] [Updated] (SPARK-18429) SQL aggregate function for CountMinSketch

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18429: Summary: SQL aggregate function for CountMinSketch (was: implement a new Aggregate for

[jira] [Updated] (SPARK-18632) AggregateFunction should not ImplicitCastInputTypes

2016-11-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-18632: Target Version/s: 2.2.0 > AggregateFunction should not ImplicitCastInputTypes >

[jira] [Assigned] (SPARK-18632) AggregateFunction should not ImplicitCastInputTypes

2016-11-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18632: Assignee: Herman van Hovell (was: Apache Spark) > AggregateFunction should not
