[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Christian Zorneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682855#comment-15682855 ] Christian Zorneck commented on SPARK-18134: --- I also do not see why this feature

[jira] [Commented] (SPARK-18521) Add `NoRedundantStringInterpolator` Scala rule

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682866#comment-15682866 ] Apache Spark commented on SPARK-18521: -- User 'weiqingy' has created a pull request f

[jira] [Assigned] (SPARK-18521) Add `NoRedundantStringInterpolator` Scala rule

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18521: Assignee: (was: Apache Spark) > Add `NoRedundantStringInterpolator` Scala rule > -

[jira] [Assigned] (SPARK-18521) Add `NoRedundantStringInterpolator` Scala rule

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18521: Assignee: Apache Spark > Add `NoRedundantStringInterpolator` Scala rule >

[jira] [Commented] (SPARK-18249) StackOverflowError when saving dataset to parquet

2016-11-21 Thread Damian Momot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682893#comment-15682893 ] Damian Momot commented on SPARK-18249: -- Hi, nope it's simple case class, exactly thi

[jira] [Comment Edited] (SPARK-18249) StackOverflowError when saving dataset to parquet

2016-11-21 Thread Damian Momot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682893#comment-15682893 ] Damian Momot edited comment on SPARK-18249 at 11/21/16 8:35 AM: ---

[jira] [Comment Edited] (SPARK-18249) StackOverflowError when saving dataset to parquet

2016-11-21 Thread Damian Momot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682893#comment-15682893 ] Damian Momot edited comment on SPARK-18249 at 11/21/16 8:37 AM: ---

[jira] [Commented] (SPARK-18475) Be able to provide higher parallelization for StructuredStreaming Kafka Source

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682903#comment-15682903 ] Sean Owen commented on SPARK-18475: --- I tend to agree that this is the wrong way to addr

[jira] [Resolved] (SPARK-16377) Spark MLlib: MultilayerPerceptronClassifier - error while training

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16377. --- Resolution: Cannot Reproduce > Spark MLlib: MultilayerPerceptronClassifier - error while training > -

[jira] [Commented] (SPARK-18004) DataFrame filter Predicate push-down fails for Oracle Timestamp type columns

2016-11-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15682989#comment-15682989 ] Takeshi Yamamuro commented on SPARK-18004: -- The current spark Jdbc interface (Jd

[jira] [Created] (SPARK-18523) OOM killer may leave SparkContext in broken state causing Connection Refused errors

2016-11-21 Thread Alexander Shorin (JIRA)
Alexander Shorin created SPARK-18523: Summary: OOM killer may leave SparkContext in broken state causing Connection Refused errors Key: SPARK-18523 URL: https://issues.apache.org/jira/browse/SPARK-18523

[jira] [Created] (SPARK-18524) Cannot create dataframe on jdbc data source from spark 2.0.2

2016-11-21 Thread Som K (JIRA)
Som K created SPARK-18524: - Summary: Cannot create dataframe on jdbc data source from spark 2.0.2 Key: SPARK-18524 URL: https://issues.apache.org/jira/browse/SPARK-18524 Project: Spark Issue Type:

[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Christian Zorneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683172#comment-15683172 ] Christian Zorneck commented on SPARK-18134: --- I can't use arrays of structs. I h

[jira] [Comment Edited] (SPARK-16377) Spark MLlib: MultilayerPerceptronClassifier - error while training

2016-11-21 Thread Mikhail Shiryaev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683194#comment-15683194 ] Mikhail Shiryaev edited comment on SPARK-16377 at 11/21/16 10:55 AM: --

[jira] [Commented] (SPARK-16377) Spark MLlib: MultilayerPerceptronClassifier - error while training

2016-11-21 Thread Mikhail Shiryaev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683194#comment-15683194 ] Mikhail Shiryaev commented on SPARK-16377: -- Yes, you can close it. The original

[jira] [Created] (SPARK-18525) Kafka DirectInputStream cannot be aware of new partition

2016-11-21 Thread Zhiwen Sun (JIRA)
Zhiwen Sun created SPARK-18525: -- Summary: Kafka DirectInputStream cannot be aware of new partition Key: SPARK-18525 URL: https://issues.apache.org/jira/browse/SPARK-18525 Project: Spark Issue Ty

[jira] [Assigned] (SPARK-18523) OOM killer may leave SparkContext in broken state causing Connection Refused errors

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18523: Assignee: (was: Apache Spark) > OOM killer may leave SparkContext in broken state caus

[jira] [Commented] (SPARK-18523) OOM killer may leave SparkContext in broken state causing Connection Refused errors

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683200#comment-15683200 ] Apache Spark commented on SPARK-18523: -- User 'kxepal' has created a pull request for

[jira] [Assigned] (SPARK-18523) OOM killer may leave SparkContext in broken state causing Connection Refused errors

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18523: Assignee: Apache Spark > OOM killer may leave SparkContext in broken state causing Connect

[jira] [Created] (SPARK-18526) [Kafka] The property max.poll.records is set to a very low non overridable default.

2016-11-21 Thread Prashant Sharma (JIRA)
Prashant Sharma created SPARK-18526: --- Summary: [Kafka] The property max.poll.records is set to a very low non overridable default. Key: SPARK-18526 URL: https://issues.apache.org/jira/browse/SPARK-18526

[jira] [Commented] (SPARK-18526) [Kafka] The property max.poll.records is set to a very low non overridable default.

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683382#comment-15683382 ] Apache Spark commented on SPARK-18526: -- User 'ScrapCodes' has created a pull request

[jira] [Assigned] (SPARK-18526) [Kafka] The property max.poll.records is set to a very low non overridable default.

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18526: Assignee: Apache Spark > [Kafka] The property max.poll.records is set to a very low non ov

[jira] [Assigned] (SPARK-18526) [Kafka] The property max.poll.records is set to a very low non overridable default.

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18526: Assignee: (was: Apache Spark) > [Kafka] The property max.poll.records is set to a very

[jira] [Commented] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683399#comment-15683399 ] Sean Owen commented on SPARK-18073: --- For anyone following along, I'm going to merge the

[jira] [Resolved] (SPARK-18524) Cannot create dataframe on jdbc data source from spark 2.0.2

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18524. --- Resolution: Not A Problem This is a Hadoop + Windows env problem. It looks like you don't have winut

[jira] [Updated] (SPARK-18520) Add missing setXXXCol methods for BisectingKMeansModel and GaussianMixtureModel

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18520: -- Priority: Minor (was: Major) > Add missing setXXXCol methods for BisectingKMeansModel and > GaussianM

[jira] [Resolved] (SPARK-18398) Fix nullabilities of MapObjects and optimize not to check null if lambda is not nullable.

2016-11-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18398. --- Resolution: Fixed Assignee: Takuya Ueshin Fix Version/s: 2.1.0 > Fix

[jira] [Resolved] (SPARK-18413) Add a property to control the number of partitions when save a jdbc rdd

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18413. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 15868 [https://github.co

[jira] [Updated] (SPARK-18413) Add a property to control the number of partitions when save a jdbc rdd

2016-11-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18413: -- Assignee: Dongjoon Hyun Priority: Minor (was: Major) Issue Type: Improvement (was: Wish)

[jira] [Commented] (SPARK-18471) In treeAggregate, generate (big) zeros instead of sending them.

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683636#comment-15683636 ] Apache Spark commented on SPARK-18471: -- User 'AnthonyTruchet' has created a pull req

[jira] [Commented] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683669#comment-15683669 ] Apache Spark commented on SPARK-18356: -- User 'ZakariaHili' has created a pull reques

[jira] [Assigned] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18356: Assignee: (was: Apache Spark) > Issue + Resolution: Kmeans Spark Performances (ML pack

[jira] [Assigned] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18356: Assignee: Apache Spark > Issue + Resolution: Kmeans Spark Performances (ML package) >

[jira] [Commented] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683700#comment-15683700 ] Apache Spark commented on SPARK-18356: -- User 'ZakariaHili' has created a pull reques

[jira] [Issue Comment Deleted] (SPARK-18455) General support for subquery processing

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nattavut Sutyanyong updated SPARK-18455: Comment: was deleted (was: Incorrect results problem) > General support for subque

[jira] [Commented] (SPARK-18455) General support for subquery processing

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683798#comment-15683798 ] Nattavut Sutyanyong commented on SPARK-18455: - Incorrect results problem > G

[jira] [Comment Edited] (SPARK-18455) General support for subquery processing

2016-11-21 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15668577#comment-15668577 ] Nattavut Sutyanyong edited comment on SPARK-18455 at 11/21/16 3:02 PM:

[jira] [Created] (SPARK-18527) UDAFPercentile (bigint, array) needs explicit cast to double

2016-11-21 Thread Fabian Boehnlein (JIRA)
Fabian Boehnlein created SPARK-18527: Summary: UDAFPercentile (bigint, array) needs explicit cast to double Key: SPARK-18527 URL: https://issues.apache.org/jira/browse/SPARK-18527 Project: Spark

[jira] [Commented] (SPARK-14222) Cross-publish jackson-module-scala for Scala 2.12

2016-11-21 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15683998#comment-15683998 ] Steve Loughran commented on SPARK-14222: Hadoop 2.9 just went to Jackson 2.7.8; late

[jira] [Assigned] (SPARK-12978) Skip unnecessary final group-by when input data already clustered with group-by keys

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12978: Assignee: Apache Spark (was: Takeshi Yamamuro) > Skip unnecessary final group-by when inp

[jira] [Assigned] (SPARK-12978) Skip unnecessary final group-by when input data already clustered with group-by keys

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12978: Assignee: Takeshi Yamamuro (was: Apache Spark) > Skip unnecessary final group-by when inp

[jira] [Created] (SPARK-18528) limit + groupBy leads to java.lang.NullPointerException

2016-11-21 Thread Corey (JIRA)
Corey created SPARK-18528: - Summary: limit + groupBy leads to java.lang.NullPointerException Key: SPARK-18528 URL: https://issues.apache.org/jira/browse/SPARK-18528 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-18529) Timeouts shouldn't be AssertionErrors

2016-11-21 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18529: Summary: Timeouts shouldn't be AssertionErrors Key: SPARK-18529 URL: https://issues.apache.org/jira/browse/SPARK-18529 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-18513) Record and recover watermark

2016-11-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18513: - Target Version/s: 2.1.0 > Record and recover watermark > > >

[jira] [Updated] (SPARK-18513) Record and recover watermark

2016-11-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18513: - Priority: Blocker (was: Major) > Record and recover watermark >

[jira] [Updated] (SPARK-18339) Don't push down current_timestamp for filters in StructuredStreaming

2016-11-21 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-18339: - Priority: Critical (was: Major) > Don't push down current_timestamp for filters in Struc

[jira] [Created] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-18530: Summary: Kafka timestamp should be TimestampType Key: SPARK-18530 URL: https://issues.apache.org/jira/browse/SPARK-18530 Project: Spark Issue Type: B

[jira] [Commented] (SPARK-16532) Provide a REST API for submitting and tracking status of jobs

2016-11-21 Thread Dan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684232#comment-15684232 ] Dan commented on SPARK-16532: - Is there any update on this? Is the existing API supported and

[jira] [Comment Edited] (SPARK-15513) Bzip2Factory in Hadoop 2.7.1 is not thread safe

2016-11-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684302#comment-15684302 ] Yin Huai edited comment on SPARK-15513 at 11/21/16 6:17 PM: I

[jira] [Commented] (SPARK-15513) Bzip2Factory in Hadoop 2.7.1 is not thread safe

2016-11-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684302#comment-15684302 ] Yin Huai commented on SPARK-15513: -- I am closing this jira since the fix has been releas

[jira] [Resolved] (SPARK-15513) Bzip2Factory in Hadoop 2.7.1 is not thread safe

2016-11-21 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-15513. -- Resolution: Won't Fix > Bzip2Factory in Hadoop 2.7.1 is not thread safe > -

[jira] [Resolved] (SPARK-17765) org.apache.spark.mllib.linalg.VectorUDT cannot be cast to org.apache.spark.sql.types.StructType

2016-11-21 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-17765. - Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.1.0 > org.apache.spark.

[jira] [Commented] (SPARK-18515) AlterTableDropPartitions fails for non-string columns

2016-11-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684512#comment-15684512 ] Dongjoon Hyun commented on SPARK-18515: --- This is tightly related with `AlterTableAd

[jira] [Comment Edited] (SPARK-18515) AlterTableDropPartitions fails for non-string columns

2016-11-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684512#comment-15684512 ] Dongjoon Hyun edited comment on SPARK-18515 at 11/21/16 7:38 PM: --

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here: https://gist.github.com/tuxdna/37a69b53e6f9a9442f

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here: https://gist.github.com/tuxdna/37a69b53e6f9a9442f

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here: https://gist.github.com/tuxdna/37a69b53e6f9a9442f

[jira] [Created] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
Saleem Ansari created SPARK-18531: - Summary: Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError Key: SPARK-18531 URL: https://issues.apache.org/jira/browse/SPARK-18531

[jira] [Updated] (SPARK-18531) Apache Spark FPGrowth algorithm implementation fails with java.lang.StackOverflowError

2016-11-21 Thread Saleem Ansari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saleem Ansari updated SPARK-18531: -- Description: More details can be found here: https://gist.github.com/tuxdna/37a69b53e6f9a9442f

[jira] [Commented] (SPARK-18413) Add a property to control the number of partitions when save a jdbc rdd

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684631#comment-15684631 ] Apache Spark commented on SPARK-18413: -- User 'dongjoon-hyun' has created a pull requ

[jira] [Commented] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684659#comment-15684659 ] Cheng Lian commented on SPARK-18403: Here is a minimal test case (add it to {{ObjectH

[jira] [Commented] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684685#comment-15684685 ] Mark Grover commented on SPARK-17850: - Hi [~zsxwing] and [~srowen], the JIRA fix vers

[jira] [Updated] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17850: - Target Version/s: 2.1.0 (was: 2.0.2, 2.1.0) > HadoopRDD should not swallow EOFException > --

[jira] [Commented] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684699#comment-15684699 ] Shixiong Zhu commented on SPARK-17850: -- [~mgrover] you're right. This is only in 2.1

[jira] [Updated] (SPARK-17850) HadoopRDD should not swallow EOFException

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17850: - Fix Version/s: (was: 2.0.2) > HadoopRDD should not swallow EOFException > ---

[jira] [Commented] (SPARK-18529) Timeouts shouldn't be AssertionErrors

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684703#comment-15684703 ] Shixiong Zhu commented on SPARK-18529: -- This will be fixed in https://github.com/apa

[jira] [Updated] (SPARK-18361) Expose RDD localCheckpoint in PySpark

2016-11-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-18361: -- Assignee: Gabriel Huang > Expose RDD localCheckpoint in PySpark > -

[jira] [Resolved] (SPARK-18361) Expose RDD localCheckpoint in PySpark

2016-11-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-18361. --- Resolution: Fixed Fix Version/s: 2.1.0 Target Version/s: 2.1.0 > Expose RDD localChe

[jira] [Assigned] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reassigned SPARK-18530: Assignee: Shixiong Zhu > Kafka timestamp should be TimestampType > ---

[jira] [Assigned] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18073: Assignee: Apache Spark (was: Sean Owen) > Migrate wiki to spark.apache.org web site > ---

[jira] [Commented] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684748#comment-15684748 ] Apache Spark commented on SPARK-18073: -- User 'srowen' has created a pull request for

[jira] [Resolved] (SPARK-18517) DROP TABLE IF EXISTS should not warn for non-existing tables

2016-11-21 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or resolved SPARK-18517. --- Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.1.0 Target Ver

[jira] [Assigned] (SPARK-18073) Migrate wiki to spark.apache.org web site

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18073: Assignee: Sean Owen (was: Apache Spark) > Migrate wiki to spark.apache.org web site > ---

[jira] [Commented] (SPARK-9487) Use the same num. worker threads in Scala/Python unit tests

2016-11-21 Thread Saikat Kanjilal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684758#comment-15684758 ] Saikat Kanjilal commented on SPARK-9487: [~srowen] following up, thoughts on how t

[jira] [Created] (SPARK-18532) Code generation memory issue

2016-11-21 Thread Georg Heiler (JIRA)
Georg Heiler created SPARK-18532: Summary: Code generation memory issue Key: SPARK-18532 URL: https://issues.apache.org/jira/browse/SPARK-18532 Project: Spark Issue Type: Bug Compon

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-11-21 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684844#comment-15684844 ] Heji Kim commented on SPARK-18506: -- Firstly thank you Cody for the quick response. Our i

[jira] [Commented] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-11-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684863#comment-15684863 ] Shixiong Zhu commented on SPARK-18512: -- Did you enable speculation? > FileNotFoundE

[jira] [Commented] (SPARK-18532) Code generation memory issue

2016-11-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684908#comment-15684908 ] Herman van Hovell commented on SPARK-18532: --- The code generated by whole stage

[jira] [Commented] (SPARK-18403) ObjectHashAggregateSuite is being flaky (occasional OOM errors)

2016-11-21 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684928#comment-15684928 ] Herman van Hovell commented on SPARK-18403: --- The 5a5a5a5a5a5a means that the pa

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-11-21 Thread Heji Kim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15684995#comment-15684995 ] Heji Kim commented on SPARK-18506: -- Just confirming that when I use ConsumerStrategy.Ass

[jira] [Created] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-18533: Summary: Raise correct error upon specification of schema for datasource tables created through CTAS Key: SPARK-18533 URL: https://issues.apache.org/jira/browse/SPARK-18533

[jira] [Commented] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685018#comment-15685018 ] Apache Spark commented on SPARK-18533: -- User 'dilipbiswal' has created a pull reques

[jira] [Assigned] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18533: Assignee: (was: Apache Spark) > Raise correct error upon specification of schema for d

[jira] [Assigned] (SPARK-18533) Raise correct error upon specification of schema for datasource tables created through CTAS

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18533: Assignee: Apache Spark > Raise correct error upon specification of schema for datasource t

[jira] [Assigned] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18530: Assignee: Apache Spark (was: Shixiong Zhu) > Kafka timestamp should be TimestampType > --

[jira] [Assigned] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18530: Assignee: Shixiong Zhu (was: Apache Spark) > Kafka timestamp should be TimestampType > --

[jira] [Commented] (SPARK-18530) Kafka timestamp should be TimestampType

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685040#comment-15685040 ] Apache Spark commented on SPARK-18530: -- User 'zsxwing' has created a pull request fo

[jira] [Created] (SPARK-18534) Datasets Aggregation with Maps

2016-11-21 Thread Anton Okolnychyi (JIRA)
Anton Okolnychyi created SPARK-18534: Summary: Datasets Aggregation with Maps Key: SPARK-18534 URL: https://issues.apache.org/jira/browse/SPARK-18534 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685164#comment-15685164 ] Apache Spark commented on SPARK-18134: -- User 'hvanhovell' has created a pull request

[jira] [Assigned] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18134: Assignee: (was: Apache Spark) > SQL: MapType in Group BY and Joins not working > -

[jira] [Assigned] (SPARK-18134) SQL: MapType in Group BY and Joins not working

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18134: Assignee: Apache Spark > SQL: MapType in Group BY and Joins not working >

[jira] [Commented] (SPARK-18506) kafka 0.10 with Spark 2.02 auto.offset.reset=earliest will only read from a single partition on a multi partition topic

2016-11-21 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685190#comment-15685190 ] Cody Koeninger commented on SPARK-18506: I'd try to isolate aws vs gce as a possi

[jira] [Created] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
Mark Grover created SPARK-18535: --- Summary: Redact sensitive information from Spark logs and UI Key: SPARK-18535 URL: https://issues.apache.org/jira/browse/SPARK-18535 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18535: Assignee: Apache Spark > Redact sensitive information from Spark logs and UI > ---

[jira] [Assigned] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18535: Assignee: (was: Apache Spark) > Redact sensitive information from Spark logs and UI >

[jira] [Commented] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685222#comment-15685222 ] Apache Spark commented on SPARK-18535: -- User 'markgrover' has created a pull request

[jira] [Updated] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Grover updated SPARK-18535: Attachment: redacted.png > Redact sensitive information from Spark logs and UI > --

[jira] [Commented] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685225#comment-15685225 ] Mark Grover commented on SPARK-18535: - I just issued a PR for this, that adds a new c

[jira] [Resolved] (SPARK-18282) Add model summaries for Python GMM and BisectingKMeans

2016-11-21 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-18282. - Resolution: Fixed Fix Version/s: 2.1.0 > Add model summaries for Python GMM and BisectingK

[jira] [Comment Edited] (SPARK-18535) Redact sensitive information from Spark logs and UI

2016-11-21 Thread Mark Grover (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15685225#comment-15685225 ] Mark Grover edited comment on SPARK-18535 at 11/22/16 12:36 AM: ---
