[jira] [Created] (SPARK-6735) Provide options to make maximum executor failure count ( which kills the application ) relative to a window duration or disable it.

2015-04-07 Thread Twinkle Sachdeva (JIRA)
Twinkle Sachdeva created SPARK-6735: --- Summary: Provide options to make maximum executor failure count ( which kills the application ) relative to a window duration or disable it. Key: SPARK-6735 URL:

[jira] [Commented] (SPARK-6431) Couldn't find leader offsets exception when creating KafkaDirectStream

2015-04-07 Thread Alberto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482752#comment-14482752 ] Alberto commented on SPARK-6431: I think you're right Cody. I've been having a look at my

[jira] [Comment Edited] (SPARK-6431) Couldn't find leader offsets exception when creating KafkaDirectStream

2015-04-07 Thread Alberto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482752#comment-14482752 ] Alberto edited comment on SPARK-6431 at 4/7/15 7:43 AM: You're

[jira] [Updated] (SPARK-6721) IllegalStateException when connecting to MongoDB using spark-submit

2015-04-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Luis Rodríguez Trejo updated SPARK-6721: Summary: IllegalStateException when connecting to MongoDB using spark-submit (was:

[jira] [Created] (SPARK-6736) Example of Graph#aggregateMessages has error

2015-04-07 Thread Sasaki Toru (JIRA)
Sasaki Toru created SPARK-6736: -- Summary: Example of Graph#aggregateMessages has error Key: SPARK-6736 URL: https://issues.apache.org/jira/browse/SPARK-6736 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6708) Using Hive UDTF may throw ClassNotFoundException

2015-04-07 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482805#comment-14482805 ] Cheng Lian commented on SPARK-6708: --- Thanks for pointing out this. I followed SPARK-4854

[jira] [Updated] (SPARK-4811) Custom UDTFs not working in Spark SQL

2015-04-07 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-4811: -- Affects Version/s: 1.2.1 1.3.0 Custom UDTFs not working in Spark SQL

[jira] [Commented] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482813#comment-14482813 ] Apache Spark commented on SPARK-6736: - User 'sasakitoa' has created a pull request for

[jira] [Updated] (SPARK-6737) OutputCommitCoordinator.authorizedCommittersByStage map out of memory

2015-04-07 Thread Tao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Li updated SPARK-6737: -- Description: I am using spark streaming(1.3.1) as a long time running service and out of memory after running

[jira] [Commented] (SPARK-6514) For Kinesis Streaming, use the same region for DynamoDB (KCL checkpoints) as the Kinesis stream itself

2015-04-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482749#comment-14482749 ] Paweł Kopiczko commented on SPARK-6514: --- I named the param `regionName` after the

[jira] [Commented] (SPARK-6721) IllegalStateException when connecting to MongoDB using spark-submit

2015-04-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-6721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482798#comment-14482798 ] Luis Rodríguez Trejo commented on SPARK-6721: - [~sowen] thank you for your

[jira] [Resolved] (SPARK-4811) Custom UDTFs not working in Spark SQL

2015-04-07 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-4811. --- Resolution: Duplicate Although this ticket was opened earlier, I mark this a duplicate of SPARK-6708

[jira] [Updated] (SPARK-6737) OutputCommitCoordinator.authorizedCommittersByStage map out of memory

2015-04-07 Thread Tao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tao Li updated SPARK-6737: -- Description: I am using spark streaming(1.3.1) as a long time running service and out of memory after running

[jira] [Created] (SPARK-6737) OutputCommitCoordinator.authorizedCommittersByStage map out of memory

2015-04-07 Thread Tao Li (JIRA)
Tao Li created SPARK-6737: - Summary: OutputCommitCoordinator.authorizedCommittersByStage map out of memory Key: SPARK-6737 URL: https://issues.apache.org/jira/browse/SPARK-6737 Project: Spark Issue

[jira] [Commented] (SPARK-6695) Add an external iterator: a hadoop-like output collector

2015-04-07 Thread uncleGen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482768#comment-14482768 ] uncleGen commented on SPARK-6695: - [~srowen] Thanks for your patience. Yeah, it is a

[jira] [Commented] (SPARK-4854) Custom UDTF with Lateral View throws ClassNotFound exception in Spark SQL CLI

2015-04-07 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482802#comment-14482802 ] Cheng Lian commented on SPARK-4854: --- [~wanshenghua] Is the XXX in ClassNotFoundException

[jira] [Assigned] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6736: --- Assignee: (was: Apache Spark) [GraphX]Example of Graph#aggregateMessages has error

[jira] [Assigned] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6736: --- Assignee: Apache Spark [GraphX]Example of Graph#aggregateMessages has error

[jira] [Updated] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-6736: --- Summary: [GraphX]Example of Graph#aggregateMessages has error (was: Example of

[jira] [Commented] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482803#comment-14482803 ] Apache Spark commented on SPARK-6736: - User 'sasakitoa' has created a pull request for

[jira] [Updated] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Sasaki Toru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sasaki Toru updated SPARK-6736: --- Component/s: Documentation [GraphX]Example of Graph#aggregateMessages has error

[jira] [Resolved] (SPARK-6716) Change SparkContext.DRIVER_IDENTIFIER from 'driver' to 'driver'

2015-04-07 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6716. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5372

[jira] [Commented] (SPARK-6691) Abstract and add a dynamic RateLimiter for Spark Streaming

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482685#comment-14482685 ] Apache Spark commented on SPARK-6691: - User 'jerryshao' has created a pull request for

[jira] [Resolved] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-04-07 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-6636. --- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Issue resolved by pull request

[jira] [Updated] (SPARK-6636) Use public DNS hostname everywhere in spark_ec2.py

2015-04-07 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-6636: -- Assignee: Matt Aasted Use public DNS hostname everywhere in spark_ec2.py

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-07 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482711#comment-14482711 ] Yu Ishikawa commented on SPARK-6682: I think so. In conclusion, I agree with the

[jira] [Assigned] (SPARK-6691) Abstract and add a dynamic RateLimiter for Spark Streaming

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6691: --- Assignee: (was: Apache Spark) Abstract and add a dynamic RateLimiter for Spark

[jira] [Assigned] (SPARK-6691) Abstract and add a dynamic RateLimiter for Spark Streaming

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6691: --- Assignee: Apache Spark Abstract and add a dynamic RateLimiter for Spark Streaming

[jira] [Resolved] (SPARK-6736) [GraphX]Example of Graph#aggregateMessages has error

2015-04-07 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-6736. --- Resolution: Fixed Issue resolved by pull request 5388 [https://github.com/apache/spark/pull/5388]

[jira] [Assigned] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6739: --- Assignee: Apache Spark Spark SQL Example gives errors due to missing import of Types

[jira] [Commented] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482944#comment-14482944 ] Apache Spark commented on SPARK-6739: - User 'tijoparacka' has created a pull request

[jira] [Assigned] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6739: --- Assignee: (was: Apache Spark) Spark SQL Example gives errors due to missing import of

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-07 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482917#comment-14482917 ] Yu Ishikawa commented on SPARK-6682: Thanks for replying, [~mengxr]. How about

[jira] [Created] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Tijo Thomas (JIRA)
Tijo Thomas created SPARK-6739: -- Summary: Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types Key: SPARK-6739 URL: https://issues.apache.org/jira/browse/SPARK-6739

[jira] [Created] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
Hong Shen created SPARK-6738: Summary: EstimateSize is difference with spill file size Key: SPARK-6738 URL: https://issues.apache.org/jira/browse/SPARK-6738 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-04-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482884#comment-14482884 ] Xiangrui Meng commented on SPARK-6682: -- +1 on deprecating the static train methods

[jira] [Commented] (SPARK-6721) IllegalStateException when connecting to MongoDB using spark-submit

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482950#comment-14482950 ] Sean Owen commented on SPARK-6721: -- From the stack trace, the error is coming from the

[jira] [Resolved] (SPARK-6732) Scala existentials warning during compilation

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6732. -- Resolution: Duplicate Scala existentials warning during compilation

[jira] [Created] (SPARK-6740) SQL operator and condition precedence is not honoured

2015-04-07 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-6740: --- Summary: SQL operator and condition precedence is not honoured Key: SPARK-6740 URL: https://issues.apache.org/jira/browse/SPARK-6740 Project: Spark

[jira] [Resolved] (SPARK-6695) Add an external iterator: a hadoop-like output collector

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6695. -- Resolution: Won't Fix I suppose my problem with that is that it would be duplicating Spark's spill

[jira] [Commented] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482987#comment-14482987 ] Sean Owen commented on SPARK-6738: -- Is that the only file spilled though? I'm not an

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 1100M data to disk: {code} 15/04/07 16:39:48 INFO

[jira] [Updated] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6733: - Priority: Trivial (was: Major) What file are you talking about? Suppression of usage of Scala

[jira] [Commented] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Tijo Thomas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483034#comment-14483034 ] Tijo Thomas commented on SPARK-6739: Please close this duplicate issue Spark SQL

[jira] [Comment Edited] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Tijo Thomas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483034#comment-14483034 ] Tijo Thomas edited comment on SPARK-6739 at 4/7/15 11:26 AM: -

[jira] [Resolved] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6739. -- Resolution: Not A Problem It's because the site hasn't been published again with a next release since

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 1100M data to disk: {code} 15/04/07 16:39:48 INFO

[jira] [Commented] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483011#comment-14483011 ] Hong Shen commented on SPARK-6738: -- Yes, it spill lots of files, but each one has only

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 1100M data to disk: {code} 15/04/07 16:39:48 INFO

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 1100M data to disk: {code} 15/04/07 16:39:48 INFO

[jira] [Commented] (SPARK-6735) Provide options to make maximum executor failure count ( which kills the application ) relative to a window duration or disable it.

2015-04-07 Thread Twinkle Sachdeva (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483018#comment-14483018 ] Twinkle Sachdeva commented on SPARK-6735: - Created a PR here :

[jira] [Commented] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483025#comment-14483025 ] Sean Owen commented on SPARK-6738: -- Do you observe a problem? is it possible that you are

[jira] [Comment Edited] (SPARK-6739) Spark SQL Example gives errors due to missing import of Types org.apache.spark.sql.types

2015-04-07 Thread Tijo Thomas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483034#comment-14483034 ] Tijo Thomas edited comment on SPARK-6739 at 4/7/15 11:25 AM: -

[jira] [Commented] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-07 Thread Raymond Tay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483036#comment-14483036 ] Raymond Tay commented on SPARK-6733: Apologize. It's DAGScheduler.scala Suppression

[jira] [Resolved] (SPARK-6420) Driver's Block Manager does not use spark.driver.host in Yarn-Client mode

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-6420. -- Resolution: Duplicate Driver's Block Manager does not use spark.driver.host in Yarn-Client mode

[jira] [Created] (SPARK-6741) Add support for SELECT ALL syntax

2015-04-07 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-6741: --- Summary: Add support for SELECT ALL syntax Key: SPARK-6741 URL: https://issues.apache.org/jira/browse/SPARK-6741 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6742) Spark pushes down filters in old parquet path that reference partitioning columns

2015-04-07 Thread Yash Datta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yash Datta updated SPARK-6742: -- This is same as SPARK-6554 for new parquet path Spark pushes down filters in old parquet path that

[jira] [Created] (SPARK-6742) Spark pushes down filters in old parquet path that reference partitioning columns

2015-04-07 Thread Yash Datta (JIRA)
Yash Datta created SPARK-6742: - Summary: Spark pushes down filters in old parquet path that reference partitioning columns Key: SPARK-6742 URL: https://issues.apache.org/jira/browse/SPARK-6742 Project:

[jira] [Assigned] (SPARK-6742) Spark pushes down filters in old parquet path that reference partitioning columns

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6742: --- Assignee: Apache Spark Spark pushes down filters in old parquet path that reference

[jira] [Assigned] (SPARK-6742) Spark pushes down filters in old parquet path that reference partitioning columns

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6742: --- Assignee: (was: Apache Spark) Spark pushes down filters in old parquet path that

[jira] [Commented] (SPARK-6742) Spark pushes down filters in old parquet path that reference partitioning columns

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483095#comment-14483095 ] Apache Spark commented on SPARK-6742: - User 'saucam' has created a pull request for

[jira] [Updated] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hong Shen updated SPARK-6738: - Description: ExternalAppendOnlyMap spill 2.2 GB data to disk: {code} 15/04/07 20:27:37 INFO

[jira] [Commented] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483103#comment-14483103 ] Sean Owen commented on SPARK-6738: -- To be clear I am asking how big the data being

[jira] [Commented] (SPARK-6738) EstimateSize is difference with spill file size

2015-04-07 Thread Hong Shen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483104#comment-14483104 ] Hong Shen commented on SPARK-6738: -- I don't think it's serialized cause the problem. the

[jira] [Commented] (SPARK-6612) Python KMeans parity

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483109#comment-14483109 ] Apache Spark commented on SPARK-6612: - User 'FlytxtRnD' has created a pull request for

[jira] [Assigned] (SPARK-6612) Python KMeans parity

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6612: --- Assignee: Hrishikesh (was: Apache Spark) Python KMeans parity

[jira] [Assigned] (SPARK-6612) Python KMeans parity

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6612: --- Assignee: Apache Spark (was: Hrishikesh) Python KMeans parity

[jira] [Updated] (SPARK-6743) Join with empty projection on one side produces invalid results

2015-04-07 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Santiago M. Mola updated SPARK-6743: Priority: Critical (was: Major) Join with empty projection on one side produces invalid

[jira] [Created] (SPARK-6743) Join with empty projection on one side produces invalid results

2015-04-07 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-6743: --- Summary: Join with empty projection on one side produces invalid results Key: SPARK-6743 URL: https://issues.apache.org/jira/browse/SPARK-6743 Project: Spark

[jira] [Comment Edited] (SPARK-3276) Provide a API to specify MIN_REMEMBER_DURATION for files to consider as input in streaming

2015-04-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483174#comment-14483174 ] Emre Sevinç edited comment on SPARK-3276 at 4/7/15 2:36 PM:

[jira] [Resolved] (SPARK-3591) Provide fire and forget option for YARN cluster mode

2015-04-07 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves resolved SPARK-3591. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Tao Wang Target

[jira] [Commented] (SPARK-6602) Replace direct use of Akka with Spark RPC interface

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483275#comment-14483275 ] Apache Spark commented on SPARK-6602: - User 'zsxwing' has created a pull request for

[jira] [Commented] (SPARK-3276) Provide a API to specify MIN_REMEMBER_DURATION for files to consider as input in streaming

2015-04-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-3276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483174#comment-14483174 ] Emre Sevinç commented on SPARK-3276: [~srowen] would it be fine if I added a public

[jira] [Created] (SPARK-6744) Add support for CROSS JOIN syntax

2015-04-07 Thread Santiago M. Mola (JIRA)
Santiago M. Mola created SPARK-6744: --- Summary: Add support for CROSS JOIN syntax Key: SPARK-6744 URL: https://issues.apache.org/jira/browse/SPARK-6744 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5242) ec2/spark_ec2.py lauch does not work with VPC if no public DNS or IP is available

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483164#comment-14483164 ] Apache Spark commented on SPARK-5242: - User 'mdagost' has created a pull request for

[jira] [Updated] (SPARK-6747) Support List as a return type in Hive UDF

2015-04-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-6747: Summary: Support List as a return type in Hive UDF (was: Support List as a return type in

[jira] [Created] (SPARK-6746) Refactor large functions in DAGScheduler to improve readibility

2015-04-07 Thread Ilya Ganelin (JIRA)
Ilya Ganelin created SPARK-6746: --- Summary: Refactor large functions in DAGScheduler to improve readibility Key: SPARK-6746 URL: https://issues.apache.org/jira/browse/SPARK-6746 Project: Spark

[jira] [Updated] (SPARK-6747) Support List as a return type in Hive UDF

2015-04-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-6747: Description: The current implementation can't handle List as a return type in Hive UDF. We

[jira] [Updated] (SPARK-6747) Support List as a return type in Hive UDF

2015-04-07 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-6747: Description: The current implementation can't handle List as a return type in Hive UDF. We

[jira] [Created] (SPARK-6745) Develop a general filter function to be used in PrunedFilteredScan and CatalystScan

2015-04-07 Thread Alex Liu (JIRA)
Alex Liu created SPARK-6745: --- Summary: Develop a general filter function to be used in PrunedFilteredScan and CatalystScan Key: SPARK-6745 URL: https://issues.apache.org/jira/browse/SPARK-6745 Project:

[jira] [Created] (SPARK-6747) Support List as a return type in Hive UDF

2015-04-07 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-6747: --- Summary: Support List as a return type in Hive UDF Key: SPARK-6747 URL: https://issues.apache.org/jira/browse/SPARK-6747 Project: Spark Issue Type:

[jira] [Commented] (SPARK-5114) Should Evaluator be a PipelineStage

2015-04-07 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483335#comment-14483335 ] Peter Rudenko commented on SPARK-5114: -- +1 for should. For my use case (create

[jira] [Assigned] (SPARK-6746) Refactor large functions in DAGScheduler to improve readibility

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6746: --- Assignee: Apache Spark Refactor large functions in DAGScheduler to improve readibility

[jira] [Assigned] (SPARK-5818) unable to use add jar in hql

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5818: --- Assignee: (was: Apache Spark) unable to use add jar in hql

[jira] [Commented] (SPARK-5818) unable to use add jar in hql

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483376#comment-14483376 ] Apache Spark commented on SPARK-5818: - User 'gvramana' has created a pull request for

[jira] [Assigned] (SPARK-5818) unable to use add jar in hql

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5818: --- Assignee: Apache Spark unable to use add jar in hql --

[jira] [Assigned] (SPARK-6746) Refactor large functions in DAGScheduler to improve readibility

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6746: --- Assignee: (was: Apache Spark) Refactor large functions in DAGScheduler to improve

[jira] [Commented] (SPARK-6746) Refactor large functions in DAGScheduler to improve readibility

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483488#comment-14483488 ] Apache Spark commented on SPARK-6746: - User 'ilganeli' has created a pull request for

[jira] [Assigned] (SPARK-6747) Support List as a return type in Hive UDF

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6747: --- Assignee: Apache Spark Support List as a return type in Hive UDF

[jira] [Assigned] (SPARK-6750) Upgrade ScalaStyle to 0.7

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6750: --- Assignee: Apache Spark (was: Reynold Xin) Upgrade ScalaStyle to 0.7

[jira] [Assigned] (SPARK-6750) Upgrade ScalaStyle to 0.7

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6750: --- Assignee: Reynold Xin (was: Apache Spark) Upgrade ScalaStyle to 0.7

[jira] [Comment Edited] (SPARK-3219) K-Means clusterer should support Bregman distance functions

2015-04-07 Thread Sai Nishanth Parepally (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14482297#comment-14482297 ] Sai Nishanth Parepally edited comment on SPARK-3219 at 4/7/15 6:10 PM:

[jira] [Created] (SPARK-6751) Spark History Server support multiple application attempts

2015-04-07 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-6751: Summary: Spark History Server support multiple application attempts Key: SPARK-6751 URL: https://issues.apache.org/jira/browse/SPARK-6751 Project: Spark

[jira] [Resolved] (SPARK-6733) Suppression of usage of Scala existential code should be done

2015-04-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-6733. Resolution: Fixed Fix Version/s: 1.4.0 Suppression of usage of Scala existential code

[jira] [Commented] (SPARK-6750) Upgrade ScalaStyle to 0.7

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483642#comment-14483642 ] Apache Spark commented on SPARK-6750: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-6746) Refactor large functions in DAGScheduler to improve readibility

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483547#comment-14483547 ] Apache Spark commented on SPARK-6746: - User 'ilganeli' has created a pull request for

[jira] [Created] (SPARK-6748) QueryPlan.schema should be a lazy val to avoid creating excessive duplicate StructType objects

2015-04-07 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-6748: - Summary: QueryPlan.schema should be a lazy val to avoid creating excessive duplicate StructType objects Key: SPARK-6748 URL: https://issues.apache.org/jira/browse/SPARK-6748

[jira] [Commented] (SPARK-6737) OutputCommitCoordinator.authorizedCommittersByStage map out of memory

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483609#comment-14483609 ] Apache Spark commented on SPARK-6737: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-6737) OutputCommitCoordinator.authorizedCommittersByStage map out of memory

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6737: --- Assignee: Apache Spark (was: Josh Rosen)

[jira] [Assigned] (SPARK-6737) OutputCommitCoordinator.authorizedCommittersByStage map out of memory

2015-04-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6737: --- Assignee: Josh Rosen (was: Apache Spark)

  1   2   >