[jira] [Resolved] (SPARK-4791) Create SchemaRDD from case classes with multiple constructors

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4791. - Resolution: Fixed Assignee: Joseph K. Bradley > Create SchemaRDD from case classes w

[jira] [Commented] (SPARK-4827) Max iterations (100) reached for batch Resolution with deeply nested projects and project *s

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242271#comment-14242271 ] Apache Spark commented on SPARK-4827: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-4827) Max iterations (100) reached for batch Resolution with deeply nested projects and project *s

2014-12-10 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-4827: --- Summary: Max iterations (100) reached for batch Resolution with deeply nested projects and project *s Key: SPARK-4827 URL: https://issues.apache.org/jira/browse/SPARK-4827

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 edited comment on SPARK-4817 at 12/11/14 7:15 AM: -- [~srowen] Alwa

[jira] [Commented] (SPARK-4825) CTAS fails to resolve when created using saveAsTable

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242259#comment-14242259 ] Apache Spark commented on SPARK-4825: - User 'chenghao-intel' has created a pull reques

[jira] [Commented] (SPARK-4790) Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch allocation, and cleanup with write ahead log"

2014-12-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242250#comment-14242250 ] Josh Rosen commented on SPARK-4790: --- Actually, scratch that theory: here's a failure in

[jira] [Commented] (SPARK-4826) Possible flaky tests in WriteAheadLogBackedBlockRDDSuite: "java.lang.IllegalStateException: File exists and there is no append support!"

2014-12-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242248#comment-14242248 ] Josh Rosen commented on SPARK-4826: --- Actually, it turns out that this has happened twice

[jira] [Created] (SPARK-4826) Possible flaky tests in WriteAheadLogBackedBlockRDDSuite: "java.lang.IllegalStateException: File exists and there is no append support!"

2014-12-10 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-4826: - Summary: Possible flaky tests in WriteAheadLogBackedBlockRDDSuite: "java.lang.IllegalStateException: File exists and there is no append support!" Key: SPARK-4826 URL: https://issues.apa

[jira] [Updated] (SPARK-1600) flaky test case in streaming.CheckpointSuite

2014-12-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-1600: -- Labels: flaky-test (was: ) > flaky test case in streaming.CheckpointSuite > ---

[jira] [Updated] (SPARK-4790) Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch allocation, and cleanup with write ahead log"

2014-12-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4790: -- Labels: flaky-test (was: ) > Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch

[jira] [Commented] (SPARK-4790) Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch allocation, and cleanup with write ahead log"

2014-12-10 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242243#comment-14242243 ] Josh Rosen commented on SPARK-4790: --- Curiously, it looks like this test hasn't failed re

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 edited comment on SPARK-4817 at 12/11/14 7:03 AM: -- [~srowen] Alwa

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 edited comment on SPARK-4817 at 12/11/14 6:59 AM: -- [~srowen] Alwa

[jira] [Commented] (SPARK-4700) Add Http support to Spark Thrift server

2014-12-10 Thread Judy Nash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242218#comment-14242218 ] Judy Nash commented on SPARK-4700: -- pull request created at https://github.com/apache/spa

[jira] [Commented] (SPARK-4700) Add Http support to Spark Thrift server

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242219#comment-14242219 ] Apache Spark commented on SPARK-4700: - User 'judynash' has created a pull request for

[jira] [Updated] (SPARK-4700) Add Http support to Spark Thrift server

2014-12-10 Thread Judy Nash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Judy Nash updated SPARK-4700: - Affects Version/s: 1.3.0 > Add Http support to Spark Thrift server > -

[jira] [Updated] (SPARK-4720) Remainder should also return null if the divider is 0.

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4720: Target Version/s: 1.3.0 (was: 1.2.0) > Remainder should also return null if the divider is

[jira] [Updated] (SPARK-4742) The name of Parquet File generated by AppendingParquetOutputFormat should be zero padded

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4742: Target Version/s: 1.3.0 (was: 1.2.0) > The name of Parquet File generated by AppendingParqu

[jira] [Resolved] (SPARK-4554) Set fair scheduler pool for JDBC client session in hive 13

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4554. - Resolution: Duplicate > Set fair scheduler pool for JDBC client session in hive 13 > -

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 edited comment on SPARK-4817 at 12/11/14 6:19 AM: -- [~srowen] Alwa

[jira] [Updated] (SPARK-4554) Set fair scheduler pool for JDBC client session in hive 13

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4554: Priority: Critical (was: Major) > Set fair scheduler pool for JDBC client session in hive 1

[jira] [Updated] (SPARK-3575) Hive Schema is ignored when using convertMetastoreParquet

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3575: Target Version/s: 1.3.0 (was: 1.2.0) > Hive Schema is ignored when using convertMetastorePa

[jira] [Updated] (SPARK-3860) Improve dimension joins

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3860: Target Version/s: 1.3.0 (was: 1.2.0) > Improve dimension joins > --- >

[jira] [Updated] (SPARK-4699) Make caseSensitive configurable in Analyzer.scala

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4699: Target Version/s: 1.3.0 (was: 1.2.0) > Make caseSensitive configurable in Analyzer.scala >

[jira] [Updated] (SPARK-4811) Custom UDTFs not working in Spark SQL

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4811: Fix Version/s: (was: 1.2.0) > Custom UDTFs not working in Spark SQL > --

[jira] [Resolved] (SPARK-3262) CREATE VIEW is not supported but the error message is not clear

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3262. - Resolution: Duplicate Fix Version/s: 1.2.0 > CREATE VIEW is not supported but the e

[jira] [Commented] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread Yi Tian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242193#comment-14242193 ] Yi Tian commented on SPARK-4817: Hi, [~sowen] I think the main idea of [~surq] is * For t

[jira] [Created] (SPARK-4825) CTAS fails to resolve when created using saveAsTable

2014-12-10 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-4825: --- Summary: CTAS fails to resolve when created using saveAsTable Key: SPARK-4825 URL: https://issues.apache.org/jira/browse/SPARK-4825 Project: Spark Issu

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 edited comment on SPARK-4817 at 12/11/14 5:57 AM: -- [~srowen] Alwa

[jira] [Comment Edited] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 edited comment on SPARK-4817 at 12/11/14 5:35 AM: -- [~srowen] Alwa

[jira] [Commented] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242177#comment-14242177 ] 宿荣全 commented on SPARK-4817: Always call foreachRDD, and operate on all of the RDD, and then c

[jira] [Updated] (SPARK-4700) Add Http support to Spark Thrift server

2014-12-10 Thread Judy Nash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Judy Nash updated SPARK-4700: - Description: Currently thrift only supports TCP connection. The JIRA is to add HTTP support to spark thr

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242101#comment-14242101 ] Reynold Xin commented on SPARK-4740: I can't really think of a reason why the Netty on

[jira] [Commented] (SPARK-4817) [streaming]Print the specified number of data and handle all of the elements in RDD

2014-12-10 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-4817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242090#comment-14242090 ] 宿荣全 commented on SPARK-4817: [~srowen] ' Neither prints the "top" elements. Did you mean "fir

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-10 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242086#comment-14242086 ] Xuefu Zhang commented on SPARK-4687: I concur with [~sandyr]'s account for the need of

[jira] [Commented] (SPARK-4818) Join operation should use iterator/lazy evaluation

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242064#comment-14242064 ] Apache Spark commented on SPARK-4818: - User 'zsxwing' has created a pull request for t

[jira] [Closed] (SPARK-4824) Join should use `Iterator` rather than `Iterable`

2014-12-10 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu closed SPARK-4824. --- Resolution: Duplicate > Join should use `Iterator` rather than `Iterable` > --

[jira] [Commented] (SPARK-4824) Join should use `Iterator` rather than `Iterable`

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242056#comment-14242056 ] Apache Spark commented on SPARK-4824: - User 'zsxwing' has created a pull request for t

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-10 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242055#comment-14242055 ] Sandy Ryza commented on SPARK-4687: --- I think [~xuefuz] can probably motivate this better

[jira] [Created] (SPARK-4824) Join should use `Iterator` rather than `Iterable`

2014-12-10 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4824: --- Summary: Join should use `Iterator` rather than `Iterable` Key: SPARK-4824 URL: https://issues.apache.org/jira/browse/SPARK-4824 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4675) Find similar products and similar users in MatrixFactorizationModel

2014-12-10 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242034#comment-14242034 ] Debasish Das commented on SPARK-4675: - [~josephkb] how do we validate that low dimensi

[jira] [Commented] (SPARK-4823) rowSimilarities

2014-12-10 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242031#comment-14242031 ] Debasish Das commented on SPARK-4823: - I am considering coming up with a baseline vers

[jira] [Commented] (SPARK-4814) Enable assertions in SBT, Maven tests / AssertionError from Hive's LazyBinaryInteger

2014-12-10 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14242030#comment-14242030 ] Michael Armbrust commented on SPARK-4814: - Hive throws a lot of warning here for s

[jira] [Created] (SPARK-4823) rowSimilarities

2014-12-10 Thread Reza Zadeh (JIRA)
Reza Zadeh created SPARK-4823: - Summary: rowSimilarities Key: SPARK-4823 URL: https://issues.apache.org/jira/browse/SPARK-4823 Project: Spark Issue Type: Improvement Components: MLlib

[jira] [Commented] (SPARK-4814) Enable assertions in SBT, Maven tests / AssertionError from Hive's LazyBinaryInteger

2014-12-10 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241983#comment-14241983 ] Cheng Lian commented on SPARK-4814: --- Thanks [~srowen]! I'll take a look. > Enable asser

[jira] [Comment Edited] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-10 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241973#comment-14241973 ] Saisai Shao edited comment on SPARK-4740 at 12/11/14 1:34 AM: --

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-10 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241973#comment-14241973 ] Saisai Shao commented on SPARK-4740: Hi Reynold, the code I pasted is just the example

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241974#comment-14241974 ] Patrick Wendell commented on SPARK-4687: I commented a bit on the JIRA after seein

[jira] [Commented] (SPARK-4687) SparkContext#addFile doesn't keep file folder information

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241961#comment-14241961 ] Apache Spark commented on SPARK-4687: - User 'sryza' has created a pull request for thi

[jira] [Commented] (SPARK-4675) Find similar products and similar users in MatrixFactorizationModel

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241952#comment-14241952 ] Joseph K. Bradley commented on SPARK-4675: -- Just to make sure I get your last que

[jira] [Commented] (SPARK-2892) Socket Receiver does not stop when streaming context is stopped

2014-12-10 Thread Ilayaperumal Gopinathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241951#comment-14241951 ] Ilayaperumal Gopinathan commented on SPARK-2892: To add more info: When t

[jira] [Created] (SPARK-4822) Use sphinx tags for Python doc annotations

2014-12-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-4822: Summary: Use sphinx tags for Python doc annotations Key: SPARK-4822 URL: https://issues.apache.org/jira/browse/SPARK-4822 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-3526) Docs section on data locality

2014-12-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3526. Resolution: Fixed Fix Version/s: 1.2.0 Thanks [~aash] for contributing. > Docs secti

[jira] [Closed] (SPARK-4633) Support gzip in spark.compression.io.codec

2014-12-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell closed SPARK-4633. -- Resolution: Won't Fix I'd like to close this issue for now until we get a better understanding o

[jira] [Commented] (SPARK-4821) pyspark.mllib.rand docs not generated correctly

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241871#comment-14241871 ] Apache Spark commented on SPARK-4821: - User 'jkbradley' has created a pull request for

[jira] [Updated] (SPARK-4759) Deadlock in complex spark job in local mode

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4759: - Fix Version/s: 1.1.2 1.3.0 > Deadlock in complex spark job in local mode >

[jira] [Updated] (SPARK-4759) Deadlock in complex spark job in local mode

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4759: - Labels: backport-needed (was: ) > Deadlock in complex spark job in local mode > -

[jira] [Updated] (SPARK-4759) Deadlock in complex spark job in local mode

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4759: - Target Version/s: 1.3.0, 1.1.2, 1.2.1 > Deadlock in complex spark job in local mode >

[jira] [Created] (SPARK-4821) pyspark.mllib.rand docs not generated correctly

2014-12-10 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-4821: Summary: pyspark.mllib.rand docs not generated correctly Key: SPARK-4821 URL: https://issues.apache.org/jira/browse/SPARK-4821 Project: Spark Issue T

[jira] [Updated] (SPARK-4569) Rename "externalSorting" in Aggregator

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4569: - Labels: backport-needed (was: ) > Rename "externalSorting" in Aggregator > --

[jira] [Updated] (SPARK-4569) Rename "externalSorting" in Aggregator

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4569: - Target Version/s: 1.3.0, 1.1.2, 1.2.1 Fix Version/s: 1.3.0 > Rename "externalSorting" in Aggregator

[jira] [Updated] (SPARK-4820) Spark build encounters "File name too long" on some encrypted filesystems

2014-12-10 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-4820: --- Description: This was reported by Luchesar Cekov on github along with a proposed fix. The fix

[jira] [Created] (SPARK-4820) Spark build encounters "File name too long" on some encrypted filesystems

2014-12-10 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-4820: -- Summary: Spark build encounters "File name too long" on some encrypted filesystems Key: SPARK-4820 URL: https://issues.apache.org/jira/browse/SPARK-4820 Project:

[jira] [Created] (SPARK-4819) Remove Guava's "Optional" from public API

2014-12-10 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-4819: - Summary: Remove Guava's "Optional" from public API Key: SPARK-4819 URL: https://issues.apache.org/jira/browse/SPARK-4819 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-4793) way to find assembly jar is too strict

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4793: - Target Version/s: 1.3.0, 1.1.2, 1.2.1 Fix Version/s: 1.3.0 > way to find assembly jar is too strict

[jira] [Updated] (SPARK-4793) way to find assembly jar is too strict

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4793: - Affects Version/s: 1.1.0 > way to find assembly jar is too strict > --

[jira] [Updated] (SPARK-4793) way to find assembly jar is too strict

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4793: - Assignee: Adrian Wang > way to find assembly jar is too strict > -- >

[jira] [Updated] (SPARK-4215) Allow requesting executors only on Yarn (for now)

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4215: - Fix Version/s: 1.3.0 > Allow requesting executors only on Yarn (for now) > ---

[jira] [Updated] (SPARK-4215) Allow requesting executors only on Yarn (for now)

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4215: - Labels: backport-needed (was: ) > Allow requesting executors only on Yarn (for now) > ---

[jira] [Commented] (SPARK-2075) Anonymous classes are missing from Spark distribution

2014-12-10 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241701#comment-14241701 ] Pat Ferrel commented on SPARK-2075: --- If the explanation is correct this needs to be file

[jira] [Closed] (SPARK-4771) Document standalone --supervise feature

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4771. Resolution: Fixed Fix Version/s: 1.2.1 1.1.2 > Document standalone --supervise fea

[jira] [Updated] (SPARK-4771) Document standalone --supervise feature

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4771: - Fix Version/s: 1.3.0 > Document standalone --supervise feature > --- >

[jira] [Updated] (SPARK-4771) Document standalone --supervise feature

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4771: - Target Version/s: 1.3.0, 1.1.2, 1.2.1 (was: 1.1.2, 1.2.1) > Document standalone --supervise feature > ---

[jira] [Closed] (SPARK-4329) Add indexing feature for HistoryPage

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-4329. Resolution: Fixed Assignee: Kousuke Saruta > Add indexing feature for HistoryPage > --

[jira] [Updated] (SPARK-4329) Add indexing feature for HistoryPage

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4329: - Fix Version/s: 1.3.0 > Add indexing feature for HistoryPage > > >

[jira] [Updated] (SPARK-4161) Spark shell class path is not correctly set if "spark.driver.extraClassPath" is set in defaults.conf

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4161: - Labels: backport-needed (was: ) > Spark shell class path is not correctly set if "spark.driver.extraClass

[jira] [Updated] (SPARK-4161) Spark shell class path is not correctly set if "spark.driver.extraClassPath" is set in defaults.conf

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4161: - Fix Version/s: 1.3.0 > Spark shell class path is not correctly set if "spark.driver.extraClassPath" > is

[jira] [Updated] (SPARK-4161) Spark shell class path is not correctly set if "spark.driver.extraClassPath" is set in defaults.conf

2014-12-10 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4161: - Target Version/s: 1.3.0, 1.1.2, 1.2.1 (was: 1.1.2, 1.2.1) > Spark shell class path is not correctly set i

[jira] [Commented] (SPARK-2951) SerDeUtils.pythonToPairRDD fails on RDDs of pickled array.arrays in Python 2.6

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241654#comment-14241654 ] Apache Spark commented on SPARK-2951: - User 'davies' has created a pull request for th

[jira] [Commented] (SPARK-3918) Forget Unpersist in RandomForest.scala(train Method)

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241649#comment-14241649 ] Joseph K. Bradley commented on SPARK-3918: -- Oops! I forgot to update that PR's n

[jira] [Updated] (SPARK-3702) Standardize MLlib classes for learners, models

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3702: - Description: Summary: Create a class hierarchy for learning algorithms and the models thos

[jira] [Commented] (SPARK-3702) Standardize MLlib classes for learners, models

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241611#comment-14241611 ] Joseph K. Bradley commented on SPARK-3702: -- APIs for Classifiers, Regressors > S

[jira] [Updated] (SPARK-3702) Standardize MLlib classes for learners, models

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3702: - Description: Summary: Create a class hierarchy for learning algorithms and the models thos

[jira] [Assigned] (SPARK-4789) Standardize ML Prediction APIs

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-4789: Assignee: Joseph K. Bradley > Standardize ML Prediction APIs >

[jira] [Updated] (SPARK-4789) Standardize ML Prediction APIs

2014-12-10 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-4789: - Issue Type: Sub-task (was: New Feature) Parent: SPARK-1856 > Standardize ML Predi

[jira] [Commented] (SPARK-4675) Find similar products and similar users in MatrixFactorizationModel

2014-12-10 Thread Debasish Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241535#comment-14241535 ] Debasish Das commented on SPARK-4675: - There are few issues: 1. Batch API for topK si

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241524#comment-14241524 ] Apache Spark commented on SPARK-4740: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-10 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241523#comment-14241523 ] Reynold Xin commented on SPARK-4740: Also [~jerryshao] when I asked you to disable tra

[jira] [Commented] (SPARK-4740) Netty's network throughput is about 1/2 of NIO's in spark-perf sortByKey

2014-12-10 Thread Aaron Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241463#comment-14241463 ] Aaron Davidson commented on SPARK-4740: --- Clarification: The merged version of Reynol

[jira] [Commented] (SPARK-4569) Rename "externalSorting" in Aggregator

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241455#comment-14241455 ] Apache Spark commented on SPARK-4569: - User 'ilganeli' has created a pull request for

[jira] [Commented] (SPARK-1037) the name of findTaskFromList & findTask in TaskSetManager.scala is confusing

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1037?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241442#comment-14241442 ] Apache Spark commented on SPARK-1037: - User 'ilganeli' has created a pull request for

[jira] [Commented] (SPARK-3607) ConnectionManager threads.max configs on the thread pools don't work

2014-12-10 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241431#comment-14241431 ] Apache Spark commented on SPARK-3607: - User 'ilganeli' has created a pull request for

[jira] [Commented] (SPARK-4746) integration tests should be separated from faster unit tests

2014-12-10 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241383#comment-14241383 ] Imran Rashid commented on SPARK-4746: - I think this can be done two ways: (1) by movin

[jira] [Created] (SPARK-4818) Join operation should use iterator/lazy evaluation

2014-12-10 Thread Johannes Simon (JIRA)
Johannes Simon created SPARK-4818: - Summary: Join operation should use iterator/lazy evaluation Key: SPARK-4818 URL: https://issues.apache.org/jira/browse/SPARK-4818 Project: Spark Issue Type

[jira] [Commented] (SPARK-2892) Socket Receiver does not stop when streaming context is stopped

2014-12-10 Thread Mark Fisher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241362#comment-14241362 ] Mark Fisher commented on SPARK-2892: [~srowen] SPARK-4802 is only related to the recei

[jira] [Updated] (SPARK-4746) integration tests should be separated from faster unit tests

2014-12-10 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-4746: Summary: integration tests should be separated from faster unit tests (was: integration tests shoul

[jira] [Commented] (SPARK-1338) Create Additional Style Rules

2014-12-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14241297#comment-14241297 ] Sean Owen commented on SPARK-1338: -- The PR for this was abandoned. What's the thinking on

[jira] [Resolved] (SPARK-1380) Add sort-merge based cogroup/joins.

2014-12-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1380. -- Resolution: Won't Fix The PR discussion suggests this is WontFix. > Add sort-merge based cogroup/joins.

[jira] [Resolved] (SPARK-1385) Use existing code-path for JSON de/serialization of BlockId

2014-12-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1385. -- Resolution: Fixed PR is https://github.com/apache/spark/pull/289. This was merged in https://github.com

[jira] [Resolved] (SPARK-1127) Add saveAsHBase to PairRDDFunctions

2014-12-10 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1127. -- Resolution: Won't Fix Fix Version/s: (was: 1.2.0) Given the discussion in both PRs, this look

  1   2   >