[jira] [Commented] (SPARK-4633) Support gzip in spark.compression.io.codec

2014-12-17 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249632#comment-14249632 ] Takeshi Yamamuro commented on SPARK-4633: - Ok, thanks. > Support gzip in spark.co

[jira] [Updated] (SPARK-3432) Fix logging of unit test execution time

2014-12-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-3432: Description: [Per Reynold|http://mail-archives.apache.org/mod_mbox/spark-dev/201408.mbox/%3

[jira] [Commented] (SPARK-4872) Provide sample format of training/test data in MLlib programming guide

2014-12-17 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249656#comment-14249656 ] Sean Owen commented on SPARK-4872: -- Yes, you map categorical features to numeric values,

[jira] [Created] (SPARK-4873) WriteAheadLogBasedBlockHandler improvement

2014-12-17 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-4873: --- Summary: WriteAheadLogBasedBlockHandler improvement Key: SPARK-4873 URL: https://issues.apache.org/jira/browse/SPARK-4873 Project: Spark Issue Type: Improvemen

[jira] [Commented] (SPARK-4873) WriteAheadLogBasedBlockHandler improvement

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249716#comment-14249716 ] Apache Spark commented on SPARK-4873: - User 'zsxwing' has created a pull request for t

[jira] [Commented] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2014-12-17 Thread Alister Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14249752#comment-14249752 ] Alister Lee commented on SPARK-3533: The stackoverflow question has a good answer (I h

[jira] [Commented] (SPARK-1812) Support cross-building with Scala 2.11

2014-12-17 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250023#comment-14250023 ] Tobias Bertelsen commented on SPARK-1812: - [~schmmd] Looks like [~pwendell] create

[jira] [Comment Edited] (SPARK-1812) Support cross-building with Scala 2.11

2014-12-17 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250023#comment-14250023 ] Tobias Bertelsen edited comment on SPARK-1812 at 12/17/14 3:54 PM: -

[jira] [Comment Edited] (SPARK-1812) Support cross-building with Scala 2.11

2014-12-17 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250023#comment-14250023 ] Tobias Bertelsen edited comment on SPARK-1812 at 12/17/14 4:02 PM: -

[jira] [Comment Edited] (SPARK-1812) Support cross-building with Scala 2.11

2014-12-17 Thread Tobias Bertelsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250023#comment-14250023 ] Tobias Bertelsen edited comment on SPARK-1812 at 12/17/14 4:02 PM: -

[jira] [Commented] (SPARK-3533) Add saveAsTextFileByKey() method to RDDs

2014-12-17 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250227#comment-14250227 ] Nicholas Chammas commented on SPARK-3533: - [~alilee] Do you mean [this answer|http

[jira] [Commented] (SPARK-4417) New API: sample RDD to fixed number of items

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250260#comment-14250260 ] Apache Spark commented on SPARK-4417: - User 'ilganeli' has created a pull request for

[jira] [Created] (SPARK-4874) Report number of records read/written in a task

2014-12-17 Thread Kostas Sakellis (JIRA)
Kostas Sakellis created SPARK-4874: -- Summary: Report number of records read/written in a task Key: SPARK-4874 URL: https://issues.apache.org/jira/browse/SPARK-4874 Project: Spark Issue Type:

[jira] [Commented] (SPARK-4874) Report number of records read/written in a task

2014-12-17 Thread Kostas Sakellis (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250304#comment-14250304 ] Kostas Sakellis commented on SPARK-4874: I'm working on this. > Report number of

[jira] [Commented] (SPARK-3698) Case sensitive check in spark sql is incompleted.

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250323#comment-14250323 ] Apache Spark commented on SPARK-3698: - User 'marmbrus' has created a pull request for

[jira] [Created] (SPARK-4875) Separate Transformer, Estimator params

2014-12-17 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-4875: Summary: Separate Transformer, Estimator params Key: SPARK-4875 URL: https://issues.apache.org/jira/browse/SPARK-4875 Project: Spark Issue Type: Impr

[jira] [Resolved] (SPARK-4595) Spark MetricsServlet is not worked because of initialization ordering

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4595. --- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Assignee: Saisai Shao

[jira] [Resolved] (SPARK-4625) Support "Sort By" in both DSL & SimpleSQLParser

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4625. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3481 [https:/

[jira] [Updated] (SPARK-4625) Support "Sort By" in both DSL & SimpleSQLParser

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4625: Assignee: Cheng Hao > Support "Sort By" in both DSL & SimpleSQLParser >

[jira] [Resolved] (SPARK-4764) Ensure that files are fetched atomically

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4764. --- Resolution: Fixed Fix Version/s: 1.2.1 I've backported this into {{branch-1.2}} for inclusion i

[jira] [Created] (SPARK-4876) An exception thrown when accessing a Spark SQL table using a JDBC driver from a standalone app.

2014-12-17 Thread Leonid Mikhailov (JIRA)
Leonid Mikhailov created SPARK-4876: --- Summary: An exception thrown when accessing a Spark SQL table using a JDBC driver from a standalone app. Key: SPARK-4876 URL: https://issues.apache.org/jira/browse/SPARK-487

[jira] [Updated] (SPARK-4764) Ensure that files are fetched atomically

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4764: -- Labels: (was: backport-needed) > Ensure that files are fetched atomically > --

[jira] [Commented] (SPARK-4595) Spark MetricsServlet is not worked because of initialization ordering

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250432#comment-14250432 ] Josh Rosen commented on SPARK-4595: --- It looks like this bug was introduced in https://g

[jira] [Resolved] (SPARK-4750) Dynamic allocation - we need to synchronize kills

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4750. --- Resolution: Fixed Fix Version/s: 1.2.1 I've backported this into {{branch-1.2}} for inclusion i

[jira] [Updated] (SPARK-4750) Dynamic allocation - we need to synchronize kills

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4750: -- Labels: (was: backport-needed) > Dynamic allocation - we need to synchronize kills > -

[jira] [Updated] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-3926: -- Labels: (was: backport-needed) > result of JavaRDD collectAsMap() is not serializable > --

[jira] [Resolved] (SPARK-3926) result of JavaRDD collectAsMap() is not serializable

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-3926. --- Resolution: Fixed Fix Version/s: (was: 1.1.2) 1.2.1 I've merged this int

[jira] [Updated] (SPARK-4691) Restructure a few lines in shuffle code

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4691: -- Labels: (was: backport-needed) > Restructure a few lines in shuffle code > ---

[jira] [Resolved] (SPARK-4691) Restructure a few lines in shuffle code

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4691. --- Resolution: Fixed Fix Version/s: 1.2.1 I've merged this into branch-1.2 for inclusion in 1.2.1,

[jira] [Updated] (SPARK-4714) BlockManager should check whether blocks have already been removed Checking block is null or not after having gotten info.lock in remove block method

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4714: -- Labels: (was: backport-needed) > BlockManager should check whether blocks have already been removed Ch

[jira] [Resolved] (SPARK-4714) BlockManager should check whether blocks have already been removed Checking block is null or not after having gotten info.lock in remove block method

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4714. --- Resolution: Fixed Fix Version/s: 1.2.1 I've merged this into {{branch-1.2}}, so this will be in

[jira] [Created] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2014-12-17 Thread Stephen Haberman (JIRA)
Stephen Haberman created SPARK-4877: --- Summary: userClassPathFirst doesn't handle user classes inheriting from parent Key: SPARK-4877 URL: https://issues.apache.org/jira/browse/SPARK-4877 Project: Sp

[jira] [Resolved] (SPARK-4772) Accumulators leak memory, both temporarily and permanently

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4772. --- Resolution: Fixed Fix Version/s: 1.2.1 > Accumulators leak memory, both temporarily and permane

[jira] [Updated] (SPARK-4772) Accumulators leak memory, both temporarily and permanently

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4772: -- Labels: accumulators (was: accumulators backport-needed) > Accumulators leak memory, both temporarily a

[jira] [Commented] (SPARK-4772) Accumulators leak memory, both temporarily and permanently

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250460#comment-14250460 ] Josh Rosen commented on SPARK-4772: --- I've merged this into {{branch-1.2}}, so this will

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2014-12-17 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250461#comment-14250461 ] Stephen Haberman commented on SPARK-4877: - Stack trace: {code} 2014-12-17 05:07:3

[jira] [Updated] (SPARK-785) ClosureCleaner not invoked on most PairRDDFunctions

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-785: - Labels: (was: backport-needed) > ClosureCleaner not invoked on most PairRDDFunctions > --

[jira] [Resolved] (SPARK-785) ClosureCleaner not invoked on most PairRDDFunctions

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-785. -- Resolution: Fixed Fix Version/s: 1.2.1 Target Version/s: (was: 1.2.1) I've merged this

[jira] [Updated] (SPARK-4841) Batch serializer bug in PySpark's RDD.zip

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-4841: -- Labels: (was: backport-needed) > Batch serializer bug in PySpark's RDD.zip > -

[jira] [Resolved] (SPARK-4841) Batch serializer bug in PySpark's RDD.zip

2014-12-17 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-4841. --- Resolution: Fixed Fix Version/s: 1.2.1 Target Version/s: (was: 1.2.1) I've merged t

[jira] [Updated] (SPARK-4876) An exception thrown when accessing a Spark SQL table using a JDBC driver from a standalone app.

2014-12-17 Thread Leonid Mikhailov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Leonid Mikhailov updated SPARK-4876: Description: I am running Spark version 1.1.1 (built it on Mac using: mvn -Pyarn -Phadoop-2.

[jira] [Resolved] (SPARK-4694) Long-run user thread(such as HiveThriftServer2) causes the 'process leak' in yarn-client mode

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4694. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3576 [https:/

[jira] [Commented] (SPARK-4877) userClassPathFirst doesn't handle user classes inheriting from parent

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250473#comment-14250473 ] Apache Spark commented on SPARK-4877: - User 'stephenh' has created a pull request for

[jira] [Resolved] (SPARK-3698) Case sensitive check in spark sql is incompleted.

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3698. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3724 [https:/

[jira] [Updated] (SPARK-3698) Case sensitive check in spark sql is incompleted.

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3698: Assignee: Michael Armbrust > Case sensitive check in spark sql is incompleted. > ---

[jira] [Resolved] (SPARK-4493) Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq predicates with nulls for Parquet

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4493. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3367 [https:/

[jira] [Updated] (SPARK-4493) Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq predicates with nulls for Parquet

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4493: Assignee: Cheng Lian > Don't pushdown Eq, NotEq, Lt, LtEq, Gt and GtEq predicates with nulls

[jira] [Commented] (SPARK-4790) Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch allocation, and cleanup with write ahead log"

2014-12-17 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250514#comment-14250514 ] Hari Shreedharan commented on SPARK-4790: - Looks like an HDFS bug - and this looks

[jira] [Updated] (SPARK-4755) SQRT(negative value) should return null

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4755: Assignee: Adrian Wang > SQRT(negative value) should return null > --

[jira] [Resolved] (SPARK-4755) SQRT(negative value) should return null

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4755. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3616 [https:/

[jira] [Commented] (SPARK-4790) Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch allocation, and cleanup with write ahead log"

2014-12-17 Thread Hari Shreedharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250525#comment-14250525 ] Hari Shreedharan commented on SPARK-4790: - Ah, so this is the issue: tracker3.clea

[jira] [Commented] (SPARK-4790) Flaky test in ReceivedBlockTrackerSuite: "block addition, block to batch allocation, and cleanup with write ahead log"

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250610#comment-14250610 ] Apache Spark commented on SPARK-4790: - User 'harishreedharan' has created a pull reque

[jira] [Commented] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2014-12-17 Thread David Ross (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250611#comment-14250611 ] David Ross commented on SPARK-4296: --- I can still reproduce this issue. The test case abo

[jira] [Resolved] (SPARK-3739) Too many splits for small source file in table scanning

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3739. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 2589 [https:/

[jira] [Commented] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250654#comment-14250654 ] Michael Armbrust commented on SPARK-4296: - David, is that using Spark 1.2? > Thro

[jira] [Resolved] (SPARK-4821) pyspark.mllib.rand docs not generated correctly

2014-12-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4821. -- Resolution: Fixed Fix Version/s: 1.2.1 1.3.0 Issue resolved by pull re

[jira] [Updated] (SPARK-4821) pyspark.mllib.rand docs not generated correctly

2014-12-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4821: - Target Version/s: 1.3.0, 1.2.1 (was: 1.2.0) > pyspark.mllib.rand docs not generated correctly > -

[jira] [Commented] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2014-12-17 Thread David Ross (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250674#comment-14250674 ] David Ross commented on SPARK-4296: --- Hi Michael, We are trunk: {{1.3.0-SNAPSHOT}}, as of

[jira] [Reopened] (SPARK-4296) Throw "Expression not in GROUP BY" when using same expression in group by clause and select clause

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust reopened SPARK-4296: - Assignee: Cheng Lian > Throw "Expression not in GROUP BY" when using same expression in

[jira] [Closed] (SPARK-4875) Separate Transformer, Estimator params

2014-12-17 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-4875. Resolution: Duplicate > Separate Transformer, Estimator params > ---

[jira] [Created] (SPARK-4878) driverPropsFetcher causes spurious Akka disassociate errors

2014-12-17 Thread Stephen Haberman (JIRA)
Stephen Haberman created SPARK-4878: --- Summary: driverPropsFetcher causes spurious Akka disassociate errors Key: SPARK-4878 URL: https://issues.apache.org/jira/browse/SPARK-4878 Project: Spark

[jira] [Commented] (SPARK-4878) driverPropsFetcher causes spurious Akka disassociate errors

2014-12-17 Thread Stephen Haberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250711#comment-14250711 ] Stephen Haberman commented on SPARK-4878: - Here are some of the log messages in qu

[jira] [Resolved] (SPARK-4856) Null & empty string should not be considered as StringType at begining in Json schema inferring

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4856. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3708 [https:/

[jira] [Updated] (SPARK-4856) Null & empty string should not be considered as StringType at begining in Json schema inferring

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4856: Assignee: Cheng Hao > Null & empty string should not be considered as StringType at begining

[jira] [Commented] (SPARK-4069) [SPARK-YARN] ApplicationMaster should release all executors' containers before unregistering itself from Yarn RM

2014-12-17 Thread David McWhorter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14250762#comment-14250762 ] David McWhorter commented on SPARK-4069: Seeing the same behavior, a spark applica

[jira] [Resolved] (SPARK-3891) Support Hive Percentile UDAF with array of percentile values

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3891. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 2802 [https:/

[jira] [Updated] (SPARK-4854) Custom UDTF with Lateral View throws ClassNotFound exception in Spark SQL CLI

2014-12-17 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4854: Target Version/s: 1.3.0 > Custom UDTF with Lateral View throws ClassNotFound exception in Sp

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-12-17 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4006: - Fix Version/s: 1.0.3 > Spark Driver crashes whenever an Executor is registered twice > ---

[jira] [Updated] (SPARK-4006) Spark Driver crashes whenever an Executor is registered twice

2014-12-17 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4006: - Target Version/s: 1.1.1, 1.2.0, 1.0.3 (was: 1.1.1, 1.2.0) > Spark Driver crashes whenever an Executor is

[jira] [Updated] (SPARK-4822) Use sphinx tags for Python doc annotations

2014-12-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4822: - Assignee: Kai Sasaki > Use sphinx tags for Python doc annotations > --

[jira] [Resolved] (SPARK-4822) Use sphinx tags for Python doc annotations

2014-12-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4822. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 3685 [https://githu

[jira] [Created] (SPARK-4879) Missing output partitions after job completes with speculative execution

2014-12-17 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-4879: - Summary: Missing output partitions after job completes with speculative execution Key: SPARK-4879 URL: https://issues.apache.org/jira/browse/SPARK-4879 Project: Spark

[jira] [Commented] (SPARK-4872) Provide sample format of training/test data in MLlib programming guide

2014-12-17 Thread zhang jun wei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251033#comment-14251033 ] zhang jun wei commented on SPARK-4872: -- Hi Sean, thanks for the response. I changed

[jira] [Created] (SPARK-4880) remove spark.locality.wait setting in examples/graphx/Analytics.scala

2014-12-17 Thread Ernest (JIRA)
Ernest created SPARK-4880: - Summary: remove spark.locality.wait setting in examples/graphx/Analytics.scala Key: SPARK-4880 URL: https://issues.apache.org/jira/browse/SPARK-4880 Project: Spark Issue

[jira] [Commented] (SPARK-4844) SGD should support custom sampling.

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251117#comment-14251117 ] Apache Spark commented on SPARK-4844: - User 'witgo' has created a pull request for thi

[jira] [Commented] (SPARK-4880) remove spark.locality.wait setting in examples/graphx/Analytics.scala

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251118#comment-14251118 ] Apache Spark commented on SPARK-4880: - User 'Earne' has created a pull request for thi

[jira] [Closed] (SPARK-2686) Add Length support to Spark SQL and HQL and Strlen support to SQL

2014-12-17 Thread Stephen Boesch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stephen Boesch closed SPARK-2686. - Resolution: Later Michael Armbrust requested this be closed while the new UDF structure is being

[jira] [Updated] (SPARK-4874) Report number of records read/written in a task

2014-12-17 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-4874: -- Assignee: Kostas Sakellis > Report number of records read/written in a task > --

[jira] [Commented] (SPARK-4140) Document the dynamic allocation feature

2014-12-17 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251305#comment-14251305 ] Apache Spark commented on SPARK-4140: - User 'andrewor14' has created a pull request fo

[jira] [Commented] (SPARK-2087) Clean Multi-user semantics for thrift JDBC/ODBC server.

2014-12-17 Thread guowei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14251343#comment-14251343 ] guowei commented on SPARK-2087: --- I'm not sure that a full SQLContext per session is a good i