[jira] [Updated] (SPARK-8592) CoarseGrainedExecutorBackend: Cannot register with driver => NPE

2015-06-24 Thread Sjoerd Mulder (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sjoerd Mulder updated SPARK-8592: - Component/s: Scheduler > CoarseGrainedExecutorBackend: Cannot register with driver => NPE > --

[jira] [Updated] (SPARK-8622) Spark 1.3.1 and 1.4.0 doesn't put executor working directory on executor classpath

2015-06-24 Thread Baswaraj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baswaraj updated SPARK-8622: Description: I ran into an issue that executor not able to pickup my configs/ function from my custom jar i

[jira] [Updated] (SPARK-8622) Spark 1.3.1 and 1.4.0 doesn't put executor working directory on executor classpath

2015-06-24 Thread Baswaraj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baswaraj updated SPARK-8622: Description: I ran into an issue that executor not able to pickup my configs/ function from my custom jar i

[jira] [Updated] (SPARK-8622) Spark 1.3.1 and 1.4.0 doesn't put executor working directory on executor classpath

2015-06-24 Thread Baswaraj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baswaraj updated SPARK-8622: Description: I ran into an issue that executor not able to pickup my configs/ function from my custom jar i

[jira] [Updated] (SPARK-8622) Spark 1.3.1 and 1.4.0 doesn't put executor working directory on executor classpath

2015-06-24 Thread Baswaraj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Baswaraj updated SPARK-8622: Summary: Spark 1.3.1 and 1.4.0 doesn't put executor working directory on executor classpath (was: Spark 1.3

[jira] [Created] (SPARK-8622) Spark 1.3.1 and 1.4.0 doesn't put executor working directory on execitor classpath

2015-06-24 Thread Baswaraj (JIRA)
Baswaraj created SPARK-8622: --- Summary: Spark 1.3.1 and 1.4.0 doesn't put executor working directory on execitor classpath Key: SPARK-8622 URL: https://issues.apache.org/jira/browse/SPARK-8622 Project: Spark

[jira] [Updated] (SPARK-8590) add code gen for ExtractValue

2015-06-24 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-8590: --- Issue Type: Improvement (was: Bug) > add code gen for ExtractValue > - >

[jira] [Updated] (SPARK-8589) cleanup DateTimeUtils

2015-06-24 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-8589: --- Issue Type: Improvement (was: Bug) > cleanup DateTimeUtils > - > >

[jira] [Updated] (SPARK-8620) cleanup CodeGenContext

2015-06-24 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-8620: --- Issue Type: Improvement (was: Bug) > cleanup CodeGenContext > -- > >

[jira] [Assigned] (SPARK-8620) cleanup CodeGenContext

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8620: --- Assignee: Apache Spark > cleanup CodeGenContext > -- > >

[jira] [Assigned] (SPARK-8620) cleanup CodeGenContext

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8620: --- Assignee: (was: Apache Spark) > cleanup CodeGenContext > -- > >

[jira] [Commented] (SPARK-8620) cleanup CodeGenContext

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600733#comment-14600733 ] Apache Spark commented on SPARK-8620: - User 'cloud-fan' has created a pull request for

[jira] [Created] (SPARK-8621) crosstab exception when one of the value is empty

2015-06-24 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-8621: -- Summary: crosstab exception when one of the value is empty Key: SPARK-8621 URL: https://issues.apache.org/jira/browse/SPARK-8621 Project: Spark Issue Type: Sub-t

[jira] [Commented] (SPARK-8567) Flaky test: o.a.s.sql.hive.HiveSparkSubmitSuite --jars

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600711#comment-14600711 ] Apache Spark commented on SPARK-8567: - User 'yhuai' has created a pull request for thi

[jira] [Created] (SPARK-8620) cleanup CodeGenContext

2015-06-24 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-8620: -- Summary: cleanup CodeGenContext Key: SPARK-8620 URL: https://issues.apache.org/jira/browse/SPARK-8620 Project: Spark Issue Type: Bug Components: SQL

[jira] [Resolved] (SPARK-7884) Move block deserialization from BlockStoreShuffleFetcher to ShuffleReader

2015-06-24 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-7884. --- Resolution: Fixed Fix Version/s: 1.5.0 > Move block deserialization from BlockStoreShuf

[jira] [Commented] (SPARK-6724) Model import/export for FPGrowth

2015-06-24 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600639#comment-14600639 ] Hrishikesh commented on SPARK-6724: --- [~MechCoder].. yes, I'm working on this. Will let y

[jira] [Issue Comment Deleted] (SPARK-6456) Spark Sql throwing exception on large partitioned data

2015-06-24 Thread pankaj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pankaj updated SPARK-6456: -- Comment: was deleted (was: It was the issue of large number of partition. actually the number was too high. i r

[jira] [Assigned] (SPARK-8619) Can't find the keytab file when recovering the streaming application.

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8619: --- Assignee: (was: Apache Spark) > Can't find the keytab file when recovering the streaming

[jira] [Commented] (SPARK-8619) Can't find the keytab file when recovering the streaming application.

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600627#comment-14600627 ] Apache Spark commented on SPARK-8619: - User 'SaintBacchus' has created a pull request

[jira] [Assigned] (SPARK-8619) Can't find the keytab file when recovering the streaming application.

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8619: --- Assignee: Apache Spark > Can't find the keytab file when recovering the streaming application

[jira] [Created] (SPARK-8619) Can't find the keytab file when recovering the streaming application.

2015-06-24 Thread SaintBacchus (JIRA)
SaintBacchus created SPARK-8619: --- Summary: Can't find the keytab file when recovering the streaming application. Key: SPARK-8619 URL: https://issues.apache.org/jira/browse/SPARK-8619 Project: Spark

[jira] [Assigned] (SPARK-8618) Obtain hbase token retries many times when having hbase class but no hbase configuration

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8618: --- Assignee: (was: Apache Spark) > Obtain hbase token retries many times when having hbase c

[jira] [Commented] (SPARK-8618) Obtain hbase token retries many times when having hbase class but no hbase configuration

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600576#comment-14600576 ] Apache Spark commented on SPARK-8618: - User 'XuTingjun' has created a pull request for

[jira] [Assigned] (SPARK-8618) Obtain hbase token retries many times when having hbase class but no hbase configuration

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8618: --- Assignee: Apache Spark > Obtain hbase token retries many times when having hbase class but no

[jira] [Created] (SPARK-8618) Obtain hbase token retries many times when having hbase class but no hbase configuration

2015-06-24 Thread meiyoula (JIRA)
meiyoula created SPARK-8618: --- Summary: Obtain hbase token retries many times when having hbase class but no hbase configuration Key: SPARK-8618 URL: https://issues.apache.org/jira/browse/SPARK-8618 Project:

[jira] [Commented] (SPARK-8587) Return cost and cluster index KMeansModel.predict

2015-06-24 Thread Rakesh Chalasani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600568#comment-14600568 ] Rakesh Chalasani commented on SPARK-8587: - Sure, I can add this on the KMeans pipe

[jira] [Commented] (SPARK-8603) In Windows,Not able to create a Spark context from R studio

2015-06-24 Thread Prakash Ponshankaarchinnusamy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8603?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600565#comment-14600565 ] Prakash Ponshankaarchinnusamy commented on SPARK-8603: -- 1) Downloaded

[jira] [Resolved] (SPARK-8595) CLONE - Spark Sql throwing exception on large partitioned data

2015-06-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-8595. -- Resolution: Invalid > CLONE - Spark Sql throwing exception on large partitioned data > -

[jira] [Created] (SPARK-8617) Handle history files better

2015-06-24 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-8617: Summary: Handle history files better Key: SPARK-8617 URL: https://issues.apache.org/jira/browse/SPARK-8617 Project: Spark Issue Type: Improvement C

[jira] [Created] (SPARK-8616) SQLContext doesn't handle tricky column names when loading from JDBC

2015-06-24 Thread Gergely Svigruha (JIRA)
Gergely Svigruha created SPARK-8616: --- Summary: SQLContext doesn't handle tricky column names when loading from JDBC Key: SPARK-8616 URL: https://issues.apache.org/jira/browse/SPARK-8616 Project: Spa

[jira] [Closed] (SPARK-8594) History Server doesn't show complete application when one attempt inprogress

2015-06-24 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves closed SPARK-8594. Resolution: Duplicate accidentally filed same jira twice > History Server doesn't show complete app

[jira] [Created] (SPARK-8615) sql programming guide recommends deprecated code

2015-06-24 Thread Gergely Svigruha (JIRA)
Gergely Svigruha created SPARK-8615: --- Summary: sql programming guide recommends deprecated code Key: SPARK-8615 URL: https://issues.apache.org/jira/browse/SPARK-8615 Project: Spark Issue Ty

[jira] [Created] (SPARK-8614) Row order preservation for operations on MLlib IndexedRowMatrix

2015-06-24 Thread Jan Luts (JIRA)
Jan Luts created SPARK-8614: --- Summary: Row order preservation for operations on MLlib IndexedRowMatrix Key: SPARK-8614 URL: https://issues.apache.org/jira/browse/SPARK-8614 Project: Spark Issue Ty

[jira] [Created] (SPARK-8612) Yarn application status is misreported for failed PySpark apps.

2015-06-24 Thread Juliet Hougland (JIRA)
Juliet Hougland created SPARK-8612: -- Summary: Yarn application status is misreported for failed PySpark apps. Key: SPARK-8612 URL: https://issues.apache.org/jira/browse/SPARK-8612 Project: Spark

[jira] [Created] (SPARK-8613) Add a param for disabling of feature scaling, default to true

2015-06-24 Thread holdenk (JIRA)
holdenk created SPARK-8613: -- Summary: Add a param for disabling of feature scaling, default to true Key: SPARK-8613 URL: https://issues.apache.org/jira/browse/SPARK-8613 Project: Spark Issue Type:

[jira] [Issue Comment Deleted] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-24 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-8337: --- Comment: was deleted (was: OK, well, I'd like to take a crack at it :).) > KafkaUtils.createDirectStr

[jira] [Commented] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-24 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600468#comment-14600468 ] Saisai Shao commented on SPARK-8337: OK, well, I'd like to take a crack at it :). > K

[jira] [Commented] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-24 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600469#comment-14600469 ] Saisai Shao commented on SPARK-8337: OK, well, I'd like to take a crack at it :). > K

[jira] [Commented] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-24 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600470#comment-14600470 ] Saisai Shao commented on SPARK-8337: OK, well, I'd like to take a crack at it :). > K

[jira] [Issue Comment Deleted] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-24 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-8337: --- Comment: was deleted (was: OK, well, I'd like to take a crack at it :).) > KafkaUtils.createDirectStr

[jira] [Created] (SPARK-8611) Spark Master WebUI's REST URI field is wrong on protocol

2015-06-24 Thread Wisely Chen (JIRA)
Wisely Chen created SPARK-8611: -- Summary: Spark Master WebUI's REST URI field is wrong on protocol Key: SPARK-8611 URL: https://issues.apache.org/jira/browse/SPARK-8611 Project: Spark Issue Typ

[jira] [Updated] (SPARK-8611) Standalone Cluster WebUI's REST URI field is wrong on protocol

2015-06-24 Thread Wisely Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wisely Chen updated SPARK-8611: --- Description: In Standalone Cluster WebUI, there is a field "REST URL". It should be http:// protocol b

[jira] [Updated] (SPARK-8611) Standalone Cluster WebUI's REST URI field is wrong on protocol

2015-06-24 Thread Wisely Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wisely Chen updated SPARK-8611: --- Summary: Standalone Cluster WebUI's REST URI field is wrong on protocol (was: Spark Master WebUI's R

[jira] [Commented] (SPARK-8611) Spark Master WebUI's REST URI field is wrong on protocol

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600442#comment-14600442 ] Apache Spark commented on SPARK-8611: - User 'thegiive' has created a pull request for

[jira] [Assigned] (SPARK-8611) Spark Master WebUI's REST URI field is wrong on protocol

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8611: --- Assignee: (was: Apache Spark) > Spark Master WebUI's REST URI field is wrong on protocol

[jira] [Assigned] (SPARK-8611) Spark Master WebUI's REST URI field is wrong on protocol

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8611: --- Assignee: Apache Spark > Spark Master WebUI's REST URI field is wrong on protocol >

[jira] [Commented] (SPARK-8559) Support association rule generation in FPGrowth

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600413#comment-14600413 ] Apache Spark commented on SPARK-8559: - User 'feynmanliang' has created a pull request

[jira] [Assigned] (SPARK-746) Automatically Use Avro Serialization for Avro Objects

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-746: -- Assignee: (was: Apache Spark) > Automatically Use Avro Serialization for Avro Objects >

[jira] [Assigned] (SPARK-746) Automatically Use Avro Serialization for Avro Objects

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-746: -- Assignee: Apache Spark > Automatically Use Avro Serialization for Avro Objects > ---

[jira] [Commented] (SPARK-746) Automatically Use Avro Serialization for Avro Objects

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600402#comment-14600402 ] Apache Spark commented on SPARK-746: User 'JDrit' has created a pull request for this i

[jira] [Commented] (SPARK-8599) Use a Random operator to handle Random distribution generating expressions

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600400#comment-14600400 ] Michael Armbrust commented on SPARK-8599: - What about this case? Random DF {code}

[jira] [Commented] (SPARK-8610) Separate Row and InternalRow (part 2)

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600383#comment-14600383 ] Apache Spark commented on SPARK-8610: - User 'davies' has created a pull request for th

[jira] [Assigned] (SPARK-8610) Separate Row and InternalRow (part 2)

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8610: --- Assignee: Apache Spark (was: Davies Liu) > Separate Row and InternalRow (part 2) > -

[jira] [Assigned] (SPARK-8610) Separate Row and InternalRow (part 2)

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8610: --- Assignee: Davies Liu (was: Apache Spark) > Separate Row and InternalRow (part 2) > -

[jira] [Commented] (SPARK-8606) Exceptions in RDD.getPreferredLocations() and getPartitions() should not be able to crash DAGScheduler

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600381#comment-14600381 ] Josh Rosen commented on SPARK-8606: --- An example stacktrace exhibiting this bug: {code}

[jira] [Commented] (SPARK-7739) Improve ChiSqSelector example code in the user guide

2015-06-24 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600377#comment-14600377 ] Seth Hendrickson commented on SPARK-7739: - I will work on implementing this change

[jira] [Created] (SPARK-8610) Separate Row and InternalRow (part 2)

2015-06-24 Thread Davies Liu (JIRA)
Davies Liu created SPARK-8610: - Summary: Separate Row and InternalRow (part 2) Key: SPARK-8610 URL: https://issues.apache.org/jira/browse/SPARK-8610 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5562) LDA should handle empty documents

2015-06-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600373#comment-14600373 ] Joseph K. Bradley commented on SPARK-5562: -- Please go ahead, thanks! > LDA shoul

[jira] [Resolved] (SPARK-8075) apply type checking interface to more expressions

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-8075. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6723 [https:/

[jira] [Commented] (SPARK-5562) LDA should handle empty documents

2015-06-24 Thread Alok Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600357#comment-14600357 ] Alok Singh commented on SPARK-5562: --- I would like to take this. > LDA should handle emp

[jira] [Assigned] (SPARK-8344) Add internal metrics / logging for DAGScheduler to detect long pauses / blocking

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-8344: - Assignee: Josh Rosen > Add internal metrics / logging for DAGScheduler to detect long pauses / >

[jira] [Assigned] (SPARK-8344) Add internal metrics / logging for DAGScheduler to detect long pauses / blocking

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8344: --- Assignee: Apache Spark > Add internal metrics / logging for DAGScheduler to detect long pause

[jira] [Commented] (SPARK-8344) Add internal metrics / logging for DAGScheduler to detect long pauses / blocking

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600337#comment-14600337 ] Apache Spark commented on SPARK-8344: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-8344) Add internal metrics / logging for DAGScheduler to detect long pauses / blocking

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8344: --- Assignee: (was: Apache Spark) > Add internal metrics / logging for DAGScheduler to detect

[jira] [Commented] (SPARK-8599) Use a Random operator to handle Random distribution generating expressions

2015-06-24 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600315#comment-14600315 ] Burak Yavuz commented on SPARK-8599: cc [~marmbrus] [~rxin] > Use a Random operator t

[jira] [Created] (SPARK-8609) After initializing a DataFrame with random columns and a seed, ordering by that random column should return same sorted order

2015-06-24 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8609: -- Summary: After initializing a DataFrame with random columns and a seed, ordering by that random column should return same sorted order Key: SPARK-8609 URL: https://issues.apache.org/j

[jira] [Created] (SPARK-8608) After initializing a DataFrame with random columns and a seed, df.show should return same value

2015-06-24 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8608: -- Summary: After initializing a DataFrame with random columns and a seed, df.show should return same value Key: SPARK-8608 URL: https://issues.apache.org/jira/browse/SPARK-8608

[jira] [Assigned] (SPARK-8607) SparkR - Third party jars are not being added to classpath in SparkRBackend

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8607: --- Assignee: Apache Spark > SparkR - Third party jars are not being added to classpath in SparkR

[jira] [Assigned] (SPARK-8607) SparkR - Third party jars are not being added to classpath in SparkRBackend

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8607: --- Assignee: (was: Apache Spark) > SparkR - Third party jars are not being added to classpat

[jira] [Commented] (SPARK-8607) SparkR - Third party jars are not being added to classpath in SparkRBackend

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600301#comment-14600301 ] Apache Spark commented on SPARK-8607: - User 'cafreeman' has created a pull request for

[jira] [Commented] (SPARK-8607) SparkR - Third party jars are not being added to classpath in SparkRBackend

2015-06-24 Thread Chris Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600299#comment-14600299 ] Chris Freeman commented on SPARK-8607: -- PR open at https://github.com/apache/spark/pu

[jira] [Updated] (SPARK-8607) SparkR - Third party jars are not being added to classpath in SparkRBackend

2015-06-24 Thread Chris Freeman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Freeman updated SPARK-8607: - Shepherd: Chris Freeman > SparkR - Third party jars are not being added to classpath in SparkRBack

[jira] [Created] (SPARK-8607) SparkR - Third party jars are not being added to classpath in SparkRBackend

2015-06-24 Thread Chris Freeman (JIRA)
Chris Freeman created SPARK-8607: Summary: SparkR - Third party jars are not being added to classpath in SparkRBackend Key: SPARK-8607 URL: https://issues.apache.org/jira/browse/SPARK-8607 Project: Sp

[jira] [Assigned] (SPARK-4666) "executor.memoryOverhead" config should take a "memory string"

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4666: --- Assignee: (was: Apache Spark) > "executor.memoryOverhead" config should take a "memory st

[jira] [Assigned] (SPARK-4666) "executor.memoryOverhead" config should take a "memory string"

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4666: --- Assignee: Apache Spark > "executor.memoryOverhead" config should take a "memory string" > ---

[jira] [Created] (SPARK-8606) Exceptions in RDD.getPreferredLocations() and getPartitions() should not be able to crash DAGScheduler

2015-06-24 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-8606: - Summary: Exceptions in RDD.getPreferredLocations() and getPartitions() should not be able to crash DAGScheduler Key: SPARK-8606 URL: https://issues.apache.org/jira/browse/SPARK-8606

[jira] [Updated] (SPARK-8558) Script /dev/run-tests fails when _JAVA_OPTIONS env var set

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-8558: -- Affects Version/s: (was: 1.4.0) 1.5.0 > Script /dev/run-tests fails when _JAV

[jira] [Updated] (SPARK-8558) Script /dev/run-tests fails when _JAVA_OPTIONS env var set

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-8558: -- Assignee: Oleksiy Dyagilev > Script /dev/run-tests fails when _JAVA_OPTIONS env var set > --

[jira] [Resolved] (SPARK-8558) Script /dev/run-tests fails when _JAVA_OPTIONS env var set

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-8558. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6956 [https://github.com/

[jira] [Resolved] (SPARK-6777) Implement backwards-compatibility rules in Parquet schema converters

2015-06-24 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-6777. --- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6617 [https://github.com/

[jira] [Updated] (SPARK-6192) Enhance MLlib's Python API (GSoC 2015)

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6192: - Target Version/s: 1.5.0 > Enhance MLlib's Python API (GSoC 2015) > ---

[jira] [Updated] (SPARK-7633) Streaming Logistic Regression- Python bindings

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7633: - Assignee: Manoj Kumar > Streaming Logistic Regression- Python bindings > -

[jira] [Resolved] (SPARK-7633) Streaming Logistic Regression- Python bindings

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7633. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6849 [https://githu

[jira] [Commented] (SPARK-8587) Return cost and cluster index KMeansModel.predict

2015-06-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600237#comment-14600237 ] Joseph K. Bradley commented on SPARK-8587: -- I agree; we should not change the beh

[jira] [Updated] (SPARK-8328) Add a CheckAnalysis rule to ensure that Union branches have the same schema

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8328: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > Add a CheckAnalysis rule to ensure tha

[jira] [Updated] (SPARK-7821) Hide private SQL JDBC classes from Javadoc

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7821: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1) > Hide private SQL JDBC classes from Javadoc >

[jira] [Updated] (SPARK-8036) Ignores files whose name starts with "." while enumerating files in HadoopFsRelation

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8036: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > Ignores files whose name starts with "

[jira] [Updated] (SPARK-8144) For PySpark SQL, automatically convert values provided in readwriter options to string

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8144: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > For PySpark SQL, automatically convert

[jira] [Updated] (SPARK-8501) ORC data source may give empty schema if an ORC file containing zero rows is picked for schema discovery

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8501: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > ORC data source may give empty schema

[jira] [Updated] (SPARK-8572) Type coercion for ScalaUDFs

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8572: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0, 1.4.2) > Type coercion for ScalaUDFs > -

[jira] [Updated] (SPARK-8315) Better error when saving to parquet with duplicate columns

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8315: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > Better error when saving to parquet wi

[jira] [Updated] (SPARK-8588) Could not use concat with UDF in where clause

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8588: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > Could not use concat with UDF in where

[jira] [Updated] (SPARK-8445) MLlib 1.5 Roadmap

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8445: - Description: We expect to see many MLlib contributors for the 1.5 release. To scale out the devel

[jira] [Updated] (SPARK-7710) User guide and example code for math/stat functions in DataFrames

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7710: Target Version/s: 1.4.2 (was: 1.4.1) > User guide and example code for math/stat functions

[jira] [Updated] (SPARK-8484) Add TrainValidationSplit to ml.tuning

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8484: - Shepherd: Xiangrui Meng > Add TrainValidationSplit to ml.tuning >

[jira] [Commented] (SPARK-7292) Provide operator to truncate lineage without persisting RDD's

2015-06-24 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600229#comment-14600229 ] Andrew Or commented on SPARK-7292: -- Posted a short design doc. I'm going to go ahead and

[jira] [Updated] (SPARK-8357) Memory leakage on unsafe aggregation path with empty input

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8357: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0) > Memory leakage on unsafe aggregation p

[jira] [Updated] (SPARK-7212) Frequent pattern mining for sequential item sets

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7212: - Shepherd: Xiangrui Meng > Frequent pattern mining for sequential item sets > -

[jira] [Updated] (SPARK-7212) Frequent pattern mining for sequential item sets

2015-06-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7212: - Assignee: Feynman Liang Target Version/s: 1.5.0 > Frequent pattern mining for sequenti

  1   2   3   4   >