[jira] [Updated] (SPARK-3218) K-Means clusterer can fail on degenerate data

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3218: - Priority: Minor (was: Major) K-Means clusterer can fail on degenerate data

[jira] [Updated] (SPARK-5337) respect spark.task.cpus when launch executors

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5337: - Component/s: (was: Spark Core) Scheduler respect spark.task.cpus when launch

[jira] [Commented] (SPARK-5412) Cannot bind Master to a specific hostname as per the documentation

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545436#comment-14545436 ] Apache Spark commented on SPARK-5412: - User 'srowen' has created a pull request for

[jira] [Assigned] (SPARK-5412) Cannot bind Master to a specific hostname as per the documentation

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5412: --- Assignee: Apache Spark Cannot bind Master to a specific hostname as per the documentation

[jira] [Assigned] (SPARK-5412) Cannot bind Master to a specific hostname as per the documentation

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5412: --- Assignee: (was: Apache Spark) Cannot bind Master to a specific hostname as per the

[jira] [Resolved] (SPARK-5387) parquet writer runs into OOM during writing when number of rows is large

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5387. -- Resolution: Not A Problem I think this is simply a case of needing to either write fewer parquet files

[jira] [Resolved] (SPARK-3185) SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting JOURNAL_FOLDER

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-3185. -- Resolution: Not A Problem SPARK launch on Hadoop 2 in EC2 throws Tachyon exception when Formatting

[jira] [Closed] (SPARK-1715) Ensure actor is self-contained in DAGScheduler

2015-05-15 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nan Zhu closed SPARK-1715. -- Resolution: Won't Fix Akka actor has been removed from DAGScheduler Ensure actor is self-contained in

[jira] [Resolved] (SPARK-1702) Mesos executor won't start because of a ClassNotFoundException

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1702. -- Resolution: Cannot Reproduce Mesos executor won't start because of a ClassNotFoundException

[jira] [Resolved] (SPARK-864) DAGScheduler Exception if A Node is Added then Deleted

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-864. - Resolution: Cannot Reproduce Closing as it looks exceptionally stale at this point and haven't seen

[jira] [Commented] (SPARK-7669) Builds against Hadoop 2.6+ get inconsistent curator dependencies

2015-05-15 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545636#comment-14545636 ] Steve Loughran commented on SPARK-7669: --- snippet of the maven dependencies with

[jira] [Reopened] (SPARK-4412) Parquet logger cannot be configured

2015-05-15 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yana Kadiyska reopened SPARK-4412: -- Reopening as the issue reappeared in 1.3.0 Parquet logger cannot be configured

[jira] [Commented] (SPARK-4412) Parquet logger cannot be configured

2015-05-15 Thread Yana Kadiyska (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545677#comment-14545677 ] Yana Kadiyska commented on SPARK-4412: -- I would like to reopen as I believe the issue

[jira] [Commented] (SPARK-7669) Builds against Hadoop 2.6+ get inconsistent curator dependencies

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545681#comment-14545681 ] Apache Spark commented on SPARK-7669: - User 'steveloughran' has created a pull request

[jira] [Assigned] (SPARK-7669) Builds against Hadoop 2.6+ get inconsistent curator dependencies

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7669: --- Assignee: (was: Apache Spark) Builds against Hadoop 2.6+ get inconsistent curator

[jira] [Assigned] (SPARK-7669) Builds against Hadoop 2.6+ get inconsistent curator dependencies

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7669: --- Assignee: Apache Spark Builds against Hadoop 2.6+ get inconsistent curator dependencies

[jira] [Updated] (SPARK-5225) Support coalesed Input Metrics from different sources

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5225: - Issue Type: Improvement (was: Bug) Support coalesed Input Metrics from different sources

[jira] [Assigned] (SPARK-5175) bug in updating counters when starting multiple workers/supervisors in actor-based receiver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5175: --- Assignee: Apache Spark bug in updating counters when starting multiple workers/supervisors

[jira] [Resolved] (SPARK-5001) BlockRDD removed unreasonablly in streaming

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5001. -- Resolution: Not A Problem BlockRDD removed unreasonablly in streaming

[jira] [Updated] (SPARK-5174) Missing Document for starting multiple workers/supervisors in actor-based receiver

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5174: - Priority: Minor (was: Major) Missing Document for starting multiple workers/supervisors in actor-based

[jira] [Assigned] (SPARK-5174) Missing Document for starting multiple workers/supervisors in actor-based receiver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-5174: --- Assignee: Apache Spark Missing Document for starting multiple workers/supervisors in

[jira] [Assigned] (SPARK-4556) binary distribution assembly can't run in local mode

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4556: --- Assignee: (was: Apache Spark) binary distribution assembly can't run in local mode

[jira] [Resolved] (SPARK-4539) History Server counts incomplete applications against the retainedApplications total, fails to show eligible completed applications

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4539. -- Resolution: Not A Problem History Server counts incomplete applications against the

[jira] [Resolved] (SPARK-4395) Running a Spark SQL SELECT command from PySpark causes a hang for ~ 1 hour

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-4395. -- Resolution: Cannot Reproduce Running a Spark SQL SELECT command from PySpark causes a hang for ~ 1

[jira] [Resolved] (SPARK-2769) Ganglia Support Broken / Not working

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-2769. -- Resolution: Cannot Reproduce Ganglia Support Broken / Not working

[jira] [Updated] (SPARK-1715) Ensure actor is self-contained in DAGScheduler

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-1715: - Issue Type: Improvement (was: Bug) Ensure actor is self-contained in DAGScheduler

[jira] [Resolved] (SPARK-1848) Executors are mysteriously dying when using Spark on Mesos

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-1848. -- Resolution: Cannot Reproduce I think this is at least stale at this point. Executors are mysteriously

[jira] [Commented] (SPARK-7669) Builds against Hadoop 2.6+ get inconsistent curator dependencies

2015-05-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545643#comment-14545643 ] Sean Owen commented on SPARK-7669: -- Yeah I'm familiar with this flavor of problem, and

[jira] [Assigned] (SPARK-6802) User Defined Aggregate Function Refactoring

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6802: --- Assignee: (was: Apache Spark) User Defined Aggregate Function Refactoring

[jira] [Commented] (SPARK-6802) User Defined Aggregate Function Refactoring

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545654#comment-14545654 ] Apache Spark commented on SPARK-6802: - User 'hqzizania' has created a pull request for

[jira] [Assigned] (SPARK-6802) User Defined Aggregate Function Refactoring

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6802: --- Assignee: Apache Spark User Defined Aggregate Function Refactoring

[jira] [Updated] (SPARK-7677) Enable Kafka In Scala 2.11 Build

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7677: --- Description: Now that we upgraded Kafka in SPARK-2808 we can enable it in the Scala 2.11

[jira] [Updated] (SPARK-7511) PySpark ML seed Param should be varied per class

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7511: - Summary: PySpark ML seed Param should be varied per class (was: PySpark ML seed Param

[jira] [Updated] (SPARK-7651) PySpark GMM predict, predictSoft should fail on bad input

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7651: - Fix Version/s: 1.3.2 PySpark GMM predict, predictSoft should fail on bad input

[jira] [Updated] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7672: --- Priority: Critical (was: Major) Number format exception with spark.kryoserializer.buffer.mb

[jira] [Updated] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7672: --- Component/s: Spark Core Number format exception with spark.kryoserializer.buffer.mb

[jira] [Updated] (SPARK-7284) Update streaming documentation for Spark 1.4.0 release

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7284: --- Priority: Critical (was: Blocker) Update streaming documentation for Spark 1.4.0 release

[jira] [Commented] (SPARK-6820) Convert NAs to null type in SparkR DataFrames

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546451#comment-14546451 ] Apache Spark commented on SPARK-6820: - User 'hqzizania' has created a pull request for

[jira] [Assigned] (SPARK-6820) Convert NAs to null type in SparkR DataFrames

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6820: --- Assignee: Apache Spark (was: Qian Huang) Convert NAs to null type in SparkR DataFrames

[jira] [Assigned] (SPARK-6820) Convert NAs to null type in SparkR DataFrames

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6820: --- Assignee: Qian Huang (was: Apache Spark) Convert NAs to null type in SparkR DataFrames

[jira] [Assigned] (SPARK-7073) Clean up Python data type hierarchy

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7073: --- Assignee: Apache Spark (was: Davies Liu) Clean up Python data type hierarchy

[jira] [Assigned] (SPARK-7073) Clean up Python data type hierarchy

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7073: --- Assignee: Davies Liu (was: Apache Spark) Clean up Python data type hierarchy

[jira] [Commented] (SPARK-7073) Clean up Python data type hierarchy

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546450#comment-14546450 ] Apache Spark commented on SPARK-7073: - User 'davies' has created a pull request for

[jira] [Assigned] (SPARK-7543) Break dataframe.py into multiple files

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7543: --- Assignee: Davies Liu (was: Apache Spark) Break dataframe.py into multiple files

[jira] [Assigned] (SPARK-7543) Break dataframe.py into multiple files

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7543: --- Assignee: Apache Spark (was: Davies Liu) Break dataframe.py into multiple files

[jira] [Created] (SPARK-7677) Enable Kafka In Scala 2.11 Build

2015-05-15 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-7677: -- Summary: Enable Kafka In Scala 2.11 Build Key: SPARK-7677 URL: https://issues.apache.org/jira/browse/SPARK-7677 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-7543) Break dataframe.py into multiple files

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546228#comment-14546228 ] Apache Spark commented on SPARK-7543: - User 'davies' has created a pull request for

[jira] [Resolved] (SPARK-7556) User guide update for feature transformer: Binarizer

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7556. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6116

[jira] [Commented] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546326#comment-14546326 ] Apache Spark commented on SPARK-7621: - User 'jerluc' has created a pull request for

[jira] [Assigned] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7621: --- Assignee: Apache Spark Report KafkaReceiver MessageHandler errors so StreamingListeners can

[jira] [Assigned] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7621: --- Assignee: (was: Apache Spark) Report KafkaReceiver MessageHandler errors so

[jira] [Created] (SPARK-7679) Update AWS SDK and KCL versions to 1.2.1

2015-05-15 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7679: Summary: Update AWS SDK and KCL versions to 1.2.1 Key: SPARK-7679 URL: https://issues.apache.org/jira/browse/SPARK-7679 Project: Spark Issue Type:

[jira] [Updated] (SPARK-6811) Building binary R packages for SparkR

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6811: --- Assignee: Shivaram Venkataraman Building binary R packages for SparkR

[jira] [Updated] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7563: --- Fix Version/s: 1.4.0 OutputCommitCoordinator.stop() should only be executed in driver

[jira] [Updated] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7563: --- Target Version/s: 1.3.2, 1.4.0 OutputCommitCoordinator.stop() should only be executed in

[jira] [Commented] (SPARK-7661) Support for dynamic allocation of executors in Kinesis Spark Streaming

2015-05-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546216#comment-14546216 ] Tathagata Das commented on SPARK-7661: -- What do you mean by the currently the logic

[jira] [Updated] (SPARK-7676) Cleanup unnecessary code in the stage timeline view

2015-05-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-7676: -- Description: SPARK-7296 added a per-stage visualization to the UI. There's some unneeded code

[jira] [Updated] (SPARK-7676) Cleanup unnecessary code and fix small bug in the stage timeline view

2015-05-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-7676: -- Summary: Cleanup unnecessary code and fix small bug in the stage timeline view (was: Cleanup

[jira] [Updated] (SPARK-7511) PySpark ML seed Param should be varied per class

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7511: - Description: Currently, Scala's HasSeed mix-in uses a random Long as the default value

[jira] [Resolved] (SPARK-7677) Enable Kafka In Scala 2.11 Build

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7677. Resolution: Fixed Fix Version/s: 1.4.0 Fixed by pull request:

[jira] [Created] (SPARK-7678) Scala ML seed Param should vary per class

2015-05-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7678: Summary: Scala ML seed Param should vary per class Key: SPARK-7678 URL: https://issues.apache.org/jira/browse/SPARK-7678 Project: Spark Issue Type:

[jira] [Updated] (SPARK-7678) Scala ML seed Param should be fixed but vary per class

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7678: - Summary: Scala ML seed Param should be fixed but vary per class (was: Scala ML seed

[jira] [Commented] (SPARK-6902) Row() object can be mutated even though it should be immutable

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546274#comment-14546274 ] Davies Liu commented on SPARK-6902: --- [~jarfa] Python is a dynamic language, it's not

[jira] [Assigned] (SPARK-7679) Update AWS SDK and KCL versions to 1.2.1

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7679: --- Assignee: Tathagata Das (was: Apache Spark) Update AWS SDK and KCL versions to 1.2.1

[jira] [Commented] (SPARK-7679) Update AWS SDK and KCL versions to 1.2.1

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546341#comment-14546341 ] Apache Spark commented on SPARK-7679: - User 'tdas' has created a pull request for this

[jira] [Assigned] (SPARK-7679) Update AWS SDK and KCL versions to 1.2.1

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7679: --- Assignee: Apache Spark (was: Tathagata Das) Update AWS SDK and KCL versions to 1.2.1

[jira] [Updated] (SPARK-7355) FlakyTest - o.a.s.DriverSuite

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7355: --- Priority: Critical (was: Blocker) FlakyTest - o.a.s.DriverSuite

[jira] [Resolved] (SPARK-7676) Cleanup unnecessary code and fix small bug in the stage timeline view

2015-05-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-7676. --- Resolution: Fixed Fix Version/s: 1.4.0 Cleanup unnecessary code and fix small bug in

[jira] [Commented] (SPARK-6411) PySpark DataFrames can't be created if any datetimes have timezones

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546463#comment-14546463 ] Davies Liu commented on SPARK-6411: --- Since TimestampType in Spark SQL does not support

[jira] [Commented] (SPARK-6216) Check Python version in worker before run PySpark job

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546321#comment-14546321 ] Apache Spark commented on SPARK-6216: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-7644) Ensure all scoped RDD operations are tested and cleaned

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7644: --- Priority: Critical (was: Blocker) Ensure all scoped RDD operations are tested and cleaned

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546394#comment-14546394 ] Patrick Wendell commented on SPARK-2883: Since this is a feature I'm going to drop

[jira] [Updated] (SPARK-2883) Spark Support for ORCFile format

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-2883: --- Priority: Critical (was: Blocker) Spark Support for ORCFile format

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546426#comment-14546426 ] Apache Spark commented on SPARK-6980: - User 'BryanCutler' has created a pull request

[jira] [Commented] (SPARK-6289) PySpark doesn't maintain SQL date Types

2015-05-15 Thread Michael Nazario (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546425#comment-14546425 ] Michael Nazario commented on SPARK-6289: This does work for me, but it seems odd

[jira] [Updated] (SPARK-7676) Cleanup unnecessary code and fix small bug in the stage timeline view

2015-05-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-7676: -- Component/s: Web UI Cleanup unnecessary code and fix small bug in the stage timeline view

[jira] [Commented] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546468#comment-14546468 ] Patrick Wendell commented on SPARK-7563: I pulled the fix into 1.4.0, but not yet

[jira] [Assigned] (SPARK-7676) Cleanup unnecessary code and fix small bug in the stage timeline view

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7676: --- Assignee: Kay Ousterhout (was: Apache Spark) Cleanup unnecessary code and fix small bug in

[jira] [Commented] (SPARK-7676) Cleanup unnecessary code and fix small bug in the stage timeline view

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546239#comment-14546239 ] Apache Spark commented on SPARK-7676: - User 'kayousterhout' has created a pull request

[jira] [Updated] (SPARK-7511) PySpark ML seed Param should be varied per class

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7511: - Description: Currently, Scala's HasSeed mix-in uses a random Long as the default value

[jira] [Assigned] (SPARK-7676) Cleanup unnecessary code and fix small bug in the stage timeline view

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7676: --- Assignee: Apache Spark (was: Kay Ousterhout) Cleanup unnecessary code and fix small bug in

[jira] [Updated] (SPARK-7511) PySpark ML seed Param should be varied per class

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7511: - Description: Currently, Scala's HasSeed mix-in uses a random Long as the default value

[jira] [Commented] (SPARK-6902) Row() object can be mutated even though it should be immutable

2015-05-15 Thread Jonathan Arfa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546417#comment-14546417 ] Jonathan Arfa commented on SPARK-6902: -- [~davies] it works for me simply because I

[jira] [Comment Edited] (SPARK-6411) PySpark DataFrames can't be created if any datetimes have timezones

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546463#comment-14546463 ] Davies Liu edited comment on SPARK-6411 at 5/16/15 1:02 AM:

[jira] [Assigned] (SPARK-7652) Performance regression in naive Bayes prediction

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7652: --- Assignee: (was: Apache Spark) Performance regression in naive Bayes prediction

[jira] [Commented] (SPARK-7652) Performance regression in naive Bayes prediction

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545611#comment-14545611 ] Apache Spark commented on SPARK-7652: - User 'viirya' has created a pull request for

[jira] [Commented] (SPARK-5962) [MLLIB] Python support for Power Iteration Clustering

2015-05-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545591#comment-14545591 ] Yanbo Liang commented on SPARK-5962: [~javadba] Are you still work on it? If you are

[jira] [Assigned] (SPARK-7668) Matrix.map should preserve transpose property

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7668: --- Assignee: (was: Apache Spark) Matrix.map should preserve transpose property

[jira] [Commented] (SPARK-7668) Matrix.map should preserve transpose property

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545584#comment-14545584 ] Apache Spark commented on SPARK-7668: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7668) Matrix.map should preserve transpose property

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7668: --- Assignee: Apache Spark Matrix.map should preserve transpose property

[jira] [Created] (SPARK-7669) Builds against Hadoop 2.6+ get inconsistent curator dependencies

2015-05-15 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-7669: - Summary: Builds against Hadoop 2.6+ get inconsistent curator dependencies Key: SPARK-7669 URL: https://issues.apache.org/jira/browse/SPARK-7669 Project: Spark

[jira] [Created] (SPARK-7668) Matrix.map should preserve transpose property

2015-05-15 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-7668: -- Summary: Matrix.map should preserve transpose property Key: SPARK-7668 URL: https://issues.apache.org/jira/browse/SPARK-7668 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-7652) Performance regression in naive Bayes prediction

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7652: --- Assignee: Apache Spark Performance regression in naive Bayes prediction

[jira] [Resolved] (SPARK-7543) Break dataframe.py into multiple files

2015-05-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7543. Resolution: Fixed Fix Version/s: 1.4.0 Break dataframe.py into multiple files

[jira] [Commented] (SPARK-7473) Use reservoir sample in RandomForest when choosing features per node

2015-05-15 Thread Ai He (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546546#comment-14546546 ] Ai He commented on SPARK-7473: -- Hi Joseph, it's AiHe. Thank you for reviewing and merging

[jira] [Issue Comment Deleted] (SPARK-7473) Use reservoir sample in RandomForest when choosing features per node

2015-05-15 Thread Ai He (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ai He updated SPARK-7473: - Comment: was deleted (was: Hi Joseph, it's AiHe. Thank you for reviewing and merging this PR. ) Use reservoir

[jira] [Commented] (SPARK-7473) Use reservoir sample in RandomForest when choosing features per node

2015-05-15 Thread Ai He (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546550#comment-14546550 ] Ai He commented on SPARK-7473: -- Hi Joseph, it's AiHe. Thank you for reviewing and merging

[jira] [Assigned] (SPARK-7681) Add SparseVector support for gemv with DenseMatrix

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7681: --- Assignee: Apache Spark Add SparseVector support for gemv with DenseMatrix

[jira] [Commented] (SPARK-7681) Add SparseVector support for gemv with DenseMatrix

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546561#comment-14546561 ] Apache Spark commented on SPARK-7681: - User 'viirya' has created a pull request for

[jira] [Assigned] (SPARK-7681) Add SparseVector support for gemv with DenseMatrix

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7681: --- Assignee: (was: Apache Spark) Add SparseVector support for gemv with DenseMatrix

<    1   2   3   4   >