[jira] [Updated] (SPARK-7654) DataFrameReader and DataFrameWriter for input/output API

2015-05-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7654: --- Priority: Blocker (was: Major) DataFrameReader and DataFrameWriter for input/output API

[jira] [Assigned] (SPARK-6964) Support Cancellation in the Thrift Server

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6964: --- Assignee: (was: Apache Spark) Support Cancellation in the Thrift Server

[jira] [Assigned] (SPARK-6964) Support Cancellation in the Thrift Server

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6964: --- Assignee: Apache Spark Support Cancellation in the Thrift Server

[jira] [Commented] (SPARK-6964) Support Cancellation in the Thrift Server

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546518#comment-14546518 ] Apache Spark commented on SPARK-6964: - User 'dongwang218' has created a pull request

[jira] [Updated] (SPARK-7673) DataSourceStrategy's buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-7673: -- Summary: DataSourceStrategy's buildPartitionedTableScan always list list file status for all data files

[jira] [Updated] (SPARK-7473) Use reservoir sample in RandomForest when choosing features per node

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7473: - Assignee: Ai He Use reservoir sample in RandomForest when choosing features per node

[jira] [Resolved] (SPARK-7575) Example code for OneVsRest

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7575. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 6115

[jira] [Resolved] (SPARK-7473) Use reservoir sample in RandomForest when choosing features per node

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7473. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5988

[jira] [Commented] (SPARK-6649) DataFrame created through SQLContext.jdbc() failed if columns table must be quoted

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546552#comment-14546552 ] Apache Spark commented on SPARK-6649: - User 'frreiss' has created a pull request for

[jira] [Assigned] (SPARK-6649) DataFrame created through SQLContext.jdbc() failed if columns table must be quoted

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6649: --- Assignee: (was: Apache Spark) DataFrame created through SQLContext.jdbc() failed if

[jira] [Assigned] (SPARK-6649) DataFrame created through SQLContext.jdbc() failed if columns table must be quoted

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6649: --- Assignee: Apache Spark DataFrame created through SQLContext.jdbc() failed if columns table

[jira] [Created] (SPARK-7681) Add SparseVector support for gemv with DenseMatrix

2015-05-15 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-7681: -- Summary: Add SparseVector support for gemv with DenseMatrix Key: SPARK-7681 URL: https://issues.apache.org/jira/browse/SPARK-7681 Project: Spark Issue

[jira] [Updated] (SPARK-7673) DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7673: Fix Version/s: (was: 1.4.0) DataSourceStrategy''s buildPartitionedTableScan always list list file

[jira] [Updated] (SPARK-7673) DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7673: Target Version/s: 1.4.0 DataSourceStrategy''s buildPartitionedTableScan always list list file status for

[jira] [Updated] (SPARK-7673) DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-7673: Assignee: Cheng Lian DataSourceStrategy''s buildPartitionedTableScan always list list file status for

[jira] [Created] (SPARK-7680) Add a fake Receiver that generates random strings, useful for prototyping

2015-05-15 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-7680: Summary: Add a fake Receiver that generates random strings, useful for prototyping Key: SPARK-7680 URL: https://issues.apache.org/jira/browse/SPARK-7680 Project:

[jira] [Resolved] (SPARK-7073) Clean up Python data type hierarchy

2015-05-15 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7073. Resolution: Fixed Fix Version/s: 1.4.0 Clean up Python data type hierarchy

[jira] [Updated] (SPARK-7621) Report KafkaReceiver MessageHandler errors so StreamingListeners can take action

2015-05-15 Thread Jeremy A. Lucas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremy A. Lucas updated SPARK-7621: --- Fix Version/s: (was: 1.3.1) Report KafkaReceiver MessageHandler errors so

[jira] [Assigned] (SPARK-7491) Handle drivers for Metastore JDBC

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7491: --- Assignee: Apache Spark Handle drivers for Metastore JDBC -

[jira] [Updated] (SPARK-5947) First class partitioning support in data sources API

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5947: Fix Version/s: 1.4.0 First class partitioning support in data sources API

[jira] [Updated] (SPARK-5180) Data source API improvement

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5180: Fix Version/s: 1.4.0 Data source API improvement ---

[jira] [Created] (SPARK-7673) DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Yin Huai (JIRA)
Yin Huai created SPARK-7673: --- Summary: DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files Key: SPARK-7673 URL: https://issues.apache.org/jira/browse/SPARK-7673

[jira] [Updated] (SPARK-5948) Support writing to partitioned table for the Parquet data source

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5948: Fix Version/s: 1.4.0 Support writing to partitioned table for the Parquet data source

[jira] [Comment Edited] (SPARK-7673) DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546064#comment-14546064 ] Yin Huai edited comment on SPARK-7673 at 5/15/15 7:39 PM: -- This

[jira] [Commented] (SPARK-7673) DataSourceStrategy''s buildPartitionedTableScan always list list file status for all data files

2015-05-15 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546064#comment-14546064 ] Yin Huai commented on SPARK-7673: - This cause pretty significant performance regression

[jira] [Resolved] (SPARK-5920) Use a BufferedInputStream to read local shuffle data

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5920. Resolution: Won't Fix Per the discussion on this PR I am resolving this as won't fix.

[jira] [Updated] (SPARK-7532) Make StreamingContext.start() idempotent

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7532: --- Fix Version/s: 1.4.0 Make StreamingContext.start() idempotent

[jira] [Updated] (SPARK-7228) SparkR public API for 1.4 release

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-7228: --- Fix Version/s: 1.4.0 SparkR public API for 1.4 release -

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-15 Thread Benjamin Herta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546089#comment-14546089 ] Benjamin Herta commented on SPARK-4105: --- I haven't had a chance to test the patch

[jira] [Assigned] (SPARK-7549) Support aggregating over nested fields

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7549: --- Assignee: Apache Spark Support aggregating over nested fields

[jira] [Assigned] (SPARK-6126) Support UDTs in JSON

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6126: --- Assignee: (was: Apache Spark) Support UDTs in JSON

[jira] [Commented] (SPARK-6126) Support UDTs in JSON

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545761#comment-14545761 ] Apache Spark commented on SPARK-6126: - User 'drubbo' has created a pull request for

[jira] [Assigned] (SPARK-6126) Support UDTs in JSON

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-6126: --- Assignee: Apache Spark Support UDTs in JSON Key:

[jira] [Resolved] (SPARK-7668) Matrix.map should preserve transpose property

2015-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7668. -- Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 Issue resolved by pull

[jira] [Comment Edited] (SPARK-7670) Failure when building with scala 2.11 (after 1.3.1

2015-05-15 Thread Fernando Ruben Otero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545904#comment-14545904 ] Fernando Ruben Otero edited comment on SPARK-7670 at 5/15/15 6:09 PM:

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545907#comment-14545907 ] Josh Rosen commented on SPARK-4105: --- For FAILED_TO_UNCOMPRESS(5), here's a reproduction

[jira] [Updated] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-05-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7233: - Fix Version/s: 1.4.0 ClosureCleaner#clean blocks concurrent job submitter threads

[jira] [Closed] (SPARK-7233) ClosureCleaner#clean blocks concurrent job submitter threads

2015-05-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7233. Resolution: Fixed Assignee: Oleksii Kostyliev ClosureCleaner#clean blocks concurrent job submitter

[jira] [Commented] (SPARK-7655) Akka timeout exception

2015-05-15 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545936#comment-14545936 ] Shixiong Zhu commented on SPARK-7655: - Found the following stack track that a thread

[jira] [Commented] (SPARK-7655) Akka timeout exception

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545939#comment-14545939 ] Apache Spark commented on SPARK-7655: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-7655) Akka timeout exception

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7655: --- Assignee: Shixiong Zhu (was: Apache Spark) Akka timeout exception --

[jira] [Assigned] (SPARK-7655) Akka timeout exception

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7655: --- Assignee: Apache Spark (was: Shixiong Zhu) Akka timeout exception --

[jira] [Commented] (SPARK-2883) Spark Support for ORCFile format

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545802#comment-14545802 ] Apache Spark commented on SPARK-2883: - User 'liancheng' has created a pull request for

[jira] [Closed] (SPARK-5412) Cannot bind Master to a specific hostname as per the documentation

2015-05-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-5412. Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 1.2.3

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546091#comment-14546091 ] Josh Rosen commented on SPARK-4105: --- Check out my patch at

[jira] [Commented] (SPARK-5962) [MLLIB] Python support for Power Iteration Clustering

2015-05-15 Thread Stephen Boesch (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546099#comment-14546099 ] Stephen Boesch commented on SPARK-5962: --- Yes I had some other tasks jump ahead but

[jira] [Updated] (SPARK-6806) SparkR examples in programming guide

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-6806: -- Priority: Critical (was: Blocker) SparkR examples in programming guide

[jira] [Updated] (SPARK-7671) Fix wrong URLs in MLlib Data Types Documentation

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7671: - Component/s: MLlib Fix wrong URLs in MLlib Data Types Documentation

[jira] [Updated] (SPARK-6917) Broken data returned to PySpark dataframe if any large numbers used in Scala land

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-6917: -- Priority: Critical (was: Major) Broken data returned to PySpark dataframe if any large numbers used

[jira] [Updated] (SPARK-6917) Broken data returned to PySpark dataframe if any large numbers used in Scala land

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-6917: -- Assignee: Yin Huai (was: Davies Liu) Broken data returned to PySpark dataframe if any large numbers

[jira] [Commented] (SPARK-4105) FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle

2015-05-15 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546090#comment-14546090 ] Josh Rosen commented on SPARK-4105: --- Check out my patch at

[jira] [Commented] (SPARK-7344) Spark hangs reading and writing to the same S3 bucket

2015-05-15 Thread Daniel Mahler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546093#comment-14546093 ] Daniel Mahler commented on SPARK-7344: -- The problem occurs even when I use `spark-ec2

[jira] [Created] (SPARK-7674) R-like stats for ML models

2015-05-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7674: Summary: R-like stats for ML models Key: SPARK-7674 URL: https://issues.apache.org/jira/browse/SPARK-7674 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-7675) PySpark spark.ml Params type conversions

2015-05-15 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7675: Summary: PySpark spark.ml Params type conversions Key: SPARK-7675 URL: https://issues.apache.org/jira/browse/SPARK-7675 Project: Spark Issue Type:

[jira] [Commented] (SPARK-6917) Broken data returned to PySpark dataframe if any large numbers used in Scala land

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546160#comment-14546160 ] Davies Liu commented on SPARK-6917: --- [~yhuai] It's a bug in SQL or Parquet library:

[jira] [Comment Edited] (SPARK-6917) Broken data returned to PySpark dataframe if any large numbers used in Scala land

2015-05-15 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546160#comment-14546160 ] Davies Liu edited comment on SPARK-6917 at 5/15/15 8:58 PM:

[jira] [Commented] (SPARK-7080) Binary processing based aggregate operator

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546163#comment-14546163 ] Michael Armbrust commented on SPARK-7080: - That sounds like a good idea to me.

[jira] [Resolved] (SPARK-7296) Timeline view for Stage page

2015-05-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-7296. --- Resolution: Fixed Fix Version/s: 1.4.0 Target Version/s: 1.4.0 (was: 1.4.0,

[jira] [Created] (SPARK-7671) Fix wrong URLs in MLlib Data Types Documentation

2015-05-15 Thread JIRA
Favio Vázquez created SPARK-7671: Summary: Fix wrong URLs in MLlib Data Types Documentation Key: SPARK-7671 URL: https://issues.apache.org/jira/browse/SPARK-7671 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-7671) Fix wrong URLs in MLlib Data Types Documentation

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7671: --- Assignee: (was: Apache Spark) Fix wrong URLs in MLlib Data Types Documentation

[jira] [Commented] (SPARK-7671) Fix wrong URLs in MLlib Data Types Documentation

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7671?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545960#comment-14545960 ] Apache Spark commented on SPARK-7671: - User 'FavioVazquez' has created a pull request

[jira] [Closed] (SPARK-7664) DAG visualization: Fix incorrect link paths of DAG.

2015-05-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7664. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Kousuke Saruta DAG visualization: Fix

[jira] [Commented] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546006#comment-14546006 ] Apache Spark commented on SPARK-7563: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7563: --- Assignee: Apache Spark OutputCommitCoordinator.stop() should only be executed in driver

[jira] [Commented] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Nishkam Ravi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546020#comment-14546020 ] Nishkam Ravi commented on SPARK-7672: - In translating deprecated

[jira] [Updated] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7563: - Priority: Critical (was: Major) OutputCommitCoordinator.stop() should only be executed in driver

[jira] [Assigned] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7672: --- Assignee: Apache Spark Number format exception with spark.kryoserializer.buffer.mb

[jira] [Assigned] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7672: --- Assignee: (was: Apache Spark) Number format exception with

[jira] [Commented] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546025#comment-14546025 ] Apache Spark commented on SPARK-7672: - User 'nishkamravi2' has created a pull request

[jira] [Updated] (SPARK-5632) not able to resolve dot('.') in field name

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5632: --- Fix Version/s: 1.4.0 not able to resolve dot('.') in field name

[jira] [Commented] (SPARK-7636) Significant performance regression with GradientDescent in 1.4

2015-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546034#comment-14546034 ] Xiangrui Meng commented on SPARK-7636: -- False alarm. I was comparing a 16-node

[jira] [Closed] (SPARK-7636) Significant performance regression with GradientDescent in 1.4

2015-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-7636. Resolution: Not A Problem Significant performance regression with GradientDescent in 1.4

[jira] [Updated] (SPARK-5517) Add input types for Java UDFs

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5517: Target Version/s: 1.5.0 (was: 1.4.0) Add input types for Java UDFs

[jira] [Resolved] (SPARK-5947) First class partitioning support in data sources API

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5947. - Resolution: Fixed First class partitioning support in data sources API

[jira] [Updated] (SPARK-6831) Document how to use external data sources

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6831: Priority: Critical (was: Blocker) Document how to use external data sources

[jira] [Resolved] (SPARK-5948) Support writing to partitioned table for the Parquet data source

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5948. - Resolution: Fixed Support writing to partitioned table for the Parquet data source

[jira] [Updated] (SPARK-5707) Enabling spark.sql.codegen throws ClassNotFound exception

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5707: Target Version/s: 1.5.0 (was: 1.4.0) Enabling spark.sql.codegen throws ClassNotFound

[jira] [Updated] (SPARK-6784) Clean up all the inbound/outbound conversions for DateType

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6784: Assignee: Adrian Wang (was: Yin Huai) Clean up all the inbound/outbound conversions for

[jira] [Assigned] (SPARK-7671) Fix wrong URLs in MLlib Data Types Documentation

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7671: --- Assignee: Apache Spark Fix wrong URLs in MLlib Data Types Documentation

[jira] [Assigned] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7563: --- Assignee: (was: Apache Spark) OutputCommitCoordinator.stop() should only be executed in

[jira] [Created] (SPARK-7672) Number format exception with spark.kryoserializer.buffer.mb

2015-05-15 Thread Nishkam Ravi (JIRA)
Nishkam Ravi created SPARK-7672: --- Summary: Number format exception with spark.kryoserializer.buffer.mb Key: SPARK-7672 URL: https://issues.apache.org/jira/browse/SPARK-7672 Project: Spark

[jira] [Closed] (SPARK-7504) NullPointerException when initializing SparkContext in YARN-cluster mode

2015-05-15 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-7504. Resolution: Fixed Fix Version/s: 1.4.0 Target Version/s: 1.4.0 NullPointerException when

[jira] [Updated] (SPARK-6595) DataFrame self joins with MetastoreRelations fail

2015-05-15 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-6595: --- Fix Version/s: 1.4.0 1.3.2 DataFrame self joins with MetastoreRelations

[jira] [Updated] (SPARK-5463) Fix Parquet filter push-down

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5463: Priority: Critical (was: Blocker) Fix Parquet filter push-down

[jira] [Updated] (SPARK-5463) Fix Parquet filter push-down

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5463: Target Version/s: 1.5.0 (was: 1.4.0) Fix Parquet filter push-down

[jira] [Updated] (SPARK-4867) UDF clean up

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4867: Target Version/s: 1.5.0 (was: 1.4.0) UDF clean up Key:

[jira] [Resolved] (SPARK-5180) Data source API improvement

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5180. - Resolution: Fixed Assignee: Cheng Lian Data source API improvement

[jira] [Updated] (SPARK-6906) Refactor Connection to Hive Metastore

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6906: Target Version/s: 1.5.0 (was: 1.4.0) Refactor Connection to Hive Metastore

[jira] [Updated] (SPARK-2873) Support disk spilling in Spark SQL aggregation

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2873: Target Version/s: 1.5.0 (was: 1.4.0) Support disk spilling in Spark SQL aggregation

[jira] [Assigned] (SPARK-7491) Handle drivers for Metastore JDBC

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7491: --- Assignee: (was: Apache Spark) Handle drivers for Metastore JDBC

[jira] [Commented] (SPARK-7491) Handle drivers for Metastore JDBC

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14546047#comment-14546047 ] Apache Spark commented on SPARK-7491: - User 'marmbrus' has created a pull request for

[jira] [Updated] (SPARK-2973) Use LocalRelation for all ExecutedCommands, avoid job for take/collect()

2015-05-15 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2973: Priority: Critical (was: Blocker) Use LocalRelation for all ExecutedCommands, avoid job

[jira] [Resolved] (SPARK-7651) PySpark GMM predict, predictSoft should fail on bad input

2015-05-15 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7651. -- Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Meethu Mathew PySpark

[jira] [Commented] (SPARK-6126) Support UDTs in JSON

2015-05-15 Thread Emiliano Leporati (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545752#comment-14545752 ] Emiliano Leporati commented on SPARK-6126: -- With a simple 1-liner path to

[jira] [Assigned] (SPARK-7549) Support aggregating over nested fields

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7549: --- Assignee: (was: Apache Spark) Support aggregating over nested fields

[jira] [Commented] (SPARK-7549) Support aggregating over nested fields

2015-05-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545756#comment-14545756 ] Apache Spark commented on SPARK-7549: - User 'kaka1992' has created a pull request for

[jira] [Updated] (SPARK-7668) Matrix.map should preserve transpose property

2015-05-15 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7668: - Target Version/s: 1.3.2, 1.4.0 Affects Version/s: 1.4.0 1.3.1

[jira] [Created] (SPARK-7670) Failure when building with scala 2.11 (after 1.3.1

2015-05-15 Thread Fernando Ruben Otero (JIRA)
Fernando Ruben Otero created SPARK-7670: --- Summary: Failure when building with scala 2.11 (after 1.3.1 Key: SPARK-7670 URL: https://issues.apache.org/jira/browse/SPARK-7670 Project: Spark

[jira] [Commented] (SPARK-7670) Failure when building with scala 2.11 (after 1.3.1

2015-05-15 Thread Fernando Ruben Otero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14545904#comment-14545904 ] Fernando Ruben Otero commented on SPARK-7670: - BTW: 1.3.1 build without issues

[jira] [Resolved] (SPARK-6438) Indicate which tasks ran on which executors in per-stage visualization in UI

2015-05-15 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout resolved SPARK-6438. --- Resolution: Fixed Indicate which tasks ran on which executors in per-stage visualization in

<    1   2   3   4   >