[jira] [Updated] (SPARK-8552) Using incorrect database in multiple sessions

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8552: Target Version/s: 1.6.0 (was: 1.5.0) Using incorrect database in multiple sessions

[jira] [Updated] (SPARK-3864) Specialize join for tables with unique integer keys

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3864: Target Version/s: 1.6.0 (was: 1.5.0) Specialize join for tables with unique integer keys

[jira] [Updated] (SPARK-7245) Spearman correlation for DataFrames

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7245: Target Version/s: 1.6.0 (was: 1.5.0) Spearman correlation for DataFrames

[jira] [Updated] (SPARK-6189) Pandas to DataFrame conversion should check field names for periods

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6189: Target Version/s: 1.6.0 (was: 1.5.0) Pandas to DataFrame conversion should check field

[jira] [Updated] (SPARK-6377) Set the number of shuffle partitions for Exchange operator automatically based on the size of input tables and the reduce-side operation.

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6377: Target Version/s: 1.6.0 (was: 1.5.0) Set the number of shuffle partitions for Exchange

[jira] [Updated] (SPARK-6489) Optimize lateral view with explode to not read unnecessary columns

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6489: Target Version/s: 1.6.0 (was: 1.5.0) Optimize lateral view with explode to not read

[jira] [Updated] (SPARK-6380) Resolution of equi-join key in post-join projection

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6380: Target Version/s: 1.6.0 (was: 1.5.0) Resolution of equi-join key in post-join projection

[jira] [Updated] (SPARK-9487) Use the same num. worker threads in Scala/Python unit tests

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9487: Target Version/s: 1.6.0 (was: 1.5.0) Use the same num. worker threads in Scala/Python

[jira] [Updated] (SPARK-9139) Add backwards-compatibility tests for DataType.fromJson()

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9139: Target Version/s: 1.6.0 (was: 1.5.0) Add backwards-compatibility tests for

[jira] [Updated] (SPARK-8682) Range Join for Spark SQL

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8682: Target Version/s: 1.6.0 (was: 1.5.0) Range Join for Spark SQL

[jira] [Updated] (SPARK-6740) SQL operator and condition precedence is not honoured

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6740: Target Version/s: 1.6.0 (was: 1.5.0) SQL operator and condition precedence is not

[jira] [Updated] (SPARK-6774) Implement Parquet complex types backwards-compatiblity rules

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6774: Target Version/s: 1.6.0 (was: 1.5.0) Implement Parquet complex types

[jira] [Updated] (SPARK-7903) PythonUDT shouldn't get serialized on the Scala side

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7903: Target Version/s: 1.6.0 (was: 1.5.0) PythonUDT shouldn't get serialized on the Scala side

[jira] [Updated] (SPARK-2973) Use LocalRelation for all ExecutedCommands, avoid job for take/collect()

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2973: Target Version/s: 1.6.0 (was: 1.5.0) Use LocalRelation for all ExecutedCommands, avoid

[jira] [Updated] (SPARK-6467) Override QueryPlan.missingInput when necessary and rely on it CheckAnalysis

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6467: Target Version/s: 1.6.0 (was: 1.5.0) Override QueryPlan.missingInput when necessary and

[jira] [Updated] (SPARK-9357) Remove JoinedRow

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9357: Target Version/s: 1.6.0 (was: 1.5.0) Remove JoinedRow

[jira] [Updated] (SPARK-8848) Write Parquet LISTs and MAPs conforming to Parquet format spec

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8848: Target Version/s: 1.6.0 (was: 1.5.0) Write Parquet LISTs and MAPs conforming to Parquet

[jira] [Updated] (SPARK-9298) corr aggregate functions

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9298: Target Version/s: 1.6.0 (was: 1.5.0) corr aggregate functions

[jira] [Updated] (SPARK-8786) Create a wrapper for BinaryType

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8786: Target Version/s: 1.6.0 (was: 1.5.0) Create a wrapper for BinaryType

[jira] [Updated] (SPARK-8328) Add a CheckAnalysis rule to ensure that Union branches have the same schema

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8328: Target Version/s: 1.6.0 (was: 1.4.2, 1.5.0) Add a CheckAnalysis rule to ensure that Union

[jira] [Updated] (SPARK-6548) stddev_pop and stddev_samp aggregate functions

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6548: Target Version/s: 1.6.0 (was: 1.5.0) stddev_pop and stddev_samp aggregate functions

[jira] [Updated] (SPARK-3860) Improve dimension joins

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3860: Target Version/s: 1.6.0 (was: 1.5.0) Improve dimension joins ---

[jira] [Updated] (SPARK-6819) Support nested types in SparkR DataFrame

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6819: Target Version/s: 1.6.0 (was: 1.5.0) Support nested types in SparkR DataFrame

[jira] [Updated] (SPARK-9296) variance, var_pop, and var_samp aggregate functions

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9296: Target Version/s: 1.6.0 (was: 1.5.0) variance, var_pop, and var_samp aggregate functions

[jira] [Updated] (SPARK-7549) Support aggregating over nested fields

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7549: Target Version/s: 1.6.0 (was: 1.5.0) Support aggregating over nested fields

[jira] [Updated] (SPARK-8448) ORC data source doesn't support column names with comma

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8448: Target Version/s: 1.6.0 (was: 1.5.0) ORC data source doesn't support column names with

[jira] [Updated] (SPARK-9271) Concurrency bug triggered by partition predicate push-down

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9271: Target Version/s: 1.6.0 (was: 1.5.0) Concurrency bug triggered by partition predicate

[jira] [Updated] (SPARK-9456) Remove InternalRow.toSeq

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9456: Target Version/s: 1.6.0 (was: 1.5.0) Remove InternalRow.toSeq

[jira] [Updated] (SPARK-6817) DataFrame UDFs in R

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-6817: Target Version/s: 1.6.0 (was: 1.5.0) DataFrame UDFs in R ---

[jira] [Updated] (SPARK-8745) Remove GenerateProjection

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8745: Target Version/s: 1.6.0 (was: 1.5.0) Remove GenerateProjection -

[jira] [Updated] (SPARK-8144) For PySpark SQL, automatically convert values provided in readwriter options to string

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8144: Target Version/s: 1.6.0 (was: 1.4.2, 1.5.0) For PySpark SQL, automatically convert values

[jira] [Updated] (SPARK-7712) Window Function Improvements

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-7712: Target Version/s: 1.6.0 (was: 1.5.0, 1.6.0) Window Function Improvements

[jira] [Created] (SPARK-9561) Enable BroadcastJoinSuite

2015-08-03 Thread Andrew Or (JIRA)
Andrew Or created SPARK-9561: Summary: Enable BroadcastJoinSuite Key: SPARK-9561 URL: https://issues.apache.org/jira/browse/SPARK-9561 Project: Spark Issue Type: Bug Components: SQL,

[jira] [Updated] (SPARK-8641) Native Spark Window Functions

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-8641: - Target Version/s: 1.6.0 (was: 1.5.0) Native Spark Window Functions

[jira] [Created] (SPARK-9562) Move spark-ec2 from mesos to amplab

2015-08-03 Thread Shivaram Venkataraman (JIRA)
Shivaram Venkataraman created SPARK-9562: Summary: Move spark-ec2 from mesos to amplab Key: SPARK-9562 URL: https://issues.apache.org/jira/browse/SPARK-9562 Project: Spark Issue

[jira] [Created] (SPARK-9563) Collapse repartition and exchange

2015-08-03 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-9563: - Summary: Collapse repartition and exchange Key: SPARK-9563 URL: https://issues.apache.org/jira/browse/SPARK-9563 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-9563) Remove repartition operators when they are the child of Exchange and shuffle=True

2015-08-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-9563: -- Summary: Remove repartition operators when they are the child of Exchange and shuffle=True (was:

[jira] [Resolved] (SPARK-1855) Provide memory-and-local-disk RDD checkpointing

2015-08-03 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-1855. -- Resolution: Fixed Fix Version/s: 1.5.0 Provide memory-and-local-disk RDD checkpointing

[jira] [Created] (SPARK-9564) Spark 1.5.0 Testing Plan

2015-08-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9564: -- Summary: Spark 1.5.0 Testing Plan Key: SPARK-9564 URL: https://issues.apache.org/jira/browse/SPARK-9564 Project: Spark Issue Type: Epic Components:

[jira] [Resolved] (SPARK-9528) RandomForestClassifier should extend ProbabilisticClassifier

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-9528. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7859

[jira] [Commented] (SPARK-3727) Trees and ensembles: More prediction functionality

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652107#comment-14652107 ] Joseph K. Bradley commented on SPARK-3727: -- variance of estimate: With

[jira] [Updated] (SPARK-7712) Window Function Improvements

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-7712: - Target Version/s: 1.5.0, 1.6.0 (was: 1.5.0) Window Function Improvements

[jira] [Assigned] (SPARK-9562) Move spark-ec2 from mesos to amplab

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9562: --- Assignee: (was: Apache Spark) Move spark-ec2 from mesos to amplab

[jira] [Commented] (SPARK-9562) Move spark-ec2 from mesos to amplab

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652143#comment-14652143 ] Apache Spark commented on SPARK-9562: - User 'shivaram' has created a pull request for

[jira] [Assigned] (SPARK-9562) Move spark-ec2 from mesos to amplab

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9562: --- Assignee: Apache Spark Move spark-ec2 from mesos to amplab

[jira] [Updated] (SPARK-9511) Table names starting with numbers no longer supported

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9511: Assignee: Joseph Batchik Table names starting with numbers no longer supported

[jira] [Created] (SPARK-9565) Spark SQL 1.5.0 testing umbrella

2015-08-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9565: -- Summary: Spark SQL 1.5.0 testing umbrella Key: SPARK-9565 URL: https://issues.apache.org/jira/browse/SPARK-9565 Project: Spark Issue Type: Test

[jira] [Resolved] (SPARK-9511) Table names starting with numbers no longer supported

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-9511. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7844

[jira] [Updated] (SPARK-9566) Spark 1.5.0 YARN testing umbrella

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9566: --- Issue Type: Umbrella (was: Test) Spark 1.5.0 YARN testing umbrella

[jira] [Created] (SPARK-9566) Spark 1.5.0 YARN testing umbrella

2015-08-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9566: -- Summary: Spark 1.5.0 YARN testing umbrella Key: SPARK-9566 URL: https://issues.apache.org/jira/browse/SPARK-9566 Project: Spark Issue Type: Test

[jira] [Commented] (SPARK-8333) Spark failed to delete temp directory created by HiveContext

2015-08-03 Thread sheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651767#comment-14651767 ] sheng commented on SPARK-8333: -- Hi Thota, I ran my code in a scala application, not in REPL.

[jira] [Created] (SPARK-9558) Update docs to follow the increase of memory defaults.

2015-08-03 Thread Kousuke Saruta (JIRA)
Kousuke Saruta created SPARK-9558: - Summary: Update docs to follow the increase of memory defaults. Key: SPARK-9558 URL: https://issues.apache.org/jira/browse/SPARK-9558 Project: Spark Issue

[jira] [Resolved] (SPARK-9518) Clean up GenerateUnsafeRowJoiner

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-9518. Resolution: Fixed Fix Version/s: 1.5.0 Clean up GenerateUnsafeRowJoiner

[jira] [Resolved] (SPARK-9551) add copyTo for UnsafeRow to reuse a copy buffer

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-9551. Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 1.5.0 add copyTo for

[jira] [Assigned] (SPARK-9558) Update docs to follow the increase of memory defaults.

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9558: --- Assignee: Apache Spark Update docs to follow the increase of memory defaults.

[jira] [Assigned] (SPARK-9558) Update docs to follow the increase of memory defaults.

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9558: --- Assignee: (was: Apache Spark) Update docs to follow the increase of memory defaults.

[jira] [Commented] (SPARK-9558) Update docs to follow the increase of memory defaults.

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651778#comment-14651778 ] Apache Spark commented on SPARK-9558: - User 'sarutak' has created a pull request for

[jira] [Commented] (SPARK-9555) Cannot use spark-csv in spark-shell

2015-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651802#comment-14651802 ] Sean Owen commented on SPARK-9555: -- This is the problem, not any particular library:

[jira] [Updated] (SPARK-7563) OutputCommitCoordinator.stop() should only be executed in driver

2015-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-7563: - Assignee: Josh Rosen Target Version/s: (was: 1.3.2) Labels: (was:

[jira] [Commented] (SPARK-9559) Worker redundancy/failover in spark stand-alone mode

2015-08-03 Thread partha bishnu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651882#comment-14651882 ] partha bishnu commented on SPARK-9559: -- Hi I am running some tests on spark in

[jira] [Updated] (SPARK-9499) Possible file handle leak in spilling/sort code

2015-08-03 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-9499: - Attachment: perf_test4.scala [~joshrosen], I have checked out the new master, and it

[jira] [Commented] (SPARK-4156) Add expectation maximization for Gaussian mixture models to MLLib clustering

2015-08-03 Thread Fabian Boehnlein (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651862#comment-14651862 ] Fabian Boehnlein commented on SPARK-4156: - is there a reason that a

[jira] [Commented] (SPARK-9559) Worker redundancy/failover in spark stand-alone mode

2015-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14651891#comment-14651891 ] Sean Owen commented on SPARK-9559: -- You should see 1 executor per worker. You lost an

[jira] [Created] (SPARK-9559) Worker redundancy/failover in spark stand-alone mode

2015-08-03 Thread partha bishnu (JIRA)
partha bishnu created SPARK-9559: Summary: Worker redundancy/failover in spark stand-alone mode Key: SPARK-9559 URL: https://issues.apache.org/jira/browse/SPARK-9559 Project: Spark Issue

[jira] [Updated] (SPARK-9482) flaky test: org.apache.spark.sql.hive.execution.HiveCompatibilitySuite.semijoin

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9482: Priority: Blocker (was: Critical) flaky test:

[jira] [Updated] (SPARK-9141) DataFrame recomputed instead of using cached parent.

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-9141: --- Assignee: Michael Armbrust DataFrame recomputed instead of using cached parent.

[jira] [Commented] (SPARK-9568) Spark MLlib 1.5.0 testing umbrella

2015-08-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652343#comment-14652343 ] Sean Owen commented on SPARK-9568: -- [~rxin] 6 of the subtasks for MLlib 1.4 QA plan

[jira] [Commented] (SPARK-8246) string function: get_json_object

2015-08-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652353#comment-14652353 ] Apache Spark commented on SPARK-8246: - User 'davies' has created a pull request for

[jira] [Updated] (SPARK-9228) Combine unsafe and codegen into a single option

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9228: Summary: Combine unsafe and codegen into a single option (was: Adjust Spark SQL Configs)

[jira] [Updated] (SPARK-9561) Enable BroadcastJoinSuite

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-9561: Shepherd: Michael Armbrust Enable BroadcastJoinSuite -

[jira] [Updated] (SPARK-9140) Replace TimeTracker by Stopwatch

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9140: - Target Version/s: 1.6.0 (was: 1.5.0) Replace TimeTracker by Stopwatch

[jira] [Updated] (SPARK-8246) string function: get_json_object

2015-08-03 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8246: Target Version/s: 1.5.0 (was: ) string function: get_json_object

[jira] [Updated] (SPARK-2429) Hierarchical Implementation of KMeans

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-2429: - Target Version/s: 1.6.0 (was: 1.5.0) Hierarchical Implementation of KMeans

[jira] [Commented] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652402#comment-14652402 ] Joseph K. Bradley commented on SPARK-3181: -- It looks like this needs to slip to

[jira] [Updated] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-3181: - Target Version/s: 1.6.0 (was: 1.5.0) Add Robust Regression Algorithm with Huber

[jira] [Updated] (SPARK-6722) Model import/export for StreamingKMeansModel

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6722: - Target Version/s: 1.6.0 (was: 1.5.0) Model import/export for StreamingKMeansModel

[jira] [Commented] (SPARK-7316) Add step capability to RDD sliding window

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652407#comment-14652407 ] Joseph K. Bradley commented on SPARK-7316: -- Retargeting for 1.6 Add step

[jira] [Assigned] (SPARK-7130) spark.ml RandomForest* should always do bootstrapping

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-7130: Assignee: Joseph K. Bradley spark.ml RandomForest* should always do bootstrapping

[jira] [Assigned] (SPARK-9447) Update python API to include RandomForest as classifier changes.

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-9447: Assignee: Joseph K. Bradley Update python API to include RandomForest as

[jira] [Updated] (SPARK-9447) Update python API to include RandomForest as classifier changes.

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9447: - Shepherd: (was: Joseph K. Bradley) Update python API to include RandomForest as

[jira] [Resolved] (SPARK-8160) Tungsten style external aggregation

2015-08-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-8160. --- Resolution: Fixed Fix Version/s: 1.5.0 Marking this as resolved since all of its subtasks have

[jira] [Updated] (SPARK-9570) Preferred rrecommendation for spark-submit --master

2015-08-03 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neelesh Srinivas Salian updated SPARK-9570: --- Summary: Preferred rrecommendation for spark-submit --master (was:

[jira] [Updated] (SPARK-9570) Preferred recommendation for spark-submit --master

2015-08-03 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neelesh Srinivas Salian updated SPARK-9570: --- Summary: Preferred recommendation for spark-submit --master (was: Preferred

[jira] [Resolved] (SPARK-9311) Enable the ability to view centrally aggregated YARN logs for Spark Executors in the History Server UI

2015-08-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-9311. --- Resolution: Duplicate This is already possible, you just need to configure YARN properly.

[jira] [Resolved] (SPARK-7725) --py-files doesn't seem to work in YARN cluster mode

2015-08-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-7725. --- Resolution: Duplicate This is fixed in 1.5.0. Unfortunately the fix is a little large for

[jira] [Commented] (SPARK-6667) hang while collect in PySpark

2015-08-03 Thread Ari Meyer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652434#comment-14652434 ] Ari Meyer commented on SPARK-6667: -- I just tested with 1.3.1, and it works fine. I then

[jira] [Updated] (SPARK-6682) Deprecate static train and use builder instead for Scala/Java

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6682: - Target Version/s: 1.6.0 (was: 1.5.0) Deprecate static train and use builder instead for

[jira] [Resolved] (SPARK-2506) In yarn-cluster mode, ApplicationMaster does not clean up correctly at the end of the job if users call sc.stop manually

2015-08-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-2506. --- Resolution: Cannot Reproduce I believe this has been fixed in the many releases since;

[jira] [Resolved] (SPARK-3102) Add tests for yarn-client mode

2015-08-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-3102. --- Resolution: Implemented We have yarn-client integration tests now, and have for a while. If

[jira] [Updated] (SPARK-8266) string function: translate

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8266: --- Target Version/s: 1.6.0 string function: translate --

[jira] [Updated] (SPARK-8246) string function: get_json_object

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8246: --- Shepherd: Davies Liu string function: get_json_object

[jira] [Updated] (SPARK-8231) complex function: array_contains

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8231: --- Shepherd: Davies Liu complex function: array_contains

[jira] [Updated] (SPARK-8244) string function: find_in_set

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8244: --- Target Version/s: 1.6.0 (was: ) string function: find_in_set

[jira] [Updated] (SPARK-8159) Improve expression function coverage (Spark 1.5)

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8159: --- Summary: Improve expression function coverage (Spark 1.5) (was: Improve expression function

[jira] [Created] (SPARK-9571) Improve expression function coverage (Spark 1.6)

2015-08-03 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-9571: -- Summary: Improve expression function coverage (Spark 1.6) Key: SPARK-9571 URL: https://issues.apache.org/jira/browse/SPARK-9571 Project: Spark Issue Type:

[jira] [Updated] (SPARK-8266) string function: translate

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8266: --- Parent Issue: SPARK-9571 (was: SPARK-8159) string function: translate --

[jira] [Updated] (SPARK-8244) string function: find_in_set

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8244: --- Parent Issue: SPARK-9571 (was: SPARK-8159) string function: find_in_set

[jira] [Updated] (SPARK-8233) misc function: hash

2015-08-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8233: --- Parent Issue: SPARK-9571 (was: SPARK-8159) misc function: hash ---

[jira] [Commented] (SPARK-9570) Preferred recommendation for spark-submit --master

2015-08-03 Thread Guru Medasani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14652497#comment-14652497 ] Guru Medasani commented on SPARK-9570: -- +1. May be change the name of this JIRA as

[jira] [Updated] (SPARK-9447) Python RandomForestClassifier probabilityCol, rawPredictionCol

2015-08-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9447: - Summary: Python RandomForestClassifier probabilityCol, rawPredictionCol (was: Update

<    1   2   3   4   5   >