[jira] [Created] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-02-16 Thread Theodore Vasiloudis (JIRA)
Theodore Vasiloudis created SPARK-5838: -- Summary: Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart Key: SPARK-5838 URL:

[jira] [Commented] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322957#comment-14322957 ] Sean Owen commented on SPARK-5837: -- The question is what is trying to connect to what

[jira] [Updated] (SPARK-5812) Potential flaky test JavaAPISuite.glom

2015-02-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5812: -- Description: https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/27455/ {code} [error]

[jira] [Commented] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-16 Thread Marco Capuccini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322936#comment-14322936 ] Marco Capuccini commented on SPARK-5837: More about my setup: I'm running over

[jira] [Updated] (SPARK-5815) Deprecate SVDPlusPlus APIs that expose DoubleMatrix from JBLAS

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5815: - Fix Version/s: 1.4.0 Also pushed a change to update run() and deprecated runSVDPlusPlus() for 1.4.0

[jira] [Commented] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-16 Thread Marco Capuccini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322919#comment-14322919 ] Marco Capuccini commented on SPARK-5837: Right now I'm running Spark 1.2.1 over

[jira] [Commented] (SPARK-5838) Changing SPARK_LOCAL_DIRS option in spark-env.sh does not take effect without daemon restart

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322947#comment-14322947 ] Sean Owen commented on SPARK-5838: -- I think it's to be expected that you can't change env

[jira] [Commented] (SPARK-5837) HTTP 500 if try to access Spark UI in yarn-cluster or yarn-client mode

2015-02-16 Thread Marco Capuccini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14322982#comment-14322982 ] Marco Capuccini commented on SPARK-5837: The problem occurs in both yarn-client

[jira] [Created] (SPARK-5839) HiveMetastoreCatalog does not recognize table aliases of data source tables.

2015-02-16 Thread Yin Huai (JIRA)
Yin Huai created SPARK-5839: --- Summary: HiveMetastoreCatalog does not recognize table aliases of data source tables. Key: SPARK-5839 URL: https://issues.apache.org/jira/browse/SPARK-5839 Project: Spark

[jira] [Reopened] (SPARK-5548) Flaky test: org.apache.spark.util.AkkaUtilsSuite.remote fetch ssl on - untrusted server

2015-02-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reopened SPARK-5548: --- I'm re-opening this issue since we've continued to see this flakiness on AMPLab Jenkins (even after the

[jira] [Updated] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5832: - Assignee: Liang-Chi Hsieh Add Affinity Propagation clustering algorithm

[jira] [Commented] (SPARK-5841) Memory leak in DiskBlockManager

2015-02-16 Thread Matt Whelan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323169#comment-14323169 ] Matt Whelan commented on SPARK-5841: PR: https://github.com/apache/spark/pull/4627

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323217#comment-14323217 ] Joseph K. Bradley commented on SPARK-5016: -- Hi all, (back with Internet now)

[jira] [Resolved] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5795. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4608

[jira] [Assigned] (SPARK-5795) api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-5795: Assignee: Sean Owen api.java.JavaPairDStream.saveAsNewAPIHadoopFiles may not friendly to java

[jira] [Created] (SPARK-5843) Expose Map-Side-Combine Setting in JavaPairRDD.combineByKey()

2015-02-16 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-5843: - Summary: Expose Map-Side-Combine Setting in JavaPairRDD.combineByKey() Key: SPARK-5843 URL: https://issues.apache.org/jira/browse/SPARK-5843 Project: Spark Issue

[jira] [Commented] (SPARK-5839) HiveMetastoreCatalog does not recognize table names and aliases of data source tables.

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323086#comment-14323086 ] Apache Spark commented on SPARK-5839: - User 'yhuai' has created a pull request for

[jira] [Created] (SPARK-5841) Memory leak in DiskBlockManager

2015-02-16 Thread Matt Whelan (JIRA)
Matt Whelan created SPARK-5841: -- Summary: Memory leak in DiskBlockManager Key: SPARK-5841 URL: https://issues.apache.org/jira/browse/SPARK-5841 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4949) shutdownCallback in SparkDeploySchedulerBackend should be enclosed by synchronized block.

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-4949: - Priority: Minor (was: Major) Target Version/s: (was: 1.3.0) Affects Version/s:

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323254#comment-14323254 ] Chris T commented on SPARK-5436: That sounds like a good idea to me, with the caveat that

[jira] [Commented] (SPARK-5844) Optimize Pipeline.fit for ParamGrid

2015-02-16 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323253#comment-14323253 ] Peter Rudenko commented on SPARK-5844: -- Here's a solution i came up with. Maybe would

[jira] [Updated] (SPARK-5843) Expose Map-Side-Combine Setting in JavaPairRDD.combineByKey()

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5843: - Component/s: Spark Core Priority: Minor (was: Major) Affects Version/s: 1.2.1

[jira] [Created] (SPARK-5840) HiveContext cannot be serialized due to tuple extraction

2015-02-16 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-5840: -- Summary: HiveContext cannot be serialized due to tuple extraction Key: SPARK-5840 URL: https://issues.apache.org/jira/browse/SPARK-5840 Project: Spark Issue

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323242#comment-14323242 ] Joseph K. Bradley commented on SPARK-4766: -- [~prudenko] That's a great point

[jira] [Commented] (SPARK-5843) Expose Map-Side-Combine Setting in JavaPairRDD.combineByKey()

2015-02-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323259#comment-14323259 ] Matt Cheah commented on SPARK-5843: --- Code's on my screen right now and will ship

[jira] [Commented] (SPARK-5841) Memory leak in DiskBlockManager

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323170#comment-14323170 ] Apache Spark commented on SPARK-5841: - User 'MattWhelan' has created a pull request

[jira] [Commented] (SPARK-5840) HiveContext cannot be serialized due to tuple extraction

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323177#comment-14323177 ] Apache Spark commented on SPARK-5840: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323248#comment-14323248 ] Joseph K. Bradley commented on SPARK-5436: -- I think it would be nice to have a

[jira] [Resolved] (SPARK-5824) CTAS should set null format in hive-0.13.1

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5824. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4609

[jira] [Created] (SPARK-5842) Allow creating broadcast variables on workers

2015-02-16 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5842: Summary: Allow creating broadcast variables on workers Key: SPARK-5842 URL: https://issues.apache.org/jira/browse/SPARK-5842 Project: Spark Issue Type: New

[jira] [Comment Edited] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323242#comment-14323242 ] Joseph K. Bradley edited comment on SPARK-4766 at 2/16/15 8:27 PM:

[jira] [Resolved] (SPARK-5799) Compute aggregation function on specified numeric columns

2015-02-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5799. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Liang-Chi Hsieh Compute

[jira] [Commented] (SPARK-3203) ClassNotFoundException in spark-shell with Cassandra

2015-02-16 Thread Lishu Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323093#comment-14323093 ] Lishu Liu commented on SPARK-3203: -- [~helena_e] I have similar issue as well when I'm in

[jira] [Updated] (SPARK-5839) HiveMetastoreCatalog does not recognize table aliases of data source tables.

2015-02-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5839: Description: For example, when we run {code} val originalDefaultSource = conf.defaultDataSourceName val

[jira] [Updated] (SPARK-5839) HiveMetastoreCatalog does not recognize table names and aliases of data source tables.

2015-02-16 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-5839: Summary: HiveMetastoreCatalog does not recognize table names and aliases of data source tables. (was:

[jira] [Created] (SPARK-5844) Optimize Pipeline.fit for ParamGrid

2015-02-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-5844: Summary: Optimize Pipeline.fit for ParamGrid Key: SPARK-5844 URL: https://issues.apache.org/jira/browse/SPARK-5844 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4865) Include temporary tables in SHOW TABLES

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4865: Assignee: Yin Huai Include temporary tables in SHOW TABLES

[jira] [Updated] (SPARK-5740) Change comment default value from empty string to null in DescribeCommand

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5740: Target Version/s: 1.4.0 (was: 1.3.0) Change comment default value from empty string to

[jira] [Updated] (SPARK-5327) HiveCompatibilitySuite fails when executed against Hive 0.12.0

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5327: Target Version/s: 1.4.0 (was: 1.3.0, 1.2.1) HiveCompatibilitySuite fails when executed

[jira] [Updated] (SPARK-4706) Remove FakeParquetSerDe

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4706: Target Version/s: 1.4.0 (was: 1.3.0) Remove FakeParquetSerDe ---

[jira] [Updated] (SPARK-4302) Make jsonRDD/jsonFile support more field data types

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4302: Target Version/s: 1.4.0 (was: 1.3.0) Make jsonRDD/jsonFile support more field data types

[jira] [Resolved] (SPARK-3851) Support for reading parquet files with different but compatible schema

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-3851. - Resolution: Fixed This is now handled for parquet tables created through the data sources

[jira] [Created] (SPARK-5845) Time to cleanup intermediate shuffle files not included in shuffle write time

2015-02-16 Thread Kay Ousterhout (JIRA)
Kay Ousterhout created SPARK-5845: - Summary: Time to cleanup intermediate shuffle files not included in shuffle write time Key: SPARK-5845 URL: https://issues.apache.org/jira/browse/SPARK-5845

[jira] [Updated] (SPARK-3702) Standardize MLlib classes for learners, models

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3702: - Target Version/s: (was: 1.3.0) Standardize MLlib classes for learners, models

[jira] [Commented] (SPARK-5688) Splits for Categorical Variables in DecisionTrees

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323311#comment-14323311 ] Joseph K. Bradley commented on SPARK-5688: -- This actually does not happen.

[jira] [Resolved] (SPARK-5255) Use python doc note for experimental tags in tree.py

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5255. -- Resolution: Fixed Use python doc note for experimental tags in tree.py

[jira] [Commented] (SPARK-5785) Pyspark does not support narrow dependencies

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323327#comment-14323327 ] Apache Spark commented on SPARK-5785: - User 'davies' has created a pull request for

[jira] [Commented] (SPARK-1600) flaky recovery with file input stream test in streaming.CheckpointSuite

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323346#comment-14323346 ] Apache Spark commented on SPARK-1600: - User 'JoshRosen' has created a pull request for

[jira] [Updated] (SPARK-5848) ConsoleProgressBar timer thread leaks SparkContext

2015-02-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5848: --- Component/s: (was: Web UI) Spark Shell ConsoleProgressBar timer thread

[jira] [Commented] (SPARK-5016) GaussianMixtureEM should distribute matrix inverse for large numFeatures, k

2015-02-16 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323412#comment-14323412 ] Travis Galoppo commented on SPARK-5016: --- Realistically, I think it will be very

[jira] [Updated] (SPARK-5849) Handle more types of invalid JSON in SubmitRestProtocolMessage.parseAction

2015-02-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5849: -- Summary: Handle more types of invalid JSON in SubmitRestProtocolMessage.parseAction (was:

[jira] [Updated] (SPARK-5535) Add parameter for storage levels.

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5535: - Target Version/s: 1.4.0 (was: 1.3.0) Add parameter for storage levels.

[jira] [Updated] (SPARK-5846) Spark SQL should set job description and pool *before* running jobs

2015-02-16 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-5846: -- Description: Spark SQL current sets the scheduler pool and job description AFTER jobs run (see

[jira] [Commented] (SPARK-5846) Spark SQL should set job description and pool *before* running jobs

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323334#comment-14323334 ] Apache Spark commented on SPARK-5846: - User 'kayousterhout' has created a pull request

[jira] [Updated] (SPARK-5846) Spark SQL should set job description and pool *before* running jobs

2015-02-16 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-5846: -- Target Version/s: 1.3.0, 1.2.2 Spark SQL should set job description and pool *before* running

[jira] [Created] (SPARK-5847) Allow for configuring MetricsSystem's use of app ID to namespace all metrics

2015-02-16 Thread Ryan Williams (JIRA)
Ryan Williams created SPARK-5847: Summary: Allow for configuring MetricsSystem's use of app ID to namespace all metrics Key: SPARK-5847 URL: https://issues.apache.org/jira/browse/SPARK-5847 Project:

[jira] [Updated] (SPARK-5846) Spark SQL does not correctly set job description and scheduler pool

2015-02-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5846: --- Priority: Critical (was: Major) Spark SQL does not correctly set job description and

[jira] [Commented] (SPARK-5849) Handle more types of invalid JSON in SubmitRestProtocolMessage.parseAction

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323421#comment-14323421 ] Apache Spark commented on SPARK-5849: - User 'JoshRosen' has created a pull request for

[jira] [Resolved] (SPARK-5357) Upgrade from commons-codec 1.5

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-5357. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4153

[jira] [Commented] (SPARK-5850) Remove experimental label for Scala 2.11 and FlumePollingStream

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323430#comment-14323430 ] Apache Spark commented on SPARK-5850: - User 'pwendell' has created a pull request for

[jira] [Updated] (SPARK-3839) Reimplement HashOuterJoin to construct hash table of only one relation

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3839: Target Version/s: 1.4.0 (was: 1.3.0) Reimplement HashOuterJoin to construct hash table of

[jira] [Updated] (SPARK-5814) Remove JBLAS from runtime dependencies

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5814: - Priority: Critical (was: Major) Remove JBLAS from runtime dependencies

[jira] [Commented] (SPARK-5255) Use python doc note for experimental tags in tree.py

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323313#comment-14323313 ] Xiangrui Meng commented on SPARK-5255: -- This was resolved as part of

[jira] [Updated] (SPARK-5846) Spark SQL does not correctly set job description and scheduler pool

2015-02-16 Thread Kay Ousterhout (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kay Ousterhout updated SPARK-5846: -- Summary: Spark SQL does not correctly set job description and scheduler pool (was: Spark SQL

[jira] [Updated] (SPARK-5843) Expose all parameters in JavaPairRDD.combineByKey()

2015-02-16 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-5843: -- Description: It would be nice if users of the Java API could specify the map-side-combine and

[jira] [Updated] (SPARK-5849) Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction

2015-02-16 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5849: -- Summary: Handle more types of invalid JSON requests in SubmitRestProtocolMessage.parseAction (was:

[jira] [Updated] (SPARK-5850) Remove experimental label for Scala 2.11 and FlumePollingStream

2015-02-16 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5850: --- Summary: Remove experimental label for Scala 2.11 and FlumePollingStream (was: Clean up

[jira] [Created] (SPARK-5850) Clean up experimental label for Scala 2.11 and FlumePollingStream

2015-02-16 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-5850: -- Summary: Clean up experimental label for Scala 2.11 and FlumePollingStream Key: SPARK-5850 URL: https://issues.apache.org/jira/browse/SPARK-5850 Project: Spark

[jira] [Updated] (SPARK-3184) Allow user to specify num tasks to use for a table

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3184: Target Version/s: 1.4.0 (was: 1.3.0) Allow user to specify num tasks to use for a table

[jira] [Updated] (SPARK-3862) MultiWayBroadcastInnerHashJoin

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3862: Target Version/s: 1.4.0 (was: 1.3.0) MultiWayBroadcastInnerHashJoin

[jira] [Updated] (SPARK-3298) [SQL] registerAsTable / registerTempTable overwrites old tables

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3298: Target Version/s: 1.4.0 (was: 1.3.0) [SQL] registerAsTable / registerTempTable overwrites

[jira] [Updated] (SPARK-4521) Parquet fails to read columns with spaces in the name

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4521: Target Version/s: 1.4.0 (was: 1.3.0) Parquet fails to read columns with spaces in the

[jira] [Updated] (SPARK-5264) Support `drop temporary table [if exists]` DDL command

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5264: Target Version/s: 1.4.0 (was: 1.3.0) Support `drop temporary table [if exists]` DDL

[jira] [Updated] (SPARK-4561) PySparkSQL's Row.asDict() should convert nested rows to dictionaries

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4561: Target Version/s: 1.4.0 (was: 1.3.0) PySparkSQL's Row.asDict() should convert nested rows

[jira] [Updated] (SPARK-5251) Using `tableIdentifier` in hive metastore

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5251: Target Version/s: 1.4.0 (was: 1.3.0) Using `tableIdentifier` in hive metastore

[jira] [Updated] (SPARK-4689) Unioning 2 SchemaRDDs should return a SchemaRDD in Python, Scala, and Java

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4689: Target Version/s: 1.4.0 (was: 1.3.0) Unioning 2 SchemaRDDs should return a SchemaRDD in

[jira] [Updated] (SPARK-2205) Unnecessary exchange operators in a join on multiple tables with the same join key.

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2205: Target Version/s: 1.4.0 (was: 1.3.0) Unnecessary exchange operators in a join on multiple

[jira] [Updated] (SPARK-3880) HBase as data source to SparkSQL

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3880: Target Version/s: 1.4.0 (was: 1.3.0) HBase as data source to SparkSQL

[jira] [Updated] (SPARK-4782) Add inferSchema support for RDD[Map[String, Any]]

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4782: Target Version/s: 1.4.0 (was: 1.3.0) Add inferSchema support for RDD[Map[String, Any]]

[jira] [Updated] (SPARK-4684) Add a script to run JDBC server on Windows

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4684: Target Version/s: 1.4.0 (was: 1.3.0) Add a script to run JDBC server on Windows

[jira] [Updated] (SPARK-5833) Adds REFRESH TABLE command to refresh external data sources tables

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5833: Target Version/s: 1.4.0 (was: 1.3.0) Adds REFRESH TABLE command to refresh external data

[jira] [Updated] (SPARK-3864) Specialize join for tables with unique integer keys

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3864: Target Version/s: 1.4.0 (was: 1.3.0) Specialize join for tables with unique integer keys

[jira] [Updated] (SPARK-4944) Table Not Found exception in Create Table Like registered RDD table

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-4944: Target Version/s: 1.4.0 (was: 1.3.0) Table Not Found exception in Create Table Like

[jira] [Updated] (SPARK-2863) Emulate Hive type coercion in native reimplementations of Hive functions

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-2863: Target Version/s: 1.4.0 (was: 1.3.0) Emulate Hive type coercion in native

[jira] [Updated] (SPARK-5741) Support the path contains comma in HiveContext

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5741: Target Version/s: 1.4.0 (was: 1.3.0) Support the path contains comma in HiveContext

[jira] [Updated] (SPARK-5720) `Create Table Like` in HiveContext need support `like registered temporary table`

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5720: Target Version/s: 1.4.0 (was: 1.3.0) `Create Table Like` in HiveContext need support

[jira] [Updated] (SPARK-3863) Cache broadcasted tables and reuse them across queries

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3863: Target Version/s: 1.4.0 (was: 1.3.0) Cache broadcasted tables and reuse them across

[jira] [Commented] (SPARK-5809) OutOfMemoryError in logDebug in RandomForest.scala

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323280#comment-14323280 ] Joseph K. Bradley commented on SPARK-5809: -- I agree with [~srowen]'s assessment.

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323335#comment-14323335 ] Chris T commented on SPARK-5436: Aha, that's a neat solution. I like it! Validate

[jira] [Commented] (SPARK-5847) Allow for configuring MetricsSystem's use of app ID to namespace all metrics

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323340#comment-14323340 ] Apache Spark commented on SPARK-5847: - User 'ryan-williams' has created a pull request

[jira] [Commented] (SPARK-5843) Expose Map-Side-Combine Setting in JavaPairRDD.combineByKey()

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323357#comment-14323357 ] Apache Spark commented on SPARK-5843: - User 'mccheah' has created a pull request for

[jira] [Updated] (SPARK-5357) Upgrade from commons-codec 1.5

2015-02-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-5357: - Priority: Minor (was: Major) Assignee: Matt Whelan Upgrade from commons-codec 1.5

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323269#comment-14323269 ] Chris T commented on SPARK-5436: I think we need to allow the use-case where the user

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323282#comment-14323282 ] Joseph K. Bradley commented on SPARK-5436: -- If they call train/fit with only a

[jira] [Updated] (SPARK-4588) Add API for feature attributes

2015-02-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4588: - Priority: Critical (was: Major) Add API for feature attributes --

[jira] [Updated] (SPARK-5723) Change the default file format to Parquet for CTAS statements.

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5723: Assignee: Yin Huai Change the default file format to Parquet for CTAS statements.

[jira] [Commented] (SPARK-5463) Fix Parquet filter push-down

2015-02-16 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323284#comment-14323284 ] Michael Armbrust commented on SPARK-5463: - Any progress here? It seems like

[jira] [Commented] (SPARK-5436) Validate GradientBoostedTrees during training

2015-02-16 Thread Chris T (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323317#comment-14323317 ] Chris T commented on SPARK-5436: There is already a predict method in the model object, so

[jira] [Commented] (SPARK-5846) Spark SQL should set job description and pool *before* running jobs

2015-02-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323328#comment-14323328 ] Apache Spark commented on SPARK-5846: - User 'kayousterhout' has created a pull request

[jira] [Commented] (SPARK-5548) Flaky test: org.apache.spark.util.AkkaUtilsSuite.remote fetch ssl on - untrusted server

2015-02-16 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323354#comment-14323354 ] Jacek Lewandowski commented on SPARK-5548: -- :( Flaky test:

  1   2   3   >