[jira] [Created] (SPARK-11524) Support SparkR with Mesos cluster

2015-11-04 Thread Sun Rui (JIRA)
Sun Rui created SPARK-11524: --- Summary: Support SparkR with Mesos cluster Key: SPARK-11524 URL: https://issues.apache.org/jira/browse/SPARK-11524 Project: Spark Issue Type: New Feature Com

[jira] [Commented] (SPARK-11507) Error thrown when using BlockMatrix.add

2015-11-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991261#comment-14991261 ] yuhao yang commented on SPARK-11507: Looking into it. Should be a bug. Breeze may rem

[jira] [Comment Edited] (SPARK-11507) Error thrown when using BlockMatrix.add

2015-11-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991261#comment-14991261 ] yuhao yang edited comment on SPARK-11507 at 11/5/15 7:21 AM: -

[jira] [Commented] (SPARK-11475) DataFrame API saveAsTable() does not work well for HDFS HA

2015-11-04 Thread zhangxiongfei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991258#comment-14991258 ] zhangxiongfei commented on SPARK-11475: --- Hi [~rekhajoshm] Thanks for pointing out m

[jira] [Commented] (SPARK-4557) Spark Streaming' foreachRDD method should accept a VoidFunction<...>, not a Function<..., Void>

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991239#comment-14991239 ] Apache Spark commented on SPARK-4557: - User 'BryanCutler' has created a pull request f

[jira] [Assigned] (SPARK-4557) Spark Streaming' foreachRDD method should accept a VoidFunction<...>, not a Function<..., Void>

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4557: --- Assignee: (was: Apache Spark) > Spark Streaming' foreachRDD method should accept a VoidFu

[jira] [Assigned] (SPARK-4557) Spark Streaming' foreachRDD method should accept a VoidFunction<...>, not a Function<..., Void>

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4557: --- Assignee: Apache Spark > Spark Streaming' foreachRDD method should accept a VoidFunction<...>

[jira] [Created] (SPARK-11523) spark_partition_id() considered invalid function

2015-11-04 Thread Simeon Simeonov (JIRA)
Simeon Simeonov created SPARK-11523: --- Summary: spark_partition_id() considered invalid function Key: SPARK-11523 URL: https://issues.apache.org/jira/browse/SPARK-11523 Project: Spark Issue

[jira] [Commented] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991208#comment-14991208 ] Apache Spark commented on SPARK-2533: - User 'jbonofre' has created a pull request for

[jira] [Commented] (SPARK-10729) word2vec model save for python

2015-11-04 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991206#comment-14991206 ] Yu Ishikawa commented on SPARK-10729: - Sorry, the cause isn't `@inherit_doc`. I misun

[jira] [Commented] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2015-11-04 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991197#comment-14991197 ] Jean-Baptiste Onofré commented on SPARK-2533: - New clean PR. > Show summary o

[jira] [Commented] (SPARK-2533) Show summary of locality level of completed tasks in the each stage page of web UI

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991187#comment-14991187 ] Apache Spark commented on SPARK-2533: - User 'jbonofre' has created a pull request for

[jira] [Commented] (SPARK-11193) Spark 1.5+ Kinesis Streaming - ClassCastException when starting KinesisReceiver

2015-11-04 Thread Jean-Baptiste Onofré (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991160#comment-14991160 ] Jean-Baptiste Onofré commented on SPARK-11193: -- Hi Phil, I'm testing a fix o

[jira] [Resolved] (SPARK-11486) TungstenAggregate may fail when switching to sort-based aggregation when there are string in grouping columns and no aggregation buffer columns

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11486. Resolution: Fixed Issue resolved by pull request 9383 [https://github.com/apache/spark/pull/9383]

[jira] [Resolved] (SPARK-11425) Improve hybrid aggregation (sort-based after hash-based)

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11425. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9383 [https://github.c

[jira] [Updated] (SPARK-11500) Not deterministic order of columns when using merging schemas.

2015-11-04 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-11500: - Description: When executing {{sqlContext.read.option("mergeSchema", "true").parquet(pathOne, p

[jira] [Assigned] (SPARK-11514) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11514: Assignee: Apache Spark (was: Yu Ishikawa) > Pass random seed to spark.ml DecisionTree* >

[jira] [Commented] (SPARK-11514) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991117#comment-14991117 ] Apache Spark commented on SPARK-11514: -- User 'yu-iskw' has created a pull request fo

[jira] [Assigned] (SPARK-11514) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11514: Assignee: Yu Ishikawa (was: Apache Spark) > Pass random seed to spark.ml DecisionTree* >

[jira] [Commented] (SPARK-10838) Repeat to join one DataFrame twice,there will be AnalysisException.

2015-11-04 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991108#comment-14991108 ] Xiao Li commented on SPARK-10838: - The fix is ready. Writing unit test cases now. > Rep

[jira] [Created] (SPARK-11522) input_file_name() returns "" for external tables

2015-11-04 Thread Simeon Simeonov (JIRA)
Simeon Simeonov created SPARK-11522: --- Summary: input_file_name() returns "" for external tables Key: SPARK-11522 URL: https://issues.apache.org/jira/browse/SPARK-11522 Project: Spark Issue

[jira] [Created] (SPARK-11521) LinearRegressionSummary needs to clarify which metrics are weighted

2015-11-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11521: - Summary: LinearRegressionSummary needs to clarify which metrics are weighted Key: SPARK-11521 URL: https://issues.apache.org/jira/browse/SPARK-11521 Project

[jira] [Created] (SPARK-11520) RegressionMetrics should support instance weights

2015-11-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11520: - Summary: RegressionMetrics should support instance weights Key: SPARK-11520 URL: https://issues.apache.org/jira/browse/SPARK-11520 Project: Spark I

[jira] [Created] (SPARK-11519) Spark MemoryStore with hadoop SequenceFile cache the values is same record.

2015-11-04 Thread xukaiqiang (JIRA)
xukaiqiang created SPARK-11519: -- Summary: Spark MemoryStore with hadoop SequenceFile cache the values is same record. Key: SPARK-11519 URL: https://issues.apache.org/jira/browse/SPARK-11519 Project: Spar

[jira] [Commented] (SPARK-11473) R-like summary statistics with intercept for OLS via normal equation solver

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991022#comment-14991022 ] Apache Spark commented on SPARK-11473: -- User 'yanboliang' has created a pull request

[jira] [Assigned] (SPARK-11473) R-like summary statistics with intercept for OLS via normal equation solver

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11473: Assignee: (was: Apache Spark) > R-like summary statistics with intercept for OLS via n

[jira] [Assigned] (SPARK-11473) R-like summary statistics with intercept for OLS via normal equation solver

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11473: Assignee: Apache Spark > R-like summary statistics with intercept for OLS via normal equat

[jira] [Commented] (SPARK-9273) Add Convolutional Neural network to Spark MLlib

2015-11-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14991018#comment-14991018 ] yuhao yang commented on SPARK-9273: --- Hi [~avulanov]. I've refactored the CNN in [https

[jira] [Created] (SPARK-11518) The script spark-submit.cmd can not handle spark directory with space.

2015-11-04 Thread Cele Liu (JIRA)
Cele Liu created SPARK-11518: Summary: The script spark-submit.cmd can not handle spark directory with space. Key: SPARK-11518 URL: https://issues.apache.org/jira/browse/SPARK-11518 Project: Spark

[jira] [Commented] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990974#comment-14990974 ] yuhao yang commented on SPARK-10809: working on this. > Single-document topicDistrib

[jira] [Commented] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990975#comment-14990975 ] Apache Spark commented on SPARK-10809: -- User 'hhbyyh' has created a pull request for

[jira] [Assigned] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10809: Assignee: (was: Apache Spark) > Single-document topicDistributions method for LocalLDA

[jira] [Assigned] (SPARK-10809) Single-document topicDistributions method for LocalLDAModel

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10809: Assignee: Apache Spark > Single-document topicDistributions method for LocalLDAModel > ---

[jira] [Commented] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990962#comment-14990962 ] Apache Spark commented on SPARK-11517: -- User 'zhichao-li' has created a pull request

[jira] [Assigned] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11517: Assignee: (was: Apache Spark) > Calc partitions in parallel for multiple partitions ta

[jira] [Assigned] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11517: Assignee: Apache Spark > Calc partitions in parallel for multiple partitions table > -

[jira] [Created] (SPARK-11517) Calc partitions in parallel for multiple partitions table

2015-11-04 Thread zhichao-li (JIRA)
zhichao-li created SPARK-11517: -- Summary: Calc partitions in parallel for multiple partitions table Key: SPARK-11517 URL: https://issues.apache.org/jira/browse/SPARK-11517 Project: Spark Issue T

[jira] [Created] (SPARK-11516) Spark application cannot be found from JSON API even though it exists

2015-11-04 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-11516: -- Summary: Spark application cannot be found from JSON API even though it exists Key: SPARK-11516 URL: https://issues.apache.org/jira/browse/SPARK-11516 Project: Spark

[jira] [Commented] (SPARK-11453) append data to partitioned table will messes up the result

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990951#comment-14990951 ] Apache Spark commented on SPARK-11453: -- User 'cloud-fan' has created a pull request

[jira] [Commented] (SPARK-10785) Scale QuantileDiscretizer using distributed binning

2015-11-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990948#comment-14990948 ] holdenk commented on SPARK-10785: - So looking at the tree work it looks like just did a g

[jira] [Commented] (SPARK-10785) Scale QuantileDiscretizer using distributed binning

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990930#comment-14990930 ] Joseph K. Bradley commented on SPARK-10785: --- Yes, we should sample still. Exte

[jira] [Created] (SPARK-11515) QuantileDiscretizer should take random seed

2015-11-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11515: - Summary: QuantileDiscretizer should take random seed Key: SPARK-11515 URL: https://issues.apache.org/jira/browse/SPARK-11515 Project: Spark Issue T

[jira] [Assigned] (SPARK-10745) Separate configs between shuffle and RPC

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10745: Assignee: Apache Spark > Separate configs between shuffle and RPC > --

[jira] [Assigned] (SPARK-10745) Separate configs between shuffle and RPC

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10745: Assignee: (was: Apache Spark) > Separate configs between shuffle and RPC > ---

[jira] [Commented] (SPARK-10745) Separate configs between shuffle and RPC

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990926#comment-14990926 ] Apache Spark commented on SPARK-10745: -- User 'zsxwing' has created a pull request fo

[jira] [Commented] (SPARK-7425) spark.ml Predictor should support other numeric types for label

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990917#comment-14990917 ] Joseph K. Bradley commented on SPARK-7425: -- The VectorUDT usage for features shou

[jira] [Issue Comment Deleted] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-11-04 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xin Wu updated SPARK-9465: -- Comment: was deleted (was: I can not recreate the issue on 1.5.1 or 1.6.0.. {code} scala> sqlContext.sql("creat

[jira] [Commented] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-11-04 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990909#comment-14990909 ] Xin Wu commented on SPARK-9465: --- I can not recreate the issue on 1.5.1 or 1.6.0.. {code} sc

[jira] [Commented] (SPARK-9722) Pass random seed to spark.ml RandomForest findSplitsBins

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990899#comment-14990899 ] Joseph K. Bradley commented on SPARK-9722: -- Great, thank you! > Pass random seed

[jira] [Assigned] (SPARK-10371) Optimize sequential projections

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10371: Assignee: Apache Spark > Optimize sequential projections > ---

[jira] [Commented] (SPARK-10371) Optimize sequential projections

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990894#comment-14990894 ] Apache Spark commented on SPARK-10371: -- User 'nongli' has created a pull request for

[jira] [Assigned] (SPARK-10371) Optimize sequential projections

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10371: Assignee: (was: Apache Spark) > Optimize sequential projections >

[jira] [Commented] (SPARK-9465) Could not read parquet table after recreating it with the same table name

2015-11-04 Thread Xin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990890#comment-14990890 ] Xin Wu commented on SPARK-9465: --- I tried on both 1.5.1 and 1.6.0, I can not recreate the iss

[jira] [Resolved] (SPARK-11307) Reduce memory consumption of OutputCommitCoordinator bookkeeping structures

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11307. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9274 [https://github.c

[jira] [Resolved] (SPARK-11398) unnecessary def dialectClassName in HiveContext, and misleading dialect conf at the start of spark-sql

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-11398. Resolution: Fixed Fix Version/s: 1.6.0 > unnecessary def dialectClassName in HiveContext, an

[jira] [Commented] (SPARK-9722) Pass random seed to spark.ml RandomForest findSplitsBins

2015-11-04 Thread Yu Ishikawa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990877#comment-14990877 ] Yu Ishikawa commented on SPARK-9722: [~josephkb] I'll add a seed Param to {{DecisionTr

[jira] [Commented] (SPARK-11512) Bucket Join

2015-11-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990868#comment-14990868 ] Cheng Hao commented on SPARK-11512: --- We need to support the "bucket" for DataSource API

[jira] [Commented] (SPARK-11512) Bucket Join

2015-11-04 Thread Cheng Hao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990867#comment-14990867 ] Cheng Hao commented on SPARK-11512: --- Oh, yes, but SPARK-5292 is only about to support t

[jira] [Updated] (SPARK-9722) Pass random seed to spark.ml RandomForest findSplitsBins

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9722: - Summary: Pass random seed to spark.ml RandomForest findSplitsBins (was: Pass random seed

[jira] [Commented] (SPARK-9722) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990866#comment-14990866 ] Joseph K. Bradley commented on SPARK-9722: -- [~yuu.ishik...@gmail.com] Thanks for

[jira] [Created] (SPARK-11514) Pass random seed to spark.ml DecisionTree*

2015-11-04 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-11514: - Summary: Pass random seed to spark.ml DecisionTree* Key: SPARK-11514 URL: https://issues.apache.org/jira/browse/SPARK-11514 Project: Spark Issue Ty

[jira] [Commented] (SPARK-11512) Bucket Join

2015-11-04 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990861#comment-14990861 ] Marcelo Vanzin commented on SPARK-11512: Isn't this the same as in SPARK-5292? >

[jira] [Resolved] (SPARK-11491) Use Scala 2.10.5

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-11491. - Resolution: Fixed Fix Version/s: 1.6.0 > Use Scala 2.10.5 > > >

[jira] [Assigned] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11513: Assignee: Reynold Xin (was: Apache Spark) > Remove the internal implicit conversion from

[jira] [Assigned] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11513: Assignee: Apache Spark (was: Reynold Xin) > Remove the internal implicit conversion from

[jira] [Commented] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990853#comment-14990853 ] Apache Spark commented on SPARK-11513: -- User 'rxin' has created a pull request for t

[jira] [Created] (SPARK-11513) Remove the internal implicit conversion from LogicalPlan to DataFrame

2015-11-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-11513: --- Summary: Remove the internal implicit conversion from LogicalPlan to DataFrame Key: SPARK-11513 URL: https://issues.apache.org/jira/browse/SPARK-11513 Project: Spark

[jira] [Commented] (SPARK-10309) Some tasks failed with Unable to acquire memory

2015-11-04 Thread Abhishek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990851#comment-14990851 ] Abhishek commented on SPARK-10309: -- Is there any workaround for this issue? We migrated

[jira] [Created] (SPARK-11512) Bucket Join

2015-11-04 Thread Cheng Hao (JIRA)
Cheng Hao created SPARK-11512: - Summary: Bucket Join Key: SPARK-11512 URL: https://issues.apache.org/jira/browse/SPARK-11512 Project: Spark Issue Type: Sub-task Components: SQL

[jira] [Resolved] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-11510. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9475 [https://github.com/a

[jira] [Commented] (SPARK-10387) Code generation for decision tree

2015-11-04 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990809#comment-14990809 ] holdenk commented on SPARK-10387: - Progress - although I'm a little uncertain of what the

[jira] [Resolved] (SPARK-6001) K-Means clusterer should return the assignments of input points to clusters

2015-11-04 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-6001. -- Resolution: Fixed Assignee: Yu Ishikawa Fix Version/s: 1.5.0 Yep, thanks

[jira] [Closed] (SPARK-7332) RpcCallContext.sender has a different name from the original sender's name

2015-11-04 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu closed SPARK-7332. --- Resolution: Won't Fix They are internal APIs and not exposed to the user. > RpcCallContext.sender has

[jira] [Commented] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990795#comment-14990795 ] Reynold Xin commented on SPARK-11303: - This made it into 1.5.2. > sample (without r

[jira] [Commented] (SPARK-11103) Parquet filters push-down may cause exception when schema merging is turned on

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990793#comment-14990793 ] Reynold Xin commented on SPARK-11103: - I think this was included in 1.5.2 > Parquet

[jira] [Updated] (SPARK-11303) sample (without replacement) + filter returns wrong results in DataFrame

2015-11-04 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-11303: Fix Version/s: 1.5.2 > sample (without replacement) + filter returns wrong results in DataFrame > -

[jira] [Commented] (SPARK-6521) executors in the same node read local shuffle file

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990791#comment-14990791 ] Apache Spark commented on SPARK-6521: - User 'maropu' has created a pull request for th

[jira] [Commented] (SPARK-10648) Spark-SQL JDBC fails to set a default precision and scale when they are not defined in an oracle schema.

2015-11-04 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990785#comment-14990785 ] Yin Huai commented on SPARK-10648: -- https://github.com/apache/spark/pull/8780#issuecomme

[jira] [Commented] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990761#comment-14990761 ] Apache Spark commented on SPARK-7542: - User 'davies' has created a pull request for th

[jira] [Assigned] (SPARK-7542) Support off-heap sort buffer in UnsafeExternalSorter

2015-11-04 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-7542: - Assignee: Davies Liu > Support off-heap sort buffer in UnsafeExternalSorter > ---

[jira] [Commented] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990746#comment-14990746 ] Andrew Davidson commented on SPARK-11509: - I forgot to mention: on my cluster m

[jira] [Commented] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Andrew Davidson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990742#comment-14990742 ] Andrew Davidson commented on SPARK-11509: - yes , it appears the show stopper issu

[jira] [Resolved] (SPARK-10028) Add Python API for PrefixSpan

2015-11-04 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10028. --- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9469 [https://gi

[jira] [Commented] (SPARK-11459) Allow configuring checkpoint dir, filenames

2015-11-04 Thread Ryan Williams (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990676#comment-14990676 ] Ryan Williams commented on SPARK-11459: --- I'm mostly interested in saving RDDs to di

[jira] [Commented] (SPARK-11499) Spark History Server UI should respect protocol when doing redirection

2015-11-04 Thread Lukasz Jastrzebski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990674#comment-14990674 ] Lukasz Jastrzebski commented on SPARK-11499: There is also https://en.wikiped

[jira] [Commented] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990639#comment-14990639 ] Apache Spark commented on SPARK-11511: -- User 'zsxwing' has created a pull request fo

[jira] [Assigned] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11511: Assignee: (was: Apache Spark) > Creating an InputDStream but not using it throws NPE >

[jira] [Assigned] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11511: Assignee: Apache Spark > Creating an InputDStream but not using it throws NPE > --

[jira] [Created] (SPARK-11511) Creating an InputDStream but not using it throws NPE

2015-11-04 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-11511: Summary: Creating an InputDStream but not using it throws NPE Key: SPARK-11511 URL: https://issues.apache.org/jira/browse/SPARK-11511 Project: Spark Issue Ty

[jira] [Assigned] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11510: Assignee: Apache Spark (was: Reynold Xin) > Remove some SQL aggregation tests > -

[jira] [Assigned] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-11510: Assignee: Reynold Xin (was: Apache Spark) > Remove some SQL aggregation tests > -

[jira] [Commented] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990615#comment-14990615 ] Apache Spark commented on SPARK-11510: -- User 'rxin' has created a pull request for t

[jira] [Resolved] (SPARK-11493) Remove Bitset in BytesToBytesMap

2015-11-04 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-11493. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 9452 [https://github.c

[jira] [Created] (SPARK-11510) Remove some SQL aggregation tests

2015-11-04 Thread Reynold Xin (JIRA)
Reynold Xin created SPARK-11510: --- Summary: Remove some SQL aggregation tests Key: SPARK-11510 URL: https://issues.apache.org/jira/browse/SPARK-11510 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-11459) Allow configuring checkpoint dir, filenames

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-11459: -- Priority: Minor (was: Major) What's the use case for this? You can already control the directory, but

[jira] [Commented] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990600#comment-14990600 ] Sean Owen commented on SPARK-11509: --- This ultimately means the initialization failed. I

[jira] [Created] (SPARK-11509) ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script

2015-11-04 Thread Andrew Davidson (JIRA)
Andrew Davidson created SPARK-11509: --- Summary: ipython notebooks do not work on clusters created using spark-1.5.1-bin-hadoop2.6/ec2/spark-ec2 script Key: SPARK-11509 URL: https://issues.apache.org/jira/browse/S

[jira] [Assigned] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10788: Assignee: Apache Spark > Decision Tree duplicates bins for unordered categorical features

[jira] [Assigned] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10788: Assignee: (was: Apache Spark) > Decision Tree duplicates bins for unordered categorica

[jira] [Commented] (SPARK-10788) Decision Tree duplicates bins for unordered categorical features

2015-11-04 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990576#comment-14990576 ] Apache Spark commented on SPARK-10788: -- User 'sethah' has created a pull request for
