[jira] [Commented] (SPARK-7035) Drop __getattr__ on pyspark.sql.DataFrame

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530638#comment-14530638 ] Apache Spark commented on SPARK-7035: - User 'ksonj' has created a pull request for

[jira] [Assigned] (SPARK-7298) Harmonize style of new UI visualizations

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7298: --- Assignee: Matei Zaharia (was: Apache Spark) Harmonize style of new UI visualizations

[jira] [Commented] (SPARK-7298) Harmonize style of new UI visualizations

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530627#comment-14530627 ] Apache Spark commented on SPARK-7298: - User 'mateiz' has created a pull request for

[jira] [Assigned] (SPARK-7035) Drop __getattr__ on pyspark.sql.DataFrame

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7035: --- Assignee: Apache Spark Drop __getattr__ on pyspark.sql.DataFrame

[jira] [Created] (SPARK-7399) Master fails on 2.11 with compilation error

2015-05-06 Thread Iulian Dragos (JIRA)
Iulian Dragos created SPARK-7399: Summary: Master fails on 2.11 with compilation error Key: SPARK-7399 URL: https://issues.apache.org/jira/browse/SPARK-7399 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7116) Intermediate RDD cached but never unpersisted

2015-05-06 Thread Dennis Proppe (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530390#comment-14530390 ] Dennis Proppe commented on SPARK-7116: -- *I second the importance of fixing this.*

[jira] [Commented] (SPARK-5913) Python API for ChiSqSelector

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530427#comment-14530427 ] Apache Spark commented on SPARK-5913: - User 'yanboliang' has created a pull request

[jira] [Comment Edited] (SPARK-6258) Python MLlib API missing items: Clustering

2015-05-06 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530434#comment-14530434 ] Yanbo Liang edited comment on SPARK-6258 at 5/6/15 12:28 PM: -

[jira] [Assigned] (SPARK-4669) Allow users to set arbitrary akka configurations via property file

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4669: --- Assignee: Apache Spark Allow users to set arbitrary akka configurations via property file

[jira] [Commented] (SPARK-7377) DAG visualization: JS error when there is only 1 RDD in a stage

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531629#comment-14531629 ] Apache Spark commented on SPARK-7377: - User 'andrewor14' has created a pull request

[jira] [Resolved] (SPARK-5995) Make ML Prediction Developer APIs public

2015-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5995. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5913

[jira] [Created] (SPARK-7409) Designing multilabel abstractions for spark.ml

2015-05-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7409: Summary: Designing multilabel abstractions for spark.ml Key: SPARK-7409 URL: https://issues.apache.org/jira/browse/SPARK-7409 Project: Spark Issue

[jira] [Created] (SPARK-7407) Use uid and param name to identify a parameter instead of the param object

2015-05-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7407: Summary: Use uid and param name to identify a parameter instead of the param object Key: SPARK-7407 URL: https://issues.apache.org/jira/browse/SPARK-7407 Project:

[jira] [Commented] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531566#comment-14531566 ] Glenn Weidner commented on SPARK-7352: -- [~josephkb] Is Document Frequency a more

[jira] [Commented] (SPARK-7371) DAG visualization: put less emphasis on RDDs on stage page

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531603#comment-14531603 ] Apache Spark commented on SPARK-7371: - User 'andrewor14' has created a pull request

[jira] [Assigned] (SPARK-7371) DAG visualization: put less emphasis on RDDs on stage page

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7371: --- Assignee: Apache Spark (was: Andrew Or) DAG visualization: put less emphasis on RDDs on

[jira] [Assigned] (SPARK-7371) DAG visualization: put less emphasis on RDDs on stage page

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7371: --- Assignee: Andrew Or (was: Apache Spark) DAG visualization: put less emphasis on RDDs on

[jira] [Assigned] (SPARK-7408) Dag visualization: move style from JS to CSS

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7408: --- Assignee: Andrew Or (was: Apache Spark) Dag visualization: move style from JS to CSS

[jira] [Updated] (SPARK-7408) DAG visualization: move style from JS to CSS

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7408: - Summary: DAG visualization: move style from JS to CSS (was: Dag visualization: move style from JS to

[jira] [Updated] (SPARK-7391) DAG visualization: open viz on stage page if from job page

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7391: - Summary: DAG visualization: open viz on stage page if from job page (was: Dag visualization: open viz on

[jira] [Created] (SPARK-7412) Designing distributed prediction model abstractions for spark.ml

2015-05-06 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-7412: Summary: Designing distributed prediction model abstractions for spark.ml Key: SPARK-7412 URL: https://issues.apache.org/jira/browse/SPARK-7412 Project:

[jira] [Created] (SPARK-7413) Time to write shuffle spill files is not captured in ShuffleWriteMetrics

2015-05-06 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7413: - Summary: Time to write shuffle spill files is not captured in ShuffleWriteMetrics Key: SPARK-7413 URL: https://issues.apache.org/jira/browse/SPARK-7413 Project: Spark

[jira] [Commented] (SPARK-7413) Time to write shuffle spill files is not captured in ShuffleWriteMetrics

2015-05-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531702#comment-14531702 ] Josh Rosen commented on SPARK-7413: --- /cc [~kayousterhout], who's pretty familiar with

[jira] [Created] (SPARK-7417) Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite neglect dependencies

2015-05-06 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7417: Summary: Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite neglect dependencies Key: SPARK-7417 URL: https://issues.apache.org/jira/browse/SPARK-7417 Project: Spark

[jira] [Created] (SPARK-7416) Shuffle performance metrics umbrella

2015-05-06 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7416: - Summary: Shuffle performance metrics umbrella Key: SPARK-7416 URL: https://issues.apache.org/jira/browse/SPARK-7416 Project: Spark Issue Type: Umbrella

[jira] [Commented] (SPARK-7416) Shuffle performance metrics umbrella

2015-05-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531740#comment-14531740 ] Josh Rosen commented on SPARK-7416: --- It would be very useful to have a more fine-grained

[jira] [Created] (SPARK-7410) Add option to avoid broadcasting configuration with newAPIHadoopFile

2015-05-06 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-7410: - Summary: Add option to avoid broadcasting configuration with newAPIHadoopFile Key: SPARK-7410 URL: https://issues.apache.org/jira/browse/SPARK-7410 Project: Spark

[jira] [Created] (SPARK-7414) Flaky test: o.a.s.deploy.SparkSubmitSuite --jars

2015-05-06 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7414: Summary: Flaky test: o.a.s.deploy.SparkSubmitSuite --jars Key: SPARK-7414 URL: https://issues.apache.org/jira/browse/SPARK-7414 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7384) Fix flaky tests for distributed mode in BroadcastSuite

2015-05-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7384: --- Assignee: Shixiong Zhu Fix flaky tests for distributed mode in BroadcastSuite

[jira] [Created] (SPARK-7411) CTAS parser is incomplete

2015-05-06 Thread Michael Armbrust (JIRA)
Michael Armbrust created SPARK-7411: --- Summary: CTAS parser is incomplete Key: SPARK-7411 URL: https://issues.apache.org/jira/browse/SPARK-7411 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7377) DAG visualization: JS error when there is only 1 RDD

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7377: - Summary: DAG visualization: JS error when there is only 1 RDD (was: DAG visualization: JS error when

[jira] [Updated] (SPARK-7315) Flaky Test: WriteAheadLogBackedBlockRDDSuite

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7315: - Labels: flaky-test (was: ) Flaky Test: WriteAheadLogBackedBlockRDDSuite

[jira] [Created] (SPARK-7418) Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite search for artifacts

2015-05-06 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7418: Summary: Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite search for artifacts Key: SPARK-7418 URL: https://issues.apache.org/jira/browse/SPARK-7418 Project: Spark

[jira] [Commented] (SPARK-7413) Time to write shuffle spill files is not captured in ShuffleWriteMetrics

2015-05-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531736#comment-14531736 ] Josh Rosen commented on SPARK-7413: --- I've also created SPARK-7416 as a more general

[jira] [Updated] (SPARK-7417) Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite neglect dependencies

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7417: - Labels: flaky-test (was: ) Flaky test: o.a.s.deploy.SparkSubmitUtilsSuite neglect dependencies

[jira] [Commented] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531572#comment-14531572 ] Xiangrui Meng commented on SPARK-7352: -- +1 on keeping `minDocFreq`, which is

[jira] [Closed] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-7352. Resolution: Not A Problem ml.feature.IDF should rename minDocFreq to minDocCount

[jira] [Created] (SPARK-7408) Dag visualization: move style from JS to CSS

2015-05-06 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7408: Summary: Dag visualization: move style from JS to CSS Key: SPARK-7408 URL: https://issues.apache.org/jira/browse/SPARK-7408 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-7408) Dag visualization: move style from JS to CSS

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531604#comment-14531604 ] Apache Spark commented on SPARK-7408: - User 'andrewor14' has created a pull request

[jira] [Updated] (SPARK-7414) Flaky test: o.a.s.deploy.SparkSubmitSuite --jars

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7414?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7414: - Description: Observed recently in master with a not so helpful error message: {code} The code passed to

[jira] [Updated] (SPARK-7415) Flaky test: o.a.s.deploy.DriverSuite

2015-05-06 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7415: - Labels: flaky-test (was: ) Flaky test: o.a.s.deploy.DriverSuite

[jira] [Created] (SPARK-7415) Flaky test: o.a.s.deploy.DriverSuite

2015-05-06 Thread Andrew Or (JIRA)
Andrew Or created SPARK-7415: Summary: Flaky test: o.a.s.deploy.DriverSuite Key: SPARK-7415 URL: https://issues.apache.org/jira/browse/SPARK-7415 Project: Spark Issue Type: Bug

[jira] [Closed] (SPARK-7370) Add missing items to pyspark.mllib.linalg.Vectors

2015-05-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar closed SPARK-7370. -- Resolution: Duplicate Add missing items to pyspark.mllib.linalg.Vectors

[jira] [Resolved] (SPARK-6201) INSET should coerce types

2015-05-06 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-6201. - Resolution: Fixed Fix Version/s: 1.4.0 INSET should coerce types -

[jira] [Commented] (SPARK-6910) Support for pushing predicates down to metastore for partition pruning

2015-05-06 Thread Ashwin Shankar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531325#comment-14531325 ] Ashwin Shankar commented on SPARK-6910: --- +1, this is causing pretty bad user

[jira] [Created] (SPARK-7406) Add tooltips for Scheduling Delay, Processing Time and Total Delay in Streaming WebUI

2015-05-06 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-7406: --- Summary: Add tooltips for Scheduling Delay, Processing Time and Total Delay in Streaming WebUI Key: SPARK-7406 URL: https://issues.apache.org/jira/browse/SPARK-7406

[jira] [Commented] (SPARK-7406) Add tooltips for Scheduling Delay, Processing Time and Total Delay in Streaming WebUI

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531457#comment-14531457 ] Apache Spark commented on SPARK-7406: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7352: --- Assignee: (was: Apache Spark) ml.feature.IDF should rename minDocFreq to minDocCount

[jira] [Commented] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531491#comment-14531491 ] Apache Spark commented on SPARK-7352: - User 'gweidner' has created a pull request for

[jira] [Assigned] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7352: --- Assignee: Apache Spark ml.feature.IDF should rename minDocFreq to minDocCount

[jira] [Resolved] (SPARK-7384) Fix flaky tests for distributed mode in BroadcastSuite

2015-05-06 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-7384. - Resolution: Fixed Fix Version/s: 1.4.0 Fix flaky tests for distributed mode in

[jira] [Assigned] (SPARK-7375) Avoid defensive copying in SQL exchange operator when sort-based shuffle buffers data in serialized form

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7375: --- Assignee: Apache Spark Avoid defensive copying in SQL exchange operator when sort-based

[jira] [Commented] (SPARK-2484) Build should not run hive compatibility tests by default.

2015-05-06 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531452#comment-14531452 ] Michael Armbrust commented on SPARK-2484: - Can we undo this change? Its weird to

[jira] [Assigned] (SPARK-7406) Add tooltips for Scheduling Delay, Processing Time and Total Delay in Streaming WebUI

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7406: --- Assignee: (was: Apache Spark) Add tooltips for Scheduling Delay, Processing Time and

[jira] [Assigned] (SPARK-7406) Add tooltips for Scheduling Delay, Processing Time and Total Delay in Streaming WebUI

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7406: --- Assignee: Apache Spark Add tooltips for Scheduling Delay, Processing Time and Total Delay

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-05-06 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531498#comment-14531498 ] Bryan Cutler commented on SPARK-6980: - [~harshg], sorry I have not seen those

[jira] [Assigned] (SPARK-7375) Avoid defensive copying in SQL exchange operator when sort-based shuffle buffers data in serialized form

2015-05-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-7375: - Assignee: Josh Rosen Avoid defensive copying in SQL exchange operator when sort-based shuffle

[jira] [Commented] (SPARK-7405) Fix the bug that ReceiverInputDStream doesn't report InputInfo

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531350#comment-14531350 ] Apache Spark commented on SPARK-7405: - User 'zsxwing' has created a pull request for

[jira] [Assigned] (SPARK-7405) Fix the bug that ReceiverInputDStream doesn't report InputInfo

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7405: --- Assignee: Apache Spark Fix the bug that ReceiverInputDStream doesn't report InputInfo

[jira] [Assigned] (SPARK-7405) Fix the bug that ReceiverInputDStream doesn't report InputInfo

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7405: --- Assignee: (was: Apache Spark) Fix the bug that ReceiverInputDStream doesn't report

[jira] [Commented] (SPARK-7316) Add step capability to RDD sliding window

2015-05-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531296#comment-14531296 ] Joseph K. Bradley commented on SPARK-7316: -- Definitely makes sense for time

[jira] [Commented] (SPARK-7375) Avoid defensive copying in SQL exchange operator when sort-based shuffle buffers data in serialized form

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531303#comment-14531303 ] Apache Spark commented on SPARK-7375: - User 'JoshRosen' has created a pull request for

[jira] [Assigned] (SPARK-7375) Avoid defensive copying in SQL exchange operator when sort-based shuffle buffers data in serialized form

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7375: --- Assignee: (was: Apache Spark) Avoid defensive copying in SQL exchange operator when

[jira] [Commented] (SPARK-6799) Add dataframe examples for SparkR

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531324#comment-14531324 ] Apache Spark commented on SPARK-6799: - User 'shivaram' has created a pull request for

[jira] [Commented] (SPARK-6824) Fill the docs for DataFrame API in SparkR

2015-05-06 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530855#comment-14530855 ] Shivaram Venkataraman commented on SPARK-6824: -- [~qhuang] A related issue is

[jira] [Updated] (SPARK-6824) Fill the docs for DataFrame API in SparkR

2015-05-06 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-6824: - Assignee: Qian Huang Fill the docs for DataFrame API in SparkR

[jira] [Updated] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-05-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7401: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-7203 Dot product and

[jira] [Commented] (SPARK-7035) Drop __getattr__ on pyspark.sql.DataFrame

2015-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530972#comment-14530972 ] Xiangrui Meng commented on SPARK-7035: -- I wonder if we provide both options how many

[jira] [Updated] (SPARK-5832) Add Affinity Propagation clustering algorithm

2015-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5832: - Target Version/s: 1.5.0 (was: 1.4.0) Add Affinity Propagation clustering algorithm

[jira] [Commented] (SPARK-1442) Add Window function support

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531067#comment-14531067 ] Apache Spark commented on SPARK-1442: - User 'yhuai' has created a pull request for

[jira] [Assigned] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7401: --- Assignee: Apache Spark Dot product and squared_distances should be vectorized in Vectors

[jira] [Commented] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531096#comment-14531096 ] Apache Spark commented on SPARK-7401: - User 'MechCoder' has created a pull request for

[jira] [Assigned] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7401: --- Assignee: (was: Apache Spark) Dot product and squared_distances should be vectorized in

[jira] [Updated] (SPARK-7370) Add missing items to pyspark.mllib.linalg.Vectors

2015-05-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7370: - External issue URL: (was: https://issues.apache.org/jira/browse/SPARK-7328) Add

[jira] [Created] (SPARK-7402) JSON serialization of params

2015-05-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7402: Summary: JSON serialization of params Key: SPARK-7402 URL: https://issues.apache.org/jira/browse/SPARK-7402 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-6258) Python MLlib API missing items: Clustering

2015-05-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531130#comment-14531130 ] Joseph K. Bradley commented on SPARK-6258: -- [~yanboliang] That will be

[jira] [Commented] (SPARK-7403) Link URL in objects on Timeline View is wrong in case of running on YARN

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531200#comment-14531200 ] Apache Spark commented on SPARK-7403: - User 'sarutak' has created a pull request for

[jira] [Assigned] (SPARK-7403) Link URL in objects on Timeline View is wrong in case of running on YARN

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7403: --- Assignee: Apache Spark Link URL in objects on Timeline View is wrong in case of running on

[jira] [Updated] (SPARK-6725) Model export/import for Pipeline API

2015-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6725: - Issue Type: Umbrella (was: New Feature) Model export/import for Pipeline API

[jira] [Commented] (SPARK-7316) Add step capability to RDD sliding window

2015-05-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531151#comment-14531151 ] Joseph K. Bradley commented on SPARK-7316: -- I've spoken with [~mengxr], and this

[jira] [Assigned] (SPARK-7404) Add RegressionEvaluator to spark.ml

2015-05-06 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-7404: Assignee: Xiangrui Meng Add RegressionEvaluator to spark.ml

[jira] [Created] (SPARK-7404) Add RegressionEvaluator to spark.ml

2015-05-06 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7404: Summary: Add RegressionEvaluator to spark.ml Key: SPARK-7404 URL: https://issues.apache.org/jira/browse/SPARK-7404 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-7316) Add step capability to RDD sliding window

2015-05-06 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531229#comment-14531229 ] Alexander Ulanov commented on SPARK-7316: - I would say that the major use case is

[jira] [Resolved] (SPARK-5456) Decimal Type comparison issue

2015-05-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5456. Resolution: Fixed Fix Version/s: 1.4.0 1.3.2 1.2.3

[jira] [Updated] (SPARK-7328) Add missing items to pyspark.mllib.linalg.Vectors

2015-05-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7328: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-7203 Add missing items to

[jira] [Updated] (SPARK-6201) INSET should coerce types

2015-05-06 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-6201: Assignee: Adrian Wang INSET should coerce types - Key:

[jira] [Resolved] (SPARK-7311) Enable in-memory serialized map-side shuffle to work with SQL serializers

2015-05-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-7311. --- Resolution: Fixed Fix Version/s: 1.4.0 Enable in-memory serialized map-side shuffle to work

[jira] [Commented] (SPARK-7311) Enable in-memory serialized map-side shuffle to work with SQL serializers

2015-05-06 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531037#comment-14531037 ] Josh Rosen commented on SPARK-7311: --- Fixed by my PR for 1.4.0. Enable in-memory

[jira] [Updated] (SPARK-1442) Add Window function support

2015-05-06 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-1442: Assignee: guowei Add Window function support --- Key:

[jira] [Updated] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-05-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7401: --- Component/s: PySpark MLlib Dot product and squared_distances should be vectorized

[jira] [Commented] (SPARK-5281) Registering table on RDD is giving MissingRequirementError

2015-05-06 Thread Yves Raimond (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531082#comment-14531082 ] Yves Raimond commented on SPARK-5281: - +1 - this workaround does the trick with

[jira] [Assigned] (SPARK-4669) Allow users to set arbitrary akka configurations via property file

2015-05-06 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-4669: --- Assignee: (was: Apache Spark) Allow users to set arbitrary akka configurations via

[jira] [Updated] (SPARK-7401) Dot product and squared_distances should be vectorized in Vectors

2015-05-06 Thread Manoj Kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manoj Kumar updated SPARK-7401: --- Summary: Dot product and squared_distances should be vectorized in Vectors (was: Dot product and

[jira] [Commented] (SPARK-1437) Jenkins should build with Java 6

2015-05-06 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531012#comment-14531012 ] shane knapp commented on SPARK-1437: that's my understanding: all pre-1.4 builds will

[jira] [Resolved] (SPARK-1442) Add Window function support

2015-05-06 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-1442. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5604

[jira] [Commented] (SPARK-7352) ml.feature.IDF should rename minDocFreq to minDocCount

2015-05-06 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7352?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531064#comment-14531064 ] Glenn Weidner commented on SPARK-7352: -- I can make the change to ml\feature\IDF.scala

[jira] [Commented] (SPARK-664) Accumulator updates should get locally merged before sent to the driver

2015-05-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530859#comment-14530859 ] Reynold Xin commented on SPARK-664: --- I'm going to close this due to inactivity. Also

[jira] [Created] (SPARK-7401) Dot product and squared_distances should be vectorized

2015-05-06 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-7401: -- Summary: Dot product and squared_distances should be vectorized Key: SPARK-7401 URL: https://issues.apache.org/jira/browse/SPARK-7401 Project: Spark Issue Type:

[jira] [Closed] (SPARK-664) Accumulator updates should get locally merged before sent to the driver

2015-05-06 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-664. - Resolution: Won't Fix Accumulator updates should get locally merged before sent to the driver

[jira] [Commented] (SPARK-6812) filter() on DataFrame does not work as expected

2015-05-06 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14530890#comment-14530890 ] Shivaram Venkataraman commented on SPARK-6812: -- Regarding the name conflicts

<    1   2   3   >