[jira] [Commented] (SPARK-11381) Replace example code in mllib-linear-methods.md using include_example

2015-12-11 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052513#comment-15052513 ] Xusen Yin commented on SPARK-11381: --- [~somi...@us.ibm.com] This JIRA is blocked by

[jira] [Commented] (SPARK-6363) make scala 2.11 default language

2015-12-11 Thread Ismael Juma (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052631#comment-15052631 ] Ismael Juma commented on SPARK-6363: It's also worth pointing out that Scala 2.10 is no longer

[jira] [Commented] (SPARK-10625) Spark SQL JDBC read/write is unable to handle JDBC Drivers that adds unserializable objects into connection properties

2015-12-11 Thread Chandra Sekhar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052617#comment-15052617 ] Chandra Sekhar commented on SPARK-10625: Can i test this now? which version I have to download

[jira] [Commented] (SPARK-11381) Replace example code in mllib-linear-methods.md using include_example

2015-12-11 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052512#comment-15052512 ] Xusen Yin commented on SPARK-11381: --- [~somi...@us.ibm.com] This JIRA is blocked by

[jira] [Updated] (SPARK-6363) make scala 2.11 default language

2015-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6363: - Target Version/s: 2.0.0 > make scala 2.11 default language > > >

[jira] [Commented] (SPARK-9858) Introduce an ExchangeCoordinator to estimate the number of post-shuffle partitions.

2015-12-11 Thread Adam Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052603#comment-15052603 ] Adam Roberts commented on SPARK-9858: - Modifying the UnsafeRowSerializer to always write/read in LE

[jira] [Commented] (SPARK-12264) Could DataType provide a TypeTag?

2015-12-11 Thread Andras Nemeth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052612#comment-15052612 ] Andras Nemeth commented on SPARK-12264: --- I guess my concrete proposal was a bit hidden in the last

[jira] [Commented] (SPARK-6918) Secure HBase with Kerberos does not work over YARN

2015-12-11 Thread Pierre Beauvois (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052737#comment-15052737 ] Pierre Beauvois commented on SPARK-6918: I found nothing about org.apache.spark.deploy.yarn.Client

[jira] [Commented] (SPARK-12218) Boolean logic in sql does not work "not (A and B)" is not the same as "(not A) or (not B)"

2015-12-11 Thread Irakli Machabeli (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052759#comment-15052759 ] Irakli Machabeli commented on SPARK-12218: -- The bug itself is really dangerous, it's ok if it

[jira] [Commented] (SPARK-12218) Boolean logic in sql does not work "not (A and B)" is not the same as "(not A) or (not B)"

2015-12-11 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052763#comment-15052763 ] Xiao Li commented on SPARK-12218: - Agree! I will do a search to find out what happened in the push down

[jira] [Commented] (SPARK-11136) Warm-start support for ML estimator

2015-12-11 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052718#comment-15052718 ] Xusen Yin commented on SPARK-11136: --- I add a [design

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2015-12-11 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052852#comment-15052852 ] Steve Loughran commented on SPARK-2356: --- I've stuck up binaries compatible with Hadoop 2.6 & 2.7, to

[jira] [Commented] (SPARK-11193) Spark 1.5+ Kinesis Streaming - ClassCastException when starting KinesisReceiver

2015-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052755#comment-15052755 ] Sean Owen commented on SPARK-11193: --- [~phibit] are you able to test the change in

[jira] [Updated] (SPARK-12275) No plan for BroadcastHint in some condition

2015-12-11 Thread yucai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yucai updated SPARK-12275: -- Description: *Summary* No plan for BroadcastHint is generated in some condition. *Test Case* {code} val

[jira] [Created] (SPARK-12280) "--packages" command doesn't work in "spark-submit"

2015-12-11 Thread Anton Loss (JIRA)
Anton Loss created SPARK-12280: -- Summary: "--packages" command doesn't work in "spark-submit" Key: SPARK-12280 URL: https://issues.apache.org/jira/browse/SPARK-12280 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12275) No plan for BroadcastHint in some condition

2015-12-11 Thread yucai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15052916#comment-15052916 ] yucai commented on SPARK-12275: --- *Root Cause* When BasicOperators's "case BroadcastHint(child)" is hit (in

[jira] [Commented] (SPARK-9690) Add random seed Param to PySpark CrossValidator

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053588#comment-15053588 ] Apache Spark commented on SPARK-9690: - User 'jkbradley' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12276) Prevent RejectedExecutionException by checking if ThreadPoolExecutor is shutdown and its capacity

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12276: Assignee: Apache Spark > Prevent RejectedExecutionException by checking if

[jira] [Assigned] (SPARK-12276) Prevent RejectedExecutionException by checking if ThreadPoolExecutor is shutdown and its capacity

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12276: Assignee: (was: Apache Spark) > Prevent RejectedExecutionException by checking if

[jira] [Updated] (SPARK-9694) Add random seed Param to Scala CrossValidator

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9694: - Assignee: Yanbo Liang > Add random seed Param to Scala CrossValidator >

[jira] [Updated] (SPARK-9694) Add random seed Param to Scala CrossValidator

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9694: - Shepherd: Joseph K. Bradley Target Version/s: 2.0.0 (was: ) > Add random

[jira] [Created] (SPARK-12284) Output UnsafeRow from window function

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12284: -- Summary: Output UnsafeRow from window function Key: SPARK-12284 URL: https://issues.apache.org/jira/browse/SPARK-12284 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-11529) Add section in user guide for StreamingLogisticRegressionWithSGD

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11529: -- Target Version/s: (was: 1.6.0) > Add section in user guide for

[jira] [Created] (SPARK-12286) Support UnsafeRow in all SparkPlan (if possible)

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12286: -- Summary: Support UnsafeRow in all SparkPlan (if possible) Key: SPARK-12286 URL: https://issues.apache.org/jira/browse/SPARK-12286 Project: Spark Issue Type:

[jira] [Created] (SPARK-12289) Support UnsafeRow in TakeOrderedAndProject/Limit

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12289: -- Summary: Support UnsafeRow in TakeOrderedAndProject/Limit Key: SPARK-12289 URL: https://issues.apache.org/jira/browse/SPARK-12289 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11885) UDAF may nondeterministically generate wrong results

2015-12-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053616#comment-15053616 ] Yin Huai commented on SPARK-11885: -- [~davies] btw, which exprId was generated at executor side? > UDAF

[jira] [Closed] (SPARK-12047) Unhelpful error messages generated by JavaDoc while doing sbt unidoc

2015-12-11 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neelesh Srinivas Salian closed SPARK-12047. --- Resolution: Duplicate > Unhelpful error messages generated by JavaDoc while

[jira] [Created] (SPARK-12290) Change the default value in SparkPlan

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12290: -- Summary: Change the default value in SparkPlan Key: SPARK-12290 URL: https://issues.apache.org/jira/browse/SPARK-12290 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-12272) Gradient boosted trees: too slow at the first finding best siplts

2015-12-11 Thread Wenmin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenmin Wu updated SPARK-12272: -- Attachment: screenshot-1.png > Gradient boosted trees: too slow at the first finding best siplts >

[jira] [Commented] (SPARK-6270) Standalone Master hangs when streaming job completes and event logging is enabled

2015-12-11 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053810#comment-15053810 ] Josh Rosen commented on SPARK-6270: --- While I think that we should have this discussion about UI

[jira] [Commented] (SPARK-10931) PySpark ML Models should contain Param values

2015-12-11 Thread Evan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053808#comment-15053808 ] Evan Chen commented on SPARK-10931: --- Hey Joseph, Thanks for the suggestion. I was wondering what

[jira] [Updated] (SPARK-12272) Gradient boosted trees: too slow at the first finding best siplts

2015-12-11 Thread Wenmin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenmin Wu updated SPARK-12272: -- Attachment: training-log3.png > Gradient boosted trees: too slow at the first finding best siplts >

[jira] [Created] (SPARK-12282) Document spark.jars

2015-12-11 Thread Justin Bailey (JIRA)
Justin Bailey created SPARK-12282: - Summary: Document spark.jars Key: SPARK-12282 URL: https://issues.apache.org/jira/browse/SPARK-12282 Project: Spark Issue Type: Documentation

[jira] [Updated] (SPARK-12280) "--packages" command doesn't work in "spark-submit"

2015-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12280: -- Component/s: Spark Submit > "--packages" command doesn't work in "spark-submit" >

[jira] [Updated] (SPARK-11497) PySpark RowMatrix Constructor Has Type Erasure Issue

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11497: -- Target Version/s: 1.6.1, 2.0.0 > PySpark RowMatrix Constructor Has Type Erasure Issue

[jira] [Created] (SPARK-12283) Use UnsafeRow as the buffer in SortBasedAggregation to avoid Unsafe/Safe conversion

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12283: -- Summary: Use UnsafeRow as the buffer in SortBasedAggregation to avoid Unsafe/Safe conversion Key: SPARK-12283 URL: https://issues.apache.org/jira/browse/SPARK-12283

[jira] [Assigned] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12217: Assignee: Apache Spark > Document invalid handling for StringIndexer >

[jira] [Assigned] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12217: Assignee: (was: Apache Spark) > Document invalid handling for StringIndexer >

[jira] [Assigned] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12217: Assignee: Apache Spark > Document invalid handling for StringIndexer >

[jira] [Assigned] (SPARK-12286) Support UnsafeRow in all SparkPlan (if possible)

2015-12-11 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu reassigned SPARK-12286: -- Assignee: Davies Liu > Support UnsafeRow in all SparkPlan (if possible) >

[jira] [Updated] (SPARK-12287) Support UnsafeRow in MapPartitions/MapGroups/CoGroup

2015-12-11 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-12287: --- Issue Type: Improvement (was: Epic) > Support UnsafeRow in MapPartitions/MapGroups/CoGroup >

[jira] [Updated] (SPARK-11965) Update user guide for RFormula feature interactions

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11965: -- Assignee: Yanbo Liang > Update user guide for RFormula feature interactions >

[jira] [Created] (SPARK-12288) Support UnsafeRow in Coalesce/Except/Intersect

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12288: -- Summary: Support UnsafeRow in Coalesce/Except/Intersect Key: SPARK-12288 URL: https://issues.apache.org/jira/browse/SPARK-12288 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11885) UDAF may nondeterministically generate wrong results

2015-12-11 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053613#comment-15053613 ] Yin Huai commented on SPARK-11885: -- Thanks [~davies]! [~milad.bourh...@gmail.com] Can you try our latest

[jira] [Commented] (SPARK-12047) Unhelpful error messages generated by JavaDoc while doing sbt unidoc

2015-12-11 Thread Neelesh Srinivas Salian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053614#comment-15053614 ] Neelesh Srinivas Salian commented on SPARK-12047: - Closing these since they are

[jira] [Resolved] (SPARK-12273) Spark Streaming Web UI does not list Receivers in order

2015-12-11 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-12273. -- Resolution: Fixed Assignee: (was: Apache Spark) Fix Version/s: 2.0.0 >

[jira] [Commented] (SPARK-12281) Fixed potential exceptions when exiting a local cluster.

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053648#comment-15053648 ] Apache Spark commented on SPARK-12281: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Created] (SPARK-12285) MLlib user guide: umbrella for missing sections

2015-12-11 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-12285: - Summary: MLlib user guide: umbrella for missing sections Key: SPARK-12285 URL: https://issues.apache.org/jira/browse/SPARK-12285 Project: Spark

[jira] [Commented] (SPARK-11606) ML 1.6 QA: Update user guide for new APIs

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053734#comment-15053734 ] Joseph K. Bradley commented on SPARK-11606: --- I'm going to split off the remaining guide

[jira] [Resolved] (SPARK-11606) ML 1.6 QA: Update user guide for new APIs

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11606. --- Resolution: Fixed Fix Version/s: 1.6.0 > ML 1.6 QA: Update user guide for new

[jira] [Updated] (SPARK-12272) Gradient boosted trees: too slow at the first finding best siplts

2015-12-11 Thread Wenmin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenmin Wu updated SPARK-12272: -- Attachment: (was: screenshot-1.png) > Gradient boosted trees: too slow at the first finding best

[jira] [Updated] (SPARK-12272) Gradient boosted trees: too slow at the first finding best siplts

2015-12-11 Thread Wenmin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenmin Wu updated SPARK-12272: -- Attachment: training-log2.pnd.png > Gradient boosted trees: too slow at the first finding best siplts

[jira] [Created] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2015-12-11 Thread Ryan Blue (JIRA)
Ryan Blue created SPARK-12297: - Summary: Add work-around for Parquet/Hive int96 timestamp bug. Key: SPARK-12297 URL: https://issues.apache.org/jira/browse/SPARK-12297 Project: Spark Issue Type:

[jira] [Assigned] (SPARK-12281) Fixed potential exceptions when exiting a local cluster.

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12281: Assignee: Apache Spark > Fixed potential exceptions when exiting a local cluster. >

[jira] [Commented] (SPARK-11885) UDAF may nondeterministically generate wrong results

2015-12-11 Thread Milad Bourhani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053634#comment-15053634 ] Milad Bourhani commented on SPARK-11885: Sure, I'll give it a go next week :) I'll write the

[jira] [Updated] (SPARK-11497) PySpark RowMatrix Constructor Has Type Erasure Issue

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11497: -- Target Version/s: 1.5.3, 1.6.1, 2.0.0 (was: 1.6.1, 2.0.0) > PySpark RowMatrix

[jira] [Resolved] (SPARK-11497) PySpark RowMatrix Constructor Has Type Erasure Issue

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11497. --- Resolution: Fixed Fix Version/s: 1.6.1 1.5.3

[jira] [Commented] (SPARK-10931) PySpark ML Models should contain Param values

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053721#comment-15053721 ] Apache Spark commented on SPARK-10931: -- User 'evanyc15' has created a pull request for this issue:

[jira] [Updated] (SPARK-6518) Add example code and user guide for bisecting k-means

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6518: - Issue Type: Sub-task (was: Documentation) Parent: SPARK-12285 > Add example code

[jira] [Updated] (SPARK-12215) User guide section for KMeans in spark.ml

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12215: -- Issue Type: Sub-task (was: Documentation) Parent: SPARK-12285 > User guide

[jira] [Created] (SPARK-12291) Support UnsafeRow in BroadcastLeftSemiJoinHash

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12291: -- Summary: Support UnsafeRow in BroadcastLeftSemiJoinHash Key: SPARK-12291 URL: https://issues.apache.org/jira/browse/SPARK-12291 Project: Spark Issue Type:

[jira] [Issue Comment Deleted] (SPARK-6725) Model export/import for Pipeline API

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-6725: - Comment: was deleted (was: User 'anabranch' has created a pull request for this issue:

[jira] [Commented] (SPARK-11193) Spark 1.5+ Kinesis Streaming - ClassCastException when starting KinesisReceiver

2015-12-11 Thread Phil Kallos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053639#comment-15053639 ] Phil Kallos commented on SPARK-11193: - yes, code looks great to me, thanks JB and Sean. any

[jira] [Updated] (SPARK-12282) Document spark.jars

2015-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12282: -- Priority: Trivial (was: Major) Component/s: Documentation I don't see evidence this is

[jira] [Commented] (SPARK-11959) Document normal equation solver for ordinary least squares in user guide

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053743#comment-15053743 ] Joseph K. Bradley commented on SPARK-11959: --- [~yanboliang] Will you have time to write this

[jira] [Updated] (SPARK-11959) Document normal equation solver for ordinary least squares in user guide

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11959: -- Issue Type: Sub-task (was: Documentation) Parent: SPARK-12285 > Document

[jira] [Updated] (SPARK-11965) Update user guide for RFormula feature interactions

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11965: -- Issue Type: Sub-task (was: Documentation) Parent: SPARK-12285 > Update user

[jira] [Updated] (SPARK-12272) Gradient boosted trees: too slow at the first finding best siplts

2015-12-11 Thread Wenmin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenmin Wu updated SPARK-12272: -- Attachment: training-log1.png > Gradient boosted trees: too slow at the first finding best siplts >

[jira] [Created] (SPARK-12281) Fixed potential exceptions when exiting a local cluster.

2015-12-11 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-12281: Summary: Fixed potential exceptions when exiting a local cluster. Key: SPARK-12281 URL: https://issues.apache.org/jira/browse/SPARK-12281 Project: Spark

[jira] [Updated] (SPARK-12247) Documentation for spark.ml's ALS and collaborative filtering in general

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12247: -- Parent Issue: SPARK-12285 (was: SPARK-8517) > Documentation for spark.ml's ALS and

[jira] [Updated] (SPARK-11529) Add section in user guide for StreamingLogisticRegressionWithSGD

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11529: -- Issue Type: Sub-task (was: Documentation) Parent: SPARK-12285 > Add section

[jira] [Created] (SPARK-12293) Support UnsafeRow in LocalTableScan

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12293: -- Summary: Support UnsafeRow in LocalTableScan Key: SPARK-12293 URL: https://issues.apache.org/jira/browse/SPARK-12293 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-12294) Support UnsafeRow in HiveTableScan

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12294: -- Summary: Support UnsafeRow in HiveTableScan Key: SPARK-12294 URL: https://issues.apache.org/jira/browse/SPARK-12294 Project: Spark Issue Type: Improvement

[jira] [Comment Edited] (SPARK-10931) PySpark ML Models should contain Param values

2015-12-11 Thread Evan Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053808#comment-15053808 ] Evan Chen edited comment on SPARK-10931 at 12/11/15 11:51 PM: -- Hey Joseph,

[jira] [Created] (SPARK-12296) Feature parity for pyspark.mllib StandardScalerModel

2015-12-11 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-12296: - Summary: Feature parity for pyspark.mllib StandardScalerModel Key: SPARK-12296 URL: https://issues.apache.org/jira/browse/SPARK-12296 Project: Spark

[jira] [Updated] (SPARK-12296) Feature parity for pyspark.mllib StandardScalerModel

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12296: -- Issue Type: Sub-task (was: New Feature) Parent: SPARK-11937 > Feature parity

[jira] [Commented] (SPARK-6523) Error when get attribute of StandardScalerModel, When use python api

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053830#comment-15053830 ] Joseph K. Bradley commented on SPARK-6523: -- You're right; sorry I did not see that PR as it was

[jira] [Assigned] (SPARK-10931) PySpark ML Models should contain Param values

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10931: Assignee: Apache Spark > PySpark ML Models should contain Param values >

[jira] [Assigned] (SPARK-10931) PySpark ML Models should contain Param values

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10931: Assignee: (was: Apache Spark) > PySpark ML Models should contain Param values >

[jira] [Created] (SPARK-12287) Support UnsafeRow in MapPartitions/MapGroups/CoGroup

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12287: -- Summary: Support UnsafeRow in MapPartitions/MapGroups/CoGroup Key: SPARK-12287 URL: https://issues.apache.org/jira/browse/SPARK-12287 Project: Spark Issue Type:

[jira] [Created] (SPARK-12295) Manage the memory used by window function

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12295: -- Summary: Manage the memory used by window function Key: SPARK-12295 URL: https://issues.apache.org/jira/browse/SPARK-12295 Project: Spark Issue Type:

[jira] [Commented] (SPARK-11606) ML 1.6 QA: Update user guide for new APIs

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053790#comment-15053790 ] Joseph K. Bradley commented on SPARK-11606: --- I'll close this now that [SPARK-12285] contains

[jira] [Updated] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-12217: -- Assignee: Benjamin Fradet (was: Apache Spark) > Document invalid handling for

[jira] [Commented] (SPARK-12183) Remove spark.mllib tree, forest implementations and use spark.ml

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053823#comment-15053823 ] Joseph K. Bradley commented on SPARK-12183: --- Lower priority than both, really. This is more of

[jira] [Comment Edited] (SPARK-12282) Document spark.jars

2015-12-11 Thread Justin Bailey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053676#comment-15053676 ] Justin Bailey edited comment on SPARK-12282 at 12/11/15 10:18 PM: -- If

[jira] [Commented] (SPARK-12282) Document spark.jars

2015-12-11 Thread Justin Bailey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053676#comment-15053676 ] Justin Bailey commented on SPARK-12282: --- If you pass `--conf spark.jars=".."`, you can set this

[jira] [Created] (SPARK-12292) Support UnsafeRow in Generate

2015-12-11 Thread Davies Liu (JIRA)
Davies Liu created SPARK-12292: -- Summary: Support UnsafeRow in Generate Key: SPARK-12292 URL: https://issues.apache.org/jira/browse/SPARK-12292 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-12217) Document invalid handling for StringIndexer

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-12217. --- Resolution: Fixed Fix Version/s: 1.6.1 2.0.0 Issue

[jira] [Issue Comment Deleted] (SPARK-11937) Python API coverage check found issues for ML during 1.6 QA

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11937: -- Comment: was deleted (was: User 'holdenk' has created a pull request for this issue:

[jira] [Commented] (SPARK-12272) Gradient boosted trees: too slow at the first finding best siplts

2015-12-11 Thread Wenmin Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053827#comment-15053827 ] Wenmin Wu commented on SPARK-12272: --- I didn't run a test, but the click-through data of my company.

[jira] [Commented] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053933#comment-15053933 ] Joseph K. Bradley commented on SPARK-10285: --- I'll close the issue. Thanks! > Add @since

[jira] [Closed] (SPARK-10285) Add @since annotation to pyspark.ml.util

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-10285. - Resolution: Not A Problem Target Version/s: (was: 1.6.0) > Add @since

[jira] [Updated] (SPARK-10263) Add @Since annotation to ml.param and ml.*

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-10263: -- Target Version/s: (was: 1.6.0) > Add @Since annotation to ml.param and ml.* >

[jira] [Created] (SPARK-12301) Remove final from classes in spark.ml trees and ensembles where possible

2015-12-11 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-12301: - Summary: Remove final from classes in spark.ml trees and ensembles where possible Key: SPARK-12301 URL: https://issues.apache.org/jira/browse/SPARK-12301

[jira] [Commented] (SPARK-7131) Move tree,forest implementation from spark.mllib to spark.ml

2015-12-11 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15054000#comment-15054000 ] Joseph K. Bradley commented on SPARK-7131: -- Yes, I'm sorry about how long this has taken, but I

[jira] [Comment Edited] (SPARK-2870) Thorough schema inference directly on RDDs of Python dictionaries

2015-12-11 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053917#comment-15053917 ] holdenk edited comment on SPARK-2870 at 12/12/15 1:27 AM: -- So this seems to be

[jira] [Comment Edited] (SPARK-2870) Thorough schema inference directly on RDDs of Python dictionaries

2015-12-11 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053917#comment-15053917 ] holdenk edited comment on SPARK-2870 at 12/12/15 1:26 AM: -- So this seems to be

[jira] [Commented] (SPARK-2870) Thorough schema inference directly on RDDs of Python dictionaries

2015-12-11 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053917#comment-15053917 ] holdenk commented on SPARK-2870: So this seems to be resolved in Spark 1.6 with `createDataFrame` e.g.:

[jira] [Commented] (SPARK-12298) Infinite loop in DataFrame.sortWithinPartitions(String, String*)

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15053978#comment-15053978 ] Apache Spark commented on SPARK-12298: -- User 'ankurdave' has created a pull request for this issue:

[jira] [Assigned] (SPARK-9578) Stemmer feature transformer

2015-12-11 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9578: --- Assignee: (was: Apache Spark) > Stemmer feature transformer >

  1   2   >