[jira] [Commented] (SPARK-21972) Allow users to control input data persistence in ML Estimators via a handlePersistence ml.Param

2017-09-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178572#comment-16178572 ] zhengruifeng commented on SPARK-21972: -- [~WeichenXu123] your solution is reasonable.

[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2017-09-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178554#comment-16178554 ] zhengruifeng commented on SPARK-13030: -- Agree that we create another estimator and r

[jira] [Assigned] (SPARK-21947) monotonically_increasing_id doesn't work in Structured Streaming

2017-09-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21947: Assignee: (was: Apache Spark) > monotonically_increasing_id doesn't work in Structured

[jira] [Assigned] (SPARK-21947) monotonically_increasing_id doesn't work in Structured Streaming

2017-09-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21947: Assignee: Apache Spark > monotonically_increasing_id doesn't work in Structured Streaming

[jira] [Commented] (SPARK-21947) monotonically_increasing_id doesn't work in Structured Streaming

2017-09-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178531#comment-16178531 ] Apache Spark commented on SPARK-21947: -- User 'viirya' has created a pull request for

[jira] [Commented] (SPARK-22112) Add missing method to pyspark api: spark.read.csv(Dataset)

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178515#comment-16178515 ] Hyukjin Kwon commented on SPARK-22112: -- BTW, I think we should pass {{RDD}} instead

[jira] [Created] (SPARK-22113) Dataset shows in Hive is inconsistent with JDBC

2017-09-24 Thread Michael Fu (JIRA)
Michael Fu created SPARK-22113: -- Summary: Dataset shows in Hive is inconsistent with JDBC Key: SPARK-22113 URL: https://issues.apache.org/jira/browse/SPARK-22113 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-19700) Design an API for pluggable scheduler implementations

2017-09-24 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178459#comment-16178459 ] Andrew Ash commented on SPARK-19700: There was a thread on the dev list recently abou

[jira] [Comment Edited] (SPARK-22112) Add missing method to pyspark api: spark.read.csv(Dataset)

2017-09-24 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178458#comment-16178458 ] Liang-Chi Hsieh edited comment on SPARK-22112 at 9/25/17 2:36 AM: -

[jira] [Commented] (SPARK-22112) Add missing method to pyspark api: spark.read.csv(Dataset)

2017-09-24 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178458#comment-16178458 ] Liang-Chi Hsieh commented on SPARK-22112: - cc [~jmchung] or [~goldmedal] Maybe a

[jira] [Issue Comment Deleted] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2017-09-24 Thread Anthony Louis Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Louis Burns updated SPARK-2691: --- Comment: was deleted (was: would like to work on SPARK-8734, wondering where the PR fo

[jira] [Commented] (SPARK-2691) Allow Spark on Mesos to be launched with Docker

2017-09-24 Thread Anthony Louis Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178438#comment-16178438 ] Anthony Louis Burns commented on SPARK-2691: would like to work on SPARK-8734,

[jira] [Updated] (SPARK-22082) Spelling mistake: 'choosen' in API doc of R

2017-09-24 Thread zuotingbing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zuotingbing updated SPARK-22082: Summary: Spelling mistake: 'choosen' in API doc of R (was: Spelling mistake: choosen in API doc o

[jira] [Assigned] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22107: Assignee: John O'Leary > "as" should be "alias" in python quick start documentation >

[jira] [Updated] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-22107: - Fix Version/s: 2.3.0 2.2.1 > "as" should be "alias" in python quick start docu

[jira] [Resolved] (SPARK-22107) "as" should be "alias" in python quick start documentation

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22107. -- Resolution: Fixed Fixed in https://github.com/apache/spark/pull/19326 > "as" should be "alias"

[jira] [Commented] (SPARK-14540) Support Scala 2.12 closures and Java 8 lambdas in ClosureCleaner

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178320#comment-16178320 ] Sean Owen commented on SPARK-14540: --- [~joshrosen] [~iakovlev] I tried this test on mast

[jira] [Resolved] (SPARK-22081) Generalized Reduced Error Logistic Regression

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22081. --- Resolution: Won't Fix > Generalized Reduced Error Logistic Regression > -

[jira] [Commented] (SPARK-22077) RpcEndpointAddress fails to parse spark URL if it is an ipv6 address.

2017-09-24 Thread Sayat Satybaldiyev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178267#comment-16178267 ] Sayat Satybaldiyev commented on SPARK-22077: sorry, I little bit overestimate

[jira] [Commented] (SPARK-20448) Document how FileInputDStream works with object storage

2017-09-24 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178236#comment-16178236 ] Steve Loughran commented on SPARK-20448: thanks! > Document how FileInputDStream

[jira] [Created] (SPARK-22112) Add missing method to pyspark api: spark.read.csv(Dataset)

2017-09-24 Thread Andrew Ash (JIRA)
Andrew Ash created SPARK-22112: -- Summary: Add missing method to pyspark api: spark.read.csv(Dataset) Key: SPARK-22112 URL: https://issues.apache.org/jira/browse/SPARK-22112 Project: Spark Issue

[jira] [Updated] (SPARK-14650) Compile Spark REPL for Scala 2.12

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-14650: -- Priority: Minor (was: Major) > Compile Spark REPL for Scala 2.12 > - >

[jira] [Resolved] (SPARK-14650) Compile Spark REPL for Scala 2.12

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14650. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19307 [https://github.co

[jira] [Resolved] (SPARK-22087) Clear remaining compile errors for 2.12; resolve most warnings

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22087. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19307 [https://github.co

[jira] [Assigned] (SPARK-14650) Compile Spark REPL for Scala 2.12

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-14650: - Assignee: Sean Owen > Compile Spark REPL for Scala 2.12 > - > >

[jira] [Assigned] (SPARK-22058) the BufferedInputStream will not be closed if an exception occurs

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22058: - Assignee: zuotingbing > the BufferedInputStream will not be closed if an exception occurs >

[jira] [Resolved] (SPARK-22058) the BufferedInputStream will not be closed if an exception occurs

2017-09-24 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22058. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19277 [https://github.co

[jira] [Updated] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-22093: - Fix Version/s: 2.3.0 > UtilsSuite "resolveURIs with multiple paths" test always cancelled > -

[jira] [Resolved] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22093. -- Resolution: Fixed Fixed in https://github.com/apache/spark/pull/19332 > UtilsSuite "resolveURI

[jira] [Assigned] (SPARK-22093) UtilsSuite "resolveURIs with multiple paths" test always cancelled

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22093: Assignee: Hyukjin Kwon > UtilsSuite "resolveURIs with multiple paths" test always cancelle

[jira] [Commented] (SPARK-22104) Add new option to dataframe -> parquet ==> custom extension to file name

2017-09-24 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178116#comment-16178116 ] Hyukjin Kwon commented on SPARK-22104: -- Do you mean part files inside the destinatio

[jira] [Commented] (SPARK-22046) Streaming State cannot be scalable

2017-09-24 Thread danny mor (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178106#comment-16178106 ] danny mor commented on SPARK-22046: --- the problem is not with scheduling the tasks throu