[jira] [Commented] (SPARK-21892) status code is 200 OK when kill application fail via spark master rest api

2017-09-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150171#comment-16150171 ] Sean Owen commented on SPARK-21892: --- The request succeeded at the HTTP level. The appli

[jira] [Created] (SPARK-21893) Put Kafka 0.8 behind a profile

2017-09-01 Thread Sean Owen (JIRA)
Sean Owen created SPARK-21893: - Summary: Put Kafka 0.8 behind a profile Key: SPARK-21893 URL: https://issues.apache.org/jira/browse/SPARK-21893 Project: Spark Issue Type: Sub-task Compo

[jira] [Commented] (SPARK-21882) OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function

2017-09-01 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150260#comment-16150260 ] Saisai Shao commented on SPARK-21882: - Please submit the patch to Github Apache Spark

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-09-01 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150258#comment-16150258 ] Saisai Shao commented on SPARK-21888: - Jars added by "--jars" will be added to client

[jira] [Updated] (SPARK-19976) DirectStream API throws OffsetOutOfRange Exception

2017-09-01 Thread Taukir (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Taukir updated SPARK-19976: --- Description: I am using following code. While data on kafka topic get deleted/retention period is over, it t

[jira] [Commented] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide commented on SPARK-21861: -- Hi Sean, Please find additional contents as

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:43 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:43 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:44 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:45 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:42 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:48 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:47 AM:

[jira] [Updated] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikhil Bhide updated SPARK-21861: - Attachment: (was: PageRankExample.scala) > Add more details to PageRank illustration > --

[jira] [Updated] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikhil Bhide updated SPARK-21861: - Attachment: PageRankExample.scala > Add more details to PageRank illustration > -

[jira] [Issue Comment Deleted] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikhil Bhide updated SPARK-21861: - Comment: was deleted (was: PageRankExample.scala ) > Add more details to PageRank illustration >

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:53 AM:

[jira] [Updated] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikhil Bhide updated SPARK-21861: - Attachment: PageRankExample.scala Modified PageRankExample > Add more details to PageRank illust

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:54 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150405#comment-16150405 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:54 AM:

[jira] [Comment Edited] (SPARK-21861) Add more details to PageRank illustration

2017-09-01 Thread Nikhil Bhide (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150389#comment-16150389 ] Nikhil Bhide edited comment on SPARK-21861 at 9/1/17 11:56 AM:

[jira] [Comment Edited] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled

2017-09-01 Thread liupengcheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16149997#comment-16149997 ] liupengcheng edited comment on SPARK-21885 at 9/1/17 1:25 PM: -

[jira] [Updated] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-09-01 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-21888: -- Issue Type: Improvement (was: Bug) > Cannot add stuff to Client Classpath for Yarn Cluster Mod

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-09-01 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150530#comment-16150530 ] Thomas Graves commented on SPARK-21888: --- Putting things into SPARK_CONF_DIR will wo

[jira] [Comment Edited] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-09-01 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150530#comment-16150530 ] Thomas Graves edited comment on SPARK-21888 at 9/1/17 1:37 PM:

[jira] [Assigned] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21890: Assignee: (was: Apache Spark) > ObtainCredentials does not pass creds to addDelegation

[jira] [Assigned] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21890: Assignee: Apache Spark > ObtainCredentials does not pass creds to addDelegationTokens > --

[jira] [Commented] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150589#comment-16150589 ] Apache Spark commented on SPARK-21890: -- User 'redsanket' has created a pull request

[jira] [Commented] (SPARK-21727) Operating on an ArrayType in a SparkR DataFrame throws error

2017-09-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150657#comment-16150657 ] Yanbo Liang commented on SPARK-21727: - I can run successfully with minor change: {cod

[jira] [Updated] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-09-01 Thread Sanket Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sanket Reddy updated SPARK-21890: - Description: I observed this while running a oozie job trying to connect to hbase via spark. It l

[jira] [Updated] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-09-01 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-21890: -- Description: I observed this while running a oozie job trying to connect to hbase via spark. It

[jira] [Commented] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-09-01 Thread Sanket Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150762#comment-16150762 ] Sanket Reddy commented on SPARK-21890: -- Will put up a PR for master too thanks > Ob

[jira] [Created] (SPARK-21894) Some Netty errors do not propagate to the top level driver

2017-09-01 Thread Charles Allen (JIRA)
Charles Allen created SPARK-21894: - Summary: Some Netty errors do not propagate to the top level driver Key: SPARK-21894 URL: https://issues.apache.org/jira/browse/SPARK-21894 Project: Spark

[jira] [Commented] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150846#comment-16150846 ] Joseph K. Bradley commented on SPARK-21770: --- Linear models are the most likely

[jira] [Resolved] (SPARK-21728) Allow SparkSubmit to use logging

2017-09-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-21728. Resolution: Fixed > Allow SparkSubmit to use logging > > >

[jira] [Resolved] (SPARK-21880) [spark UI]In the SQL table page, modify jobs trace information

2017-09-01 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-21880. -- Resolution: Fixed Assignee: he.qiao Fix Version/s: 2.3.0 > [spark UI]In the SQL

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-01 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150924#comment-16150924 ] Bryan Cutler commented on SPARK-21190: -- I'm good with the API summary proposed by [~

[jira] [Comment Edited] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-01 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150924#comment-16150924 ] Bryan Cutler edited comment on SPARK-21190 at 9/1/17 5:56 PM: -

[jira] [Commented] (SPARK-21617) ALTER TABLE...ADD COLUMNS broken in Hive 2.1 for DS tables

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150940#comment-16150940 ] Apache Spark commented on SPARK-21617: -- User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-17742) Spark Launcher does not get failed state in Listener

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17742?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150941#comment-16150941 ] Apache Spark commented on SPARK-17742: -- User 'vanzin' has created a pull request for

[jira] [Commented] (SPARK-21858) Make Spark grouping_id() compatible with Hive grouping__id

2017-09-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150955#comment-16150955 ] Dongjoon Hyun commented on SPARK-21858: --- Hi, [~_Yann_]. Thank you for investigating

[jira] [Commented] (SPARK-21858) Make Spark grouping_id() compatible with Hive grouping__id

2017-09-01 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150958#comment-16150958 ] Dongjoon Hyun commented on SPARK-21858: --- I'm adding SPARK-21055, too. IIUC, SPARK-2

[jira] [Resolved] (SPARK-14280) Update change-version.sh and pom.xml to add Scala 2.12 profiles

2017-09-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-14280. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18645 [https://github.co

[jira] [Comment Edited] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2017-09-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16148093#comment-16148093 ] Marcelo Vanzin edited comment on SPARK-18085 at 9/1/17 6:37 PM: ---

[jira] [Created] (SPARK-21895) Support changing database in HiveClient

2017-09-01 Thread Xiao Li (JIRA)
Xiao Li created SPARK-21895: --- Summary: Support changing database in HiveClient Key: SPARK-21895 URL: https://issues.apache.org/jira/browse/SPARK-21895 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-21895) Support changing database in HiveClient

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151006#comment-16151006 ] Apache Spark commented on SPARK-21895: -- User 'gatorsmile' has created a pull request

[jira] [Assigned] (SPARK-21895) Support changing database in HiveClient

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21895: Assignee: Apache Spark (was: Xiao Li) > Support changing database in HiveClient > ---

[jira] [Assigned] (SPARK-21895) Support changing database in HiveClient

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21895: Assignee: Xiao Li (was: Apache Spark) > Support changing database in HiveClient > ---

[jira] [Resolved] (SPARK-21864) Spark 2.0.1 - SaveMode.Overwrite does not work while saving data to memsql

2017-09-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21864. --- Resolution: Not A Problem > Spark 2.0.1 - SaveMode.Overwrite does not work while saving data to memsq

[jira] [Resolved] (SPARK-21895) Support changing database in HiveClient

2017-09-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21895. - Resolution: Fixed Assignee: Xiao Li (was: Apache Spark) Fix Version/s: 2.3.0 > Support c

[jira] [Assigned] (SPARK-21895) Support changing database in HiveClient

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21895: Assignee: Apache Spark (was: Xiao Li) > Support changing database in HiveClient > ---

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-09-01 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151128#comment-16151128 ] Marco Gaido commented on SPARK-21888: - It is enough to add {{hbase-site.xml}} using {

[jira] [Updated] (SPARK-21477) Mark LocalTableScanExec's input data transient

2017-09-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-21477: Fix Version/s: 2.2.1 > Mark LocalTableScanExec's input data transient > ---

[jira] [Commented] (SPARK-20761) Union uses column order rather than schema

2017-09-01 Thread Munesh Bandaru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151181#comment-16151181 ] Munesh Bandaru commented on SPARK-20761: As the ticket was closed as 'Not a Probl

[jira] [Comment Edited] (SPARK-20761) Union uses column order rather than schema

2017-09-01 Thread Munesh Bandaru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151181#comment-16151181 ] Munesh Bandaru edited comment on SPARK-20761 at 9/1/17 9:31 PM: ---

[jira] [Commented] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2017-09-01 Thread Munesh Bandaru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151185#comment-16151185 ] Munesh Bandaru commented on SPARK-15918: As the ticket was closed as 'Not a Probl

[jira] [Created] (SPARK-21896) Stack Overflow when window function nested inside aggregate function

2017-09-01 Thread Luyao Yang (JIRA)
Luyao Yang created SPARK-21896: -- Summary: Stack Overflow when window function nested inside aggregate function Key: SPARK-21896 URL: https://issues.apache.org/jira/browse/SPARK-21896 Project: Spark

[jira] [Commented] (SPARK-14864) [MLLIB] Implement Doc2Vec

2017-09-01 Thread Li Ping Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151229#comment-16151229 ] Li Ping Zhang commented on SPARK-14864: --- Agree. I think it would extend spark user

[jira] [Issue Comment Deleted] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-21729: -- Comment: was deleted (was: User 'WeichenXu123' has created a pull request for this issu

[jira] [Assigned] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-21729: - Assignee: Weichen Xu > Generic test for ProbabilisticClassifier to ensure consis

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-01 Thread Leif Walsh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151261#comment-16151261 ] Leif Walsh commented on SPARK-21190: I'm not 100% sure this is legal pandas but I thi

[jira] [Comment Edited] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-01 Thread Leif Walsh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151261#comment-16151261 ] Leif Walsh edited comment on SPARK-21190 at 9/1/17 10:55 PM: -

[jira] [Commented] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2017-09-01 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151279#comment-16151279 ] Hyukjin Kwon commented on SPARK-15918: -- [~Munesh], No need to leave a duplicated com

[jira] [Commented] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2017-09-01 Thread Munesh Bandaru (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151294#comment-16151294 ] Munesh Bandaru commented on SPARK-15918: [~hyukjin.kwon] Thank you Hyukjin for pr

[jira] [Commented] (SPARK-12157) Support numpy types as return values of Python UDFs

2017-09-01 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151336#comment-16151336 ] Felix Cheung commented on SPARK-12157: -- any more thought on this? I think we should

[jira] [Resolved] (SPARK-21729) Generic test for ProbabilisticClassifier to ensure consistent output columns

2017-09-01 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21729. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19065 [h

[jira] [Created] (SPARK-21897) Add unionByName API to DataFrame in Python and R

2017-09-01 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-21897: Summary: Add unionByName API to DataFrame in Python and R Key: SPARK-21897 URL: https://issues.apache.org/jira/browse/SPARK-21897 Project: Spark Issue Type:

[jira] [Commented] (SPARK-21897) Add unionByName API to DataFrame in Python and R

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21897?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151368#comment-16151368 ] Apache Spark commented on SPARK-21897: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-21897) Add unionByName API to DataFrame in Python and R

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21897: Assignee: Apache Spark > Add unionByName API to DataFrame in Python and R > --

[jira] [Assigned] (SPARK-21897) Add unionByName API to DataFrame in Python and R

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21897: Assignee: (was: Apache Spark) > Add unionByName API to DataFrame in Python and R > ---

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-01 Thread Leif Walsh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151376#comment-16151376 ] Leif Walsh commented on SPARK-21190: Yep, that's totally a thing: {noformat}In [1]:

[jira] [Commented] (SPARK-21190) SPIP: Vectorized UDFs in Python

2017-09-01 Thread Leif Walsh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151377#comment-16151377 ] Leif Walsh commented on SPARK-21190: You can also make a Series with no content and a

[jira] [Assigned] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21770: Assignee: Apache Spark > ProbabilisticClassificationModel: Improve normalization of all-ze

[jira] [Assigned] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21770: Assignee: (was: Apache Spark) > ProbabilisticClassificationModel: Improve normalizatio

[jira] [Commented] (SPARK-21770) ProbabilisticClassificationModel: Improve normalization of all-zero raw predictions

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151404#comment-16151404 ] Apache Spark commented on SPARK-21770: -- User 'WeichenXu123' has created a pull reque

[jira] [Assigned] (SPARK-21799) KMeans performance regression (5-6x slowdown) in Spark 2.2

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21799: Assignee: (was: Apache Spark) > KMeans performance regression (5-6x slowdown) in Spark

[jira] [Commented] (SPARK-21799) KMeans performance regression (5-6x slowdown) in Spark 2.2

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151407#comment-16151407 ] Apache Spark commented on SPARK-21799: -- User 'WeichenXu123' has created a pull reque

[jira] [Assigned] (SPARK-21799) KMeans performance regression (5-6x slowdown) in Spark 2.2

2017-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21799: Assignee: Apache Spark > KMeans performance regression (5-6x slowdown) in Spark 2.2 >