[jira] [Updated] (SPARK-16090) Improve method grouping in SparkR generated docs

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-16090: -- Description: This JIRA follows the discussion on https://github.com/apache/spark/pull/13109 to

[jira] [Commented] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341256#comment-15341256 ] Sean Owen commented on SPARK-15917: --- I think we're probably missing something. Does {{s

[jira] [Commented] (SPARK-16090) Improve method grouping in SparkR generated docs

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341258#comment-15341258 ] Xiangrui Meng commented on SPARK-16090: --- For ML methods, I'd like to propose the fo

[jira] [Commented] (SPARK-16088) Deprecate setJobGroup, clearJobGroup, cancelJobGroup from SparkR API

2016-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341263#comment-15341263 ] Felix Cheung commented on SPARK-16088: -- It's possible - both setLogLevel and spark.l

[jira] [Comment Edited] (SPARK-16088) Deprecate setJobGroup, clearJobGroup, cancelJobGroup from SparkR API

2016-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341263#comment-15341263 ] Felix Cheung edited comment on SPARK-16088 at 6/21/16 7:10 AM:

[jira] [Commented] (SPARK-16090) Improve method grouping in SparkR generated docs

2016-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341267#comment-15341267 ] Felix Cheung commented on SPARK-16090: -- +1 on that! > Improve method grouping in Sp

[jira] [Created] (SPARK-16091) Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster

2016-06-21 Thread Romain Giot (JIRA)
Romain Giot created SPARK-16091: --- Summary: Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster Key: SPARK-16091 URL: https://issues.apache.org/jira/browse/SPARK-16091

[jira] [Commented] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341277#comment-15341277 ] Takeshi Yamamuro commented on SPARK-15987: -- How about casting `citext` types int

[jira] [Comment Edited] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341277#comment-15341277 ] Takeshi Yamamuro edited comment on SPARK-15987 at 6/21/16 7:19 AM:

[jira] [Resolved] (SPARK-15319) Fix SparkR doc layout for corr and other DataFrame stats functions

2016-06-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-15319. --- Resolution: Fixed Assignee: Felix Cheung Fix Version/s: 2.0.0

[jira] [Commented] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341284#comment-15341284 ] Julien Diener commented on SPARK-16069: --- Why would data be send to executors? I und

[jira] [Commented] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341283#comment-15341283 ] Julien Diener commented on SPARK-16069: --- Why would data be send to executors? I und

[jira] [Issue Comment Deleted] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Diener updated SPARK-16069: -- Comment: was deleted (was: Why would data be send to executors? I understood that cache means t

[jira] [Comment Edited] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341284#comment-15341284 ] Julien Diener edited comment on SPARK-16069 at 6/21/16 7:22 AM: ---

[jira] [Updated] (SPARK-16091) Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster

2016-06-21 Thread Romain Giot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Romain Giot updated SPARK-16091: Description: When writing a Dataset in a CSV file, the following exception java.io.FileNotFoundEx

[jira] [Comment Edited] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341284#comment-15341284 ] Julien Diener edited comment on SPARK-16069 at 6/21/16 7:22 AM: ---

[jira] [Updated] (SPARK-16082) Refactor dapply's/dapplyCollect's documentation - remove duplicated comments

2016-06-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-16082: -- Assignee: Narine Kokhlikyan > Refactor dapply's/dapplyCollect's documentation -

[jira] [Resolved] (SPARK-16082) Refactor dapply's/dapplyCollect's documentation - remove duplicated comments

2016-06-21 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-16082. --- Resolution: Fixed Fix Version/s: 2.0.0 Resolved by https://github.com/

[jira] [Updated] (SPARK-16091) Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster

2016-06-21 Thread Romain Giot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Romain Giot updated SPARK-16091: Description: When writing a Dataset in a CSV file, the following exception java.io.FileNotFoundEx

[jira] [Commented] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Sergey Bahchissaraitsev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341294#comment-15341294 ] Sergey Bahchissaraitsev commented on SPARK-15987: - Casting could be a wor

[jira] [Updated] (SPARK-16086) Python UDF failed when there is no arguments

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-16086: -- Fix Version/s: (was: 2.0.0) > Python UDF failed when there is no arguments > --

[jira] [Reopened] (SPARK-16086) Python UDF failed when there is no arguments

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-16086: --- > Python UDF failed when there is no arguments > > >

[jira] [Comment Edited] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Sergey Bahchissaraitsev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341294#comment-15341294 ] Sergey Bahchissaraitsev edited comment on SPARK-15987 at 6/21/16 7:35 AM: -

[jira] [Assigned] (SPARK-16086) Python UDF failed when there is no arguments

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16086: Assignee: Davies Liu (was: Apache Spark) > Python UDF failed when there is no arguments >

[jira] [Commented] (SPARK-16086) Python UDF failed when there is no arguments

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341297#comment-15341297 ] Xiangrui Meng commented on SPARK-16086: --- Reverted the changes in master and branch-

[jira] [Assigned] (SPARK-16086) Python UDF failed when there is no arguments

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16086: Assignee: Apache Spark (was: Davies Liu) > Python UDF failed when there is no arguments >

[jira] [Resolved] (SPARK-10258) Add @Since annotation to ml.feature

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-10258. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13641 [https://g

[jira] [Updated] (SPARK-10258) Add @Since annotation to ml.feature

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10258: -- Shepherd: Nick Pentreath > Add @Since annotation to ml.feature > --

[jira] [Commented] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341301#comment-15341301 ] Sean Owen commented on SPARK-16069: --- It's sent from the driver in this case, at least.

[jira] [Resolved] (SPARK-7751) Add @Since annotation to stable and experimental methods in MLlib

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7751. -- Resolution: Fixed Fix Version/s: 2.0.0 Mark this umbrella as resolved since all sub-tasks

[jira] [Updated] (SPARK-16080) Config archive not properly added to YARN classpath

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16080: -- Target Version/s: (was: 2.0.0) Priority: Major (was: Blocker) Fix Version/s: (

[jira] [Updated] (SPARK-16091) Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16091: -- Priority: Minor (was: Blocker) [~rgiot] have a look at https://cwiki.apache.org/confluence/display/SP

[jira] [Updated] (SPARK-12144) Support more external data source API in SparkR

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-12144: -- Assignee: Yanbo Liang > Support more external data source API in SparkR >

[jira] [Updated] (SPARK-16045) Spark 2.0 ML.feature: doc update for stopwords and binarizer

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-16045: -- Affects Version/s: 2.0.0 Target Version/s: 2.0.0 > Spark 2.0 ML.feature: doc update for st

[jira] [Updated] (SPARK-16045) Spark 2.0 ML.feature: doc update for stopwords and binarizer

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-16045: -- Assignee: yuhao yang > Spark 2.0 ML.feature: doc update for stopwords and binarizer > -

[jira] [Resolved] (SPARK-16045) Spark 2.0 ML.feature: doc update for stopwords and binarizer

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-16045. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13375 [https://g

[jira] [Commented] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341312#comment-15341312 ] Jonathan Taws commented on SPARK-15917: --- No it doesn't for me, I can see it properl

[jira] [Resolved] (SPARK-16084) Minor javadoc issue with "Describe" table in the parser

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-16084. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13791 [https://github.co

[jira] [Updated] (SPARK-16084) Minor javadoc issue with "Describe" table in the parser

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-16084: -- Assignee: Bo Meng > Minor javadoc issue with "Describe" table in the parser > -

[jira] [Commented] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Nipun Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341316#comment-15341316 ] Nipun Agarwal commented on SPARK-15987: --- Can you provide me a sample python example

[jira] [Commented] (SPARK-16071) Not sufficient array size checks to avoid integer overflows in Tungsten

2016-06-21 Thread ding (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341318#comment-15341318 ] ding commented on SPARK-16071: -- The exception raised in different location as one happened i

[jira] [Commented] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341323#comment-15341323 ] Sean Owen commented on SPARK-15917: --- Hm, it's a bit out of my area, since I don't use s

[jira] [Commented] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Nipun Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341325#comment-15341325 ] Nipun Agarwal commented on SPARK-15987: --- Can you provide me some ref on how to do c

[jira] [Created] (SPARK-16092) Spark2.0 take no effect after set hive.exec.dynamic.partition.mode=nonstrict as a global variable in Spark2.0 configuration file while Spark1.6 does

2016-06-21 Thread marymwu (JIRA)
marymwu created SPARK-16092: --- Summary: Spark2.0 take no effect after set hive.exec.dynamic.partition.mode=nonstrict as a global variable in Spark2.0 configuration file while Spark1.6 does Key: SPARK-16092 URL: https://

[jira] [Assigned] (SPARK-10258) Add @Since annotation to ml.feature

2016-06-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath reassigned SPARK-10258: -- Assignee: Nick Pentreath (was: Martin Brown) > Add @Since annotation to ml.feature >

[jira] [Commented] (SPARK-16071) Not sufficient array size checks to avoid integer overflows in Tungsten

2016-06-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341330#comment-15341330 ] Xiangrui Meng commented on SPARK-16071: --- [~ding] This JIRA is not to solve this par

[jira] [Created] (SPARK-16093) Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1

2016-06-21 Thread marymwu (JIRA)
marymwu created SPARK-16093: --- Summary: Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1 Key: SPARK-16093 URL: https://issues.apache.org/jira/browse/SPARK-16093 Project: Spark

[jira] [Commented] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Sergey Bahchissaraitsev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341334#comment-15341334 ] Sergey Bahchissaraitsev commented on SPARK-15987: - If you mean my suggest

[jira] [Created] (SPARK-16094) Support HashAggregateExec for non-partial aggregates

2016-06-21 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-16094: Summary: Support HashAggregateExec for non-partial aggregates Key: SPARK-16094 URL: https://issues.apache.org/jira/browse/SPARK-16094 Project: Spark

[jira] [Updated] (SPARK-16093) Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1

2016-06-21 Thread marymwu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] marymwu updated SPARK-16093: Attachment: Errorlog.txt > Spark2.0 take no effect after set spark.sql.autoBroadcastJoinThreshold = 1 > ---

[jira] [Commented] (SPARK-15987) PostgreSQL CITEXT type JDBC support

2016-06-21 Thread Nipun Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341347#comment-15341347 ] Nipun Agarwal commented on SPARK-15987: --- Got it, I created a view and then worked o

[jira] [Commented] (SPARK-16091) Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster

2016-06-21 Thread Romain Giot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341351#comment-15341351 ] Romain Giot commented on SPARK-16091: - Ok, sorry for the too high importance of the b

[jira] [Commented] (SPARK-16071) Not sufficient array size checks to avoid integer overflows in Tungsten

2016-06-21 Thread ding (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341354#comment-15341354 ] ding commented on SPARK-16071: -- OK, I see. Thank you for your clarification. > Not sufficie

[jira] [Assigned] (SPARK-16094) Support HashAggregateExec for non-partial aggregates

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16094: Assignee: (was: Apache Spark) > Support HashAggregateExec for non-partial aggregates >

[jira] [Commented] (SPARK-16094) Support HashAggregateExec for non-partial aggregates

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341355#comment-15341355 ] Apache Spark commented on SPARK-16094: -- User 'maropu' has created a pull request for

[jira] [Assigned] (SPARK-16094) Support HashAggregateExec for non-partial aggregates

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16094: Assignee: Apache Spark > Support HashAggregateExec for non-partial aggregates > --

[jira] [Commented] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341360#comment-15341360 ] Jonathan Taws commented on SPARK-15917: --- If I run the following command : {{spark-s

[jira] [Comment Edited] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341360#comment-15341360 ] Jonathan Taws edited comment on SPARK-15917 at 6/21/16 8:24 AM: ---

[jira] [Comment Edited] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341360#comment-15341360 ] Jonathan Taws edited comment on SPARK-15917 at 6/21/16 8:24 AM: ---

[jira] [Comment Edited] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-21 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341360#comment-15341360 ] Jonathan Taws edited comment on SPARK-15917 at 6/21/16 8:25 AM: ---

[jira] [Commented] (SPARK-16090) Improve method grouping in SparkR generated docs

2016-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341367#comment-15341367 ] Felix Cheung commented on SPARK-16090: -- Ok, I reviewed all 200+ html pages generated

[jira] [Comment Edited] (SPARK-16090) Improve method grouping in SparkR generated docs

2016-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341367#comment-15341367 ] Felix Cheung edited comment on SPARK-16090 at 6/21/16 8:34 AM:

[jira] [Updated] (SPARK-10258) Add @Since annotation to ml.feature

2016-06-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-10258: --- Assignee: Martin Brown (was: Nick Pentreath) > Add @Since annotation to ml.feature > ---

[jira] [Commented] (SPARK-15467) Getting stack overflow when attempting to query a wide Dataset (>200 fields)

2016-06-21 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341376#comment-15341376 ] Kazuaki Ishizaki commented on SPARK-15467: -- Thank you for letting me know it. No

[jira] [Commented] (SPARK-12177) Update KafkaDStreams to new Kafka 0.10 Consumer API

2016-06-21 Thread Lars Francke (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341381#comment-15341381 ] Lars Francke commented on SPARK-12177: -- I have looked at the code and have only mino

[jira] [Created] (SPARK-16095) Yarn cluster mode should return consistent result for command line and SparkLauncher

2016-06-21 Thread Peng Zhang (JIRA)
Peng Zhang created SPARK-16095: -- Summary: Yarn cluster mode should return consistent result for command line and SparkLauncher Key: SPARK-16095 URL: https://issues.apache.org/jira/browse/SPARK-16095 Proj

[jira] [Commented] (SPARK-15941) Netty RPC implementation ignores the executor bind address

2016-06-21 Thread Marco Capuccini (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341396#comment-15341396 ] Marco Capuccini commented on SPARK-15941: - [~tgraves] I ran Spark in standalone m

[jira] [Commented] (SPARK-16091) Dataset.partitionBy.csv raise a java.io.FileNotFoundException when launched on an hadoop cluster

2016-06-21 Thread Romain Giot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341406#comment-15341406 ] Romain Giot commented on SPARK-16091: - There is a high probability that this issue is

[jira] [Commented] (SPARK-16070) DataFrame/Parquet issues with primitive arrays

2016-06-21 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341408#comment-15341408 ] Kazuaki Ishizaki commented on SPARK-16070: -- [~mengxr], thank you for creating an

[jira] [Updated] (SPARK-16063) Add storageLevel to Dataset

2016-06-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-16063: --- Description: SPARK-11905 added {{cache}}/{{persist}} to {{Dataset}}. We should add {{Dataset.

[jira] [Updated] (SPARK-16063) Add storageLevel to Dataset

2016-06-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-16063: --- Summary: Add storageLevel to Dataset (was: Add getStorageLevel to Dataset) > Add storageLeve

[jira] [Commented] (SPARK-16070) DataFrame/Parquet issues with primitive arrays

2016-06-21 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341427#comment-15341427 ] Kazuaki Ishizaki commented on SPARK-16070: -- Other JIRAs for DataFrame issues wit

[jira] [Comment Edited] (SPARK-16070) DataFrame/Parquet issues with primitive arrays

2016-06-21 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341408#comment-15341408 ] Kazuaki Ishizaki edited comment on SPARK-16070 at 6/21/16 9:22 AM:

[jira] [Comment Edited] (SPARK-16070) DataFrame/Parquet issues with primitive arrays

2016-06-21 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341427#comment-15341427 ] Kazuaki Ishizaki edited comment on SPARK-16070 at 6/21/16 9:22 AM:

[jira] [Commented] (SPARK-16090) Improve method grouping in SparkR generated docs

2016-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341436#comment-15341436 ] Felix Cheung commented on SPARK-16090: -- statfunction: https://github.com/apache/spar

[jira] [Created] (SPARK-16096) R deprecate unionAll and add union

2016-06-21 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-16096: Summary: R deprecate unionAll and add union Key: SPARK-16096 URL: https://issues.apache.org/jira/browse/SPARK-16096 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-16015) Datasource register for shutdown?

2016-06-21 Thread Michael Nitschinger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341442#comment-15341442 ] Michael Nitschinger commented on SPARK-16015: - Sean, thanks for your input.

[jira] [Assigned] (SPARK-16096) R deprecate unionAll and add union

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16096: Assignee: (was: Apache Spark) > R deprecate unionAll and add union > -

[jira] [Commented] (SPARK-16096) R deprecate unionAll and add union

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341443#comment-15341443 ] Apache Spark commented on SPARK-16096: -- User 'felixcheung' has created a pull reques

[jira] [Assigned] (SPARK-16096) R deprecate unionAll and add union

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16096: Assignee: Apache Spark > R deprecate unionAll and add union >

[jira] [Commented] (SPARK-15704) TungstenAggregate crashes

2016-06-21 Thread Deenar Toraskar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341478#comment-15341478 ] Deenar Toraskar commented on SPARK-15704: - Hi guys I get a similar error when us

[jira] [Commented] (SPARK-15393) Writing empty Dataframes doesn't save any _metadata files

2016-06-21 Thread Daniel Mescheder (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341486#comment-15341486 ] Daniel Mescheder commented on SPARK-15393: -- I am observing what I think is the s

[jira] [Commented] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341492#comment-15341492 ] Julien Diener commented on SPARK-16069: --- Maybe I wasn't clear: the input rdd is alr

[jira] [Comment Edited] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341492#comment-15341492 ] Julien Diener edited comment on SPARK-16069 at 6/21/16 10:06 AM: --

[jira] [Comment Edited] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341492#comment-15341492 ] Julien Diener edited comment on SPARK-16069 at 6/21/16 10:07 AM: --

[jira] [Comment Edited] (SPARK-16069) rdd.map(identity).cache very slow

2016-06-21 Thread Julien Diener (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341492#comment-15341492 ] Julien Diener edited comment on SPARK-16069 at 6/21/16 10:08 AM: --

[jira] [Commented] (SPARK-16075) Make VectorUDT/MatrixUDT singleton under spark.ml package

2016-06-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341506#comment-15341506 ] Nick Pentreath commented on SPARK-16075: [~wangmiao1981] SPARK-15746 will probabl

[jira] [Updated] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-21 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-15904: Description: *Please Note*: even though the issue has been marked as "not a problem" and "resolved", this

[jira] [Updated] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-21 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-15904: Description: *Please Note*: even though the issue has been marked as "not a problem" and "resolved", this

[jira] [Commented] (SPARK-16044) input_file_name() returns empty strings in data sources based on NewHadoopRDD.

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341552#comment-15341552 ] Apache Spark commented on SPARK-16044: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-15704) TungstenAggregate crashes

2016-06-21 Thread Hiroshi Inoue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341587#comment-15341587 ] Hiroshi Inoue commented on SPARK-15704: --- I confirmed the same error by executing De

[jira] [Commented] (SPARK-15704) TungstenAggregate crashes

2016-06-21 Thread Deenar Toraskar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341598#comment-15341598 ] Deenar Toraskar commented on SPARK-15704: - [~inouehrs] thanks for checking this o

[jira] [Created] (SPARK-16097) Encoders.tuple should handle null object correctly

2016-06-21 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-16097: --- Summary: Encoders.tuple should handle null object correctly Key: SPARK-16097 URL: https://issues.apache.org/jira/browse/SPARK-16097 Project: Spark Issue Type:

[jira] [Created] (SPARK-16098) Multiclass SVM Learning

2016-06-21 Thread Hayri Volkan Agun (JIRA)
Hayri Volkan Agun created SPARK-16098: - Summary: Multiclass SVM Learning Key: SPARK-16098 URL: https://issues.apache.org/jira/browse/SPARK-16098 Project: Spark Issue Type: Request

[jira] [Assigned] (SPARK-16097) Encoders.tuple should handle null object correctly

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16097: Assignee: Apache Spark (was: Wenchen Fan) > Encoders.tuple should handle null object corr

[jira] [Assigned] (SPARK-16097) Encoders.tuple should handle null object correctly

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16097: Assignee: Wenchen Fan (was: Apache Spark) > Encoders.tuple should handle null object corr

[jira] [Commented] (SPARK-16097) Encoders.tuple should handle null object correctly

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341604#comment-15341604 ] Apache Spark commented on SPARK-16097: -- User 'cloud-fan' has created a pull request

[jira] [Commented] (SPARK-16076) Dataset - outer join nulls can sometimes combinate to default values

2016-06-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341609#comment-15341609 ] Wenchen Fan commented on SPARK-16076: - I think this is caused by https://issues.apach

[jira] [Commented] (SPARK-15704) TungstenAggregate crashes

2016-06-21 Thread Hiroshi Inoue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341618#comment-15341618 ] Hiroshi Inoue commented on SPARK-15704: --- Yes, please. Thank you. > TungstenAggrega

[jira] [Commented] (SPARK-14480) Simplify CSV parsing process with a better performance

2016-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15341624#comment-15341624 ] Apache Spark commented on SPARK-14480: -- User 'HyukjinKwon' has created a pull reques

  1   2   3   4   >