[jira] [Created] (SPARK-15920) Using map on DataFrame

2016-06-13 Thread Piotr Milanowski (JIRA)
Piotr Milanowski created SPARK-15920: Summary: Using map on DataFrame Key: SPARK-15920 URL: https://issues.apache.org/jira/browse/SPARK-15920 Project: Spark Issue Type: Bug Comp

[jira] [Commented] (SPARK-8546) PMML export for Naive Bayes

2016-06-13 Thread Radoslaw Gasiorek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8546?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327167#comment-15327167 ] Radoslaw Gasiorek commented on SPARK-8546: -- hi there, [~josephkb] We would like t

[jira] [Created] (SPARK-15919) DStream "saveAsTextFile" doesn't update the prefix after each checkpoint

2016-06-13 Thread Aamir Abbas (JIRA)
Aamir Abbas created SPARK-15919: --- Summary: DStream "saveAsTextFile" doesn't update the prefix after each checkpoint Key: SPARK-15919 URL: https://issues.apache.org/jira/browse/SPARK-15919 Project: Spark

[jira] [Created] (SPARK-15918) unionAll returns wrong result when two dataframes has schema in different order

2016-06-13 Thread Prabhu Joseph (JIRA)
Prabhu Joseph created SPARK-15918: - Summary: unionAll returns wrong result when two dataframes has schema in different order Key: SPARK-15918 URL: https://issues.apache.org/jira/browse/SPARK-15918 Pro

[jira] [Reopened] (SPARK-15345) SparkSession's conf doesn't take effect when there's already an existing SparkContext

2016-06-13 Thread Piotr Milanowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piotr Milanowski reopened SPARK-15345: -- Does not work as expected when using spark-submit; for example, this works fine and prints

[jira] [Assigned] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15916: Assignee: Apache Spark > JDBC AND/OR operator push down does not respect lower OR operator

[jira] [Assigned] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15916: Assignee: (was: Apache Spark) > JDBC AND/OR operator push down does not respect lower

[jira] [Commented] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327144#comment-15327144 ] Apache Spark commented on SPARK-15916: -- User 'HyukjinKwon' has created a pull reques

[jira] [Commented] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Piotr Czarnas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327118#comment-15327118 ] Piotr Czarnas commented on SPARK-15916: --- Hi, I wish so. This issue is failing a lo

[jira] [Updated] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-15904: Priority: Minor (was: Major) > High Memory Pressure using MLlib K-means >

[jira] [Updated] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessio updated SPARK-15904: Issue Type: Improvement (was: Bug) > High Memory Pressure using MLlib K-means > --

[jira] [Updated] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-13 Thread Jonathan Taws (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Taws updated SPARK-15917: -- Description: After stumbling across a few StackOverflow posts around the issue of using a fixe

[jira] [Created] (SPARK-15917) Define the number of executors in standalone mode with an easy-to-use property

2016-06-13 Thread Jonathan Taws (JIRA)
Jonathan Taws created SPARK-15917: - Summary: Define the number of executors in standalone mode with an easy-to-use property Key: SPARK-15917 URL: https://issues.apache.org/jira/browse/SPARK-15917 Proj

[jira] [Commented] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327108#comment-15327108 ] yuhao yang commented on SPARK-15904: Thanks for reporting it. I'm not sure if the iss

[jira] [Commented] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327106#comment-15327106 ] Hyukjin Kwon commented on SPARK-15916: -- Indeed. Do you mind if I submit a PR for thi

[jira] [Commented] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-13 Thread Alessio (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327090#comment-15327090 ] Alessio commented on SPARK-15904: - Hi [~yuhaoyan]], the dataset size is 9120 rows and 212

[jira] [Commented] (SPARK-15904) High Memory Pressure using MLlib K-means

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327082#comment-15327082 ] yuhao yang commented on SPARK-15904: Hi [~Purple]] What's your k and vector size? Btw

[jira] [Commented] (SPARK-6628) ClassCastException occurs when executing sql statement "insert into" on hbase table

2016-06-13 Thread Murshid Chalaev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327065#comment-15327065 ] Murshid Chalaev commented on SPARK-6628: Spark 1.6.1 is affected as well, is there

[jira] [Commented] (SPARK-15790) Audit @Since annotations in ML

2016-06-13 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15327028#comment-15327028 ] Nick Pentreath commented on SPARK-15790: Ah thanks - missed that umbrella. It's a

[jira] [Updated] (SPARK-15743) Prevent saving with all-column partitioning

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15743: -- Assignee: Dongjoon Hyun > Prevent saving with all-column partitioning > ---

[jira] [Updated] (SPARK-15788) PySpark IDFModel missing "idf" property

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15788: -- Assignee: Jeff Zhang > PySpark IDFModel missing "idf" property > --

[jira] [Updated] (SPARK-15489) Dataset kryo encoder won't load custom user settings

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15489: -- Assignee: Amit Sela > Dataset kryo encoder won't load custom user settings > -

[jira] [Updated] (SPARK-6320) Adding new query plan strategy to SQLContext

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-6320: - Assignee: Takuya Ueshin > Adding new query plan strategy to SQLContext > -

[jira] [Updated] (SPARK-15813) Spark Dyn Allocation Cancel log message misleading

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15813: -- Assignee: Peter Ableda > Spark Dyn Allocation Cancel log message misleading > -

[jira] [Updated] (SPARK-15813) Spark Dyn Allocation Cancel log message misleading

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15813: -- Issue Type: Improvement (was: Bug) > Spark Dyn Allocation Cancel log message misleading >

[jira] [Resolved] (SPARK-15813) Spark Dyn Allocation Cancel log message misleading

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-15813. --- Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13552 [https://github.co

[jira] [Updated] (SPARK-15796) Reduce spark.memory.fraction default to avoid overrunning old gen in JVM default config

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-15796: -- Priority: Blocker (was: Minor) Pardon marking this "Blocker", but I think this needs some attention be

[jira] [Commented] (SPARK-14503) spark.ml API for FPGrowth

2016-06-13 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15326994#comment-15326994 ] yuhao yang commented on SPARK-14503: Hi Jeff, welcome to contribute. I'm discussing

[jira] [Commented] (SPARK-15796) Reduce spark.memory.fraction default to avoid overrunning old gen in JVM default config

2016-06-13 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15326979#comment-15326979 ] Sean Owen commented on SPARK-15796: --- A new parameter like that would just be going back

[jira] [Commented] (SPARK-14503) spark.ml API for FPGrowth

2016-06-13 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15326977#comment-15326977 ] Jeff Zhang commented on SPARK-14503: [~GayathriMurali] [~yuhaoyan] Do you still work

[jira] [Created] (SPARK-15916) JDBC AND/OR operator push down does not respect lower OR operator precedence

2016-06-13 Thread Piotr Czarnas (JIRA)
Piotr Czarnas created SPARK-15916: - Summary: JDBC AND/OR operator push down does not respect lower OR operator precedence Key: SPARK-15916 URL: https://issues.apache.org/jira/browse/SPARK-15916 Projec

[jira] [Assigned] (SPARK-15915) CacheManager should use canonicalized plan for planToCache.

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15915: Assignee: Apache Spark > CacheManager should use canonicalized plan for planToCache. > ---

[jira] [Assigned] (SPARK-15915) CacheManager should use canonicalized plan for planToCache.

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15915: Assignee: (was: Apache Spark) > CacheManager should use canonicalized plan for planToC

[jira] [Commented] (SPARK-15915) CacheManager should use canonicalized plan for planToCache.

2016-06-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15326921#comment-15326921 ] Apache Spark commented on SPARK-15915: -- User 'ueshin' has created a pull request for

[jira] [Created] (SPARK-15915) CacheManager should use canonicalized plan for planToCache.

2016-06-13 Thread Takuya Ueshin (JIRA)
Takuya Ueshin created SPARK-15915: - Summary: CacheManager should use canonicalized plan for planToCache. Key: SPARK-15915 URL: https://issues.apache.org/jira/browse/SPARK-15915 Project: Spark

<    1   2   3