[jira] [Commented] (SPARK-21786) The 'spark.sql.parquet.compression.codec' configuration doesn't take effect on tables with partition field(s)

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303121#comment-16303121 ] Apache Spark commented on SPARK-21786: -- User 'fjh100456' has created a pull request

[jira] [Commented] (SPARK-21208) Ability to "setLocalProperty" from sc, in sparkR

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303118#comment-16303118 ] Apache Spark commented on SPARK-21208: -- User 'HyukjinKwon' has created a pull reques

[jira] [Resolved] (SPARK-22707) Optimize CrossValidator memory occupation by models in fitting

2017-12-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-22707. --- Resolution: Fixed Fix Version/s: 2.3.0 Resolved by https://github.com/apache/s

[jira] [Commented] (SPARK-22891) NullPointerException when use udf

2017-12-24 Thread gaoyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303092#comment-16303092 ] gaoyang commented on SPARK-22891: - It happends in spark 2.2.x, not in spark 2.1.x. > Nul

[jira] [Created] (SPARK-22897) Expose stageAttemptId in TaskContext

2017-12-24 Thread Xianjin YE (JIRA)
Xianjin YE created SPARK-22897: -- Summary: Expose stageAttemptId in TaskContext Key: SPARK-22897 URL: https://issues.apache.org/jira/browse/SPARK-22897 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-22898) collect_set aggregation on bucketed table causes an exchange stage

2017-12-24 Thread Modi Tamam (JIRA)
Modi Tamam created SPARK-22898: -- Summary: collect_set aggregation on bucketed table causes an exchange stage Key: SPARK-22898 URL: https://issues.apache.org/jira/browse/SPARK-22898 Project: Spark

[jira] [Commented] (SPARK-22874) Modify checking pandas version to use LooseVersion.

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303081#comment-16303081 ] Apache Spark commented on SPARK-22874: -- User 'ueshin' has created a pull request for

[jira] [Assigned] (SPARK-22843) R localCheckpoint API

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22843: Assignee: (was: Apache Spark) > R localCheckpoint API > - > >

[jira] [Commented] (SPARK-22843) R localCheckpoint API

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303068#comment-16303068 ] Apache Spark commented on SPARK-22843: -- User 'HyukjinKwon' has created a pull reques

[jira] [Assigned] (SPARK-22843) R localCheckpoint API

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22843: Assignee: Apache Spark > R localCheckpoint API > - > >

[jira] [Commented] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2017-12-24 Thread Sandeep Kumar Choudhary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303067#comment-16303067 ] Sandeep Kumar Choudhary commented on SPARK-18844: - How can I get this tas

[jira] [Commented] (SPARK-18844) Add more binary classification metrics to BinaryClassificationMetrics

2017-12-24 Thread Sandeep Kumar Choudhary (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303066#comment-16303066 ] Sandeep Kumar Choudhary commented on SPARK-18844: - I want to work on this

[jira] [Assigned] (SPARK-22707) Optimize CrossValidator memory occupation by models in fitting

2017-12-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-22707: - Assignee: Weichen Xu > Optimize CrossValidator memory occupation by models in fi

[jira] [Updated] (SPARK-22707) Optimize CrossValidator memory occupation by models in fitting

2017-12-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22707: -- Shepherd: Joseph K. Bradley > Optimize CrossValidator memory occupation by models in fi

[jira] [Updated] (SPARK-22707) Optimize CrossValidator memory occupation by models in fitting

2017-12-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-22707: -- Target Version/s: 2.3.0 > Optimize CrossValidator memory occupation by models in fittin

[jira] [Assigned] (SPARK-22790) add a configurable factor to describe HadoopFsRelation's size

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22790: Assignee: (was: Apache Spark) > add a configurable factor to describe HadoopFsRelation

[jira] [Commented] (SPARK-22790) add a configurable factor to describe HadoopFsRelation's size

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303033#comment-16303033 ] Apache Spark commented on SPARK-22790: -- User 'CodingCat' has created a pull request

[jira] [Assigned] (SPARK-22790) add a configurable factor to describe HadoopFsRelation's size

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22790: Assignee: Apache Spark > add a configurable factor to describe HadoopFsRelation's size > -

[jira] [Commented] (SPARK-22599) Avoid extra reading for cached table

2017-12-24 Thread Nan Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303022#comment-16303022 ] Nan Zhu commented on SPARK-22599: - [~rajesh.balamohan] no, it means that SPARK-22599 and

[jira] [Resolved] (SPARK-22465) Cogroup of two disproportionate RDDs could lead into 2G limit BUG

2017-12-24 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mridul Muralidharan resolved SPARK-22465. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 2000

[jira] [Commented] (SPARK-22840) Incorrect results when using distinct on window

2017-12-24 Thread Denys Zadorozhnyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16302865#comment-16302865 ] Denys Zadorozhnyi commented on SPARK-22840: --- I'd like to take a stab at it if n

[jira] [Commented] (SPARK-22896) Improvement in String interpolation

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16302820#comment-16302820 ] Apache Spark commented on SPARK-22896: -- User 'chetkhatri' has created a pull request

[jira] [Commented] (SPARK-22879) LogisticRegression inconsistent prediction when proba == threshold

2017-12-24 Thread Adrien Lavoillotte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16302809#comment-16302809 ] Adrien Lavoillotte commented on SPARK-22879: Comparing the probability every

[jira] [Assigned] (SPARK-22896) Improvement in String interpolation

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22896: Assignee: Apache Spark > Improvement in String interpolation > --

[jira] [Commented] (SPARK-22896) Improvement in String interpolation

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16302767#comment-16302767 ] Apache Spark commented on SPARK-22896: -- User 'chetkhatri' has created a pull request

[jira] [Assigned] (SPARK-22896) Improvement in String interpolation

2017-12-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22896: Assignee: (was: Apache Spark) > Improvement in String interpolation > ---

[jira] [Created] (SPARK-22896) Improvement in String interpolation

2017-12-24 Thread Chetan Khatri (JIRA)
Chetan Khatri created SPARK-22896: - Summary: Improvement in String interpolation Key: SPARK-22896 URL: https://issues.apache.org/jira/browse/SPARK-22896 Project: Spark Issue Type: Improvemen