[jira] [Resolved] (SPARK-22418) Add test cases for NULL Handling

2017-11-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22418. - Resolution: Fixed Fix Version/s: 2.3.0 > Add test cases for NULL Handling >

[jira] [Assigned] (SPARK-22418) Add test cases for NULL Handling

2017-11-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-22418: --- Assignee: Marco Gaido > Add test cases for NULL Handling > > >

[jira] [Commented] (SPARK-22436) New function strip() to remove all whitespace from string

2017-11-03 Thread Eric Maynard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238712#comment-16238712 ] Eric Maynard commented on SPARK-22436: -- [~asmaier] Wouldn't the right way to implement this be to

[jira] [Updated] (SPARK-22446) Optimizer causing StringIndexerModel's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-03 Thread Greg Bellchambers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Bellchambers updated SPARK-22446: -- Summary: Optimizer causing StringIndexerModel's indexer UDF to throw "Unseen label"

[jira] [Updated] (SPARK-22446) Optimizer causing StringIndexer's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-03 Thread Greg Bellchambers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Bellchambers updated SPARK-22446: -- Description: In the following, the `indexer` UDF defined inside the

[jira] [Created] (SPARK-22446) Optimizer causing StringIndexer's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-03 Thread Greg Bellchambers (JIRA)
Greg Bellchambers created SPARK-22446: - Summary: Optimizer causing StringIndexer's indexer UDF to throw "Unseen label" exception incorrectly for filtered data. Key: SPARK-22446 URL:

[jira] [Assigned] (SPARK-22445) move CodegenContext.copyResult to CodegenSupport

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22445: Assignee: Wenchen Fan (was: Apache Spark) > move CodegenContext.copyResult to

[jira] [Assigned] (SPARK-22445) move CodegenContext.copyResult to CodegenSupport

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22445: Assignee: Apache Spark (was: Wenchen Fan) > move CodegenContext.copyResult to

[jira] [Comment Edited] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float

2017-11-03 Thread Tor Myklebust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238628#comment-16238628 ] Tor Myklebust edited comment on SPARK-22441 at 11/4/17 12:28 AM: - I wrote

[jira] [Commented] (SPARK-22445) move CodegenContext.copyResult to CodegenSupport

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238635#comment-16238635 ] Apache Spark commented on SPARK-22445: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float

2017-11-03 Thread Tor Myklebust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238628#comment-16238628 ] Tor Myklebust commented on SPARK-22441: --- I wrote this code a while ago, but I don't think REAL ->

[jira] [Created] (SPARK-22445) move CodegenContext.copyResult to CodegenSupport

2017-11-03 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-22445: --- Summary: move CodegenContext.copyResult to CodegenSupport Key: SPARK-22445 URL: https://issues.apache.org/jira/browse/SPARK-22445 Project: Spark Issue Type:

[jira] [Commented] (SPARK-22444) Spark History Server missing /environment endpoint/api

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238615#comment-16238615 ] Apache Spark commented on SPARK-22444: -- User 'ambud' has created a pull request for this issue:

[jira] [Resolved] (SPARK-22444) Spark History Server missing /environment endpoint/api

2017-11-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-22444. Resolution: Won't Fix Fix Version/s: (was: 1.6.4) Features like this are not

[jira] [Created] (SPARK-22444) Spark History Server missing /environment endpoint/api

2017-11-03 Thread Ambud Sharma (JIRA)
Ambud Sharma created SPARK-22444: Summary: Spark History Server missing /environment endpoint/api Key: SPARK-22444 URL: https://issues.apache.org/jira/browse/SPARK-22444 Project: Spark Issue

[jira] [Commented] (SPARK-22443) AggregatedDialect doesn't override quoteIdentifier and other methods in JdbcDialects

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238482#comment-16238482 ] Sean Owen commented on SPARK-22443: --- Good catch. I suppose that this and getTableExistsQuery and

[jira] [Updated] (SPARK-22443) AggregatedDialect doesn't override quoteIdentifier and other methods in JdbcDialects

2017-11-03 Thread Hongbo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hongbo updated SPARK-22443: --- Summary: AggregatedDialect doesn't override quoteIdentifier and other methods in JdbcDialects (was:

[jira] [Created] (SPARK-22443) AggregatedDialect doesn't work for quoteIdentifier

2017-11-03 Thread Hongbo (JIRA)
Hongbo created SPARK-22443: -- Summary: AggregatedDialect doesn't work for quoteIdentifier Key: SPARK-22443 URL: https://issues.apache.org/jira/browse/SPARK-22443 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-14703) Spark uses SLF4J, but actually relies quite heavily on Log4J

2017-11-03 Thread Harry Weppner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238409#comment-16238409 ] Harry Weppner commented on SPARK-14703: --- [~happy15sheng] I've discovered that {{log4j-over-slf4j}}

[jira] [Updated] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-03 Thread Mikel San Vicente (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikel San Vicente updated SPARK-22442: -- Description: Product encoder encodes special characters wrongly when field name

[jira] [Updated] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-03 Thread Mikel San Vicente (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikel San Vicente updated SPARK-22442: -- Description: Product encoder encodes special characters wrongly when field name

[jira] [Updated] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-03 Thread Mikel San Vicente (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikel San Vicente updated SPARK-22442: -- Description: Product encoder encodes special characters wrongly when field name

[jira] [Created] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-03 Thread Mikel San Vicente (JIRA)
Mikel San Vicente created SPARK-22442: - Summary: Schema generated by Product Encoder doesn't match case class field name when using non-standard characters Key: SPARK-22442 URL:

[jira] [Commented] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238313#comment-16238313 ] Sean Owen commented on SPARK-22441: --- You have a point. In the same file, the reverse mapping maps float

[jira] [Created] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float

2017-11-03 Thread Hongbo (JIRA)
Hongbo created SPARK-22441: -- Summary: JDBC REAL type is mapped to Double instead of Float Key: SPARK-22441 URL: https://issues.apache.org/jira/browse/SPARK-22441 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Henry Robinson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238281#comment-16238281 ] Henry Robinson commented on SPARK-22211: Sounds good, thanks both. > LimitPushDown optimization

[jira] [Commented] (SPARK-22440) Add Calinski-Harabasz index to ClusteringEvaluator

2017-11-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238241#comment-16238241 ] Marco Gaido commented on SPARK-22440: - Honestly I don't know what people are using for clustering

[jira] [Commented] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238202#comment-16238202 ] Sean Owen commented on SPARK-22424: --- Yes, but how is that related to the tasks you are highlighting?

[jira] [Commented] (SPARK-22440) Add Calinski-Harabasz index to ClusteringEvaluator

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238179#comment-16238179 ] Sean Owen commented on SPARK-22440: --- I had honestly never heard of this. Is this widely used at all? I

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238150#comment-16238150 ] Sean Owen commented on SPARK-22433: --- Yeah, R^2 _could_ be used as a metric but it belongs a bit more as

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Seth Hendrickson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238137#comment-16238137 ] Seth Hendrickson commented on SPARK-22433: -- The main problem I see is that we put "r2" in the

[jira] [Commented] (SPARK-22440) Add Calinski-Harabasz index to ClusteringEvaluator

2017-11-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238095#comment-16238095 ] Marco Gaido commented on SPARK-22440: - I am preparing an implementation for this. It will stil take

[jira] [Created] (SPARK-22440) Add Calinski-Harabasz index to ClusteringEvaluator

2017-11-03 Thread Marco Gaido (JIRA)
Marco Gaido created SPARK-22440: --- Summary: Add Calinski-Harabasz index to ClusteringEvaluator Key: SPARK-22440 URL: https://issues.apache.org/jira/browse/SPARK-22440 Project: Spark Issue Type:

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237884#comment-16237884 ] Teng Peng commented on SPARK-22433: --- Thanks for the quick response, Sean. I am glad this issue is

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237830#comment-16237830 ] Sean Owen commented on SPARK-22433: --- I think one misunderstanding here is that you need not apply an

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237825#comment-16237825 ] Xiao Li commented on SPARK-22211: - We should merge it to the master and the previous releases at first.

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237822#comment-16237822 ] Sean Owen commented on SPARK-22211: --- In the name of correctness, still worth disabling for now, and

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237797#comment-16237797 ] Xiao Li commented on SPARK-22211: - The Join operator should be limit aware. Anyway, we can do it later.

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Henry Robinson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237778#comment-16237778 ] Henry Robinson commented on SPARK-22211: [~smilegator] - sounds good! What will your approach be?

[jira] [Assigned] (SPARK-22417) createDataFrame from a pandas.DataFrame reads datetime64 values as longs

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22417: Assignee: (was: Apache Spark) > createDataFrame from a pandas.DataFrame reads

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237774#comment-16237774 ] Teng Peng commented on SPARK-22433: --- What I agree with you: be coherent, and we prefer ML-oreinted

[jira] [Assigned] (SPARK-22417) createDataFrame from a pandas.DataFrame reads datetime64 values as longs

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22417: Assignee: Apache Spark > createDataFrame from a pandas.DataFrame reads datetime64 values

[jira] [Commented] (SPARK-22147) BlockId.hashCode allocates a StringBuilder/String on each call

2017-11-03 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237775#comment-16237775 ] Bryan Cutler commented on SPARK-22147: -- Sorry, I linked the above PR to this JIRA accidentally >

[jira] [Commented] (SPARK-22417) createDataFrame from a pandas.DataFrame reads datetime64 values as longs

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237772#comment-16237772 ] Apache Spark commented on SPARK-22417: -- User 'BryanCutler' has created a pull request for this

[jira] [Commented] (SPARK-22426) Spark AM launching containers on node where External spark shuffle service failed to initialize

2017-11-03 Thread Prabhu Joseph (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237769#comment-16237769 ] Prabhu Joseph commented on SPARK-22426: --- Thanks [~jerryshao], we can close this as a duplicate. >

[jira] [Commented] (SPARK-22437) jdbc write fails to set default mode

2017-11-03 Thread Adrian Bridgett (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237733#comment-16237733 ] Adrian Bridgett commented on SPARK-22437: - good grief that was fast! > jdbc write fails to set

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-11-03 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237731#comment-16237731 ] Timothy Hunter commented on SPARK-21866: Adding {{spark.read.image}} is going to create a (soft)

[jira] [Commented] (SPARK-22430) Unknown tag warnings when building R docs with Roxygen 6.0.1

2017-11-03 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237719#comment-16237719 ] Felix Cheung commented on SPARK-22430: -- I am seeing it too. I think we can just remove the tag but

[jira] [Assigned] (SPARK-22437) jdbc write fails to set default mode

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22437: Assignee: (was: Apache Spark) > jdbc write fails to set default mode >

[jira] [Commented] (SPARK-22437) jdbc write fails to set default mode

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237716#comment-16237716 ] Apache Spark commented on SPARK-22437: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22437) jdbc write fails to set default mode

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22437: Assignee: Apache Spark > jdbc write fails to set default mode >

[jira] [Commented] (SPARK-22433) Linear regression R^2 train/test terminology related

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237676#comment-16237676 ] Sean Owen commented on SPARK-22433: --- Likewise, the goal here is not to adopt statistics terminology. As

[jira] [Assigned] (SPARK-22418) Add test cases for NULL Handling

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22418: Assignee: (was: Apache Spark) > Add test cases for NULL Handling >

[jira] [Assigned] (SPARK-22418) Add test cases for NULL Handling

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22418: Assignee: Apache Spark > Add test cases for NULL Handling >

[jira] [Commented] (SPARK-22418) Add test cases for NULL Handling

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237647#comment-16237647 ] Apache Spark commented on SPARK-22418: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Commented] (SPARK-22439) Not able to get numeric columns for the file having decimal values

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237626#comment-16237626 ] Sean Owen commented on SPARK-22439: --- Your code doesn't work as is, but I adapted it to Scala to try it.

[jira] [Commented] (SPARK-22438) OutOfMemoryError on very small data sets

2017-11-03 Thread Morten Hornbech (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237553#comment-16237553 ] Morten Hornbech commented on SPARK-22438: - I honestly can't see whether those are duplicates. I

[jira] [Assigned] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22407: - Assignee: zhoukang > Add rdd id column on storage page to speed up navigating >

[jira] [Resolved] (SPARK-22407) Add rdd id column on storage page to speed up navigating

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22407. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19625

[jira] [Commented] (SPARK-22436) New function strip() to remove all whitespace from string

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237530#comment-16237530 ] Sean Owen commented on SPARK-22436: --- Yes, but that's true of any Python UDF, and not every one can be

[jira] [Resolved] (SPARK-22438) OutOfMemoryError on very small data sets

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22438. --- Resolution: Duplicate Have a look through JIRA first. This looks like

[jira] [Updated] (SPARK-22439) Not able to get numeric columns for the file having decimal values

2017-11-03 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navya Krishnappa updated SPARK-22439: - Summary: Not able to get numeric columns for the file having decimal values (was: Not

[jira] [Updated] (SPARK-22439) Not able to get numeric columns for the attached file

2017-11-03 Thread Navya Krishnappa (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Navya Krishnappa updated SPARK-22439: - Summary: Not able to get numeric columns for the attached file (was: Not able to get

[jira] [Created] (SPARK-22439) Not able to get numeric column for the attached file

2017-11-03 Thread Navya Krishnappa (JIRA)
Navya Krishnappa created SPARK-22439: Summary: Not able to get numeric column for the attached file Key: SPARK-22439 URL: https://issues.apache.org/jira/browse/SPARK-22439 Project: Spark

[jira] [Updated] (SPARK-22438) OutOfMemoryError on very small data sets

2017-11-03 Thread Morten Hornbech (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Morten Hornbech updated SPARK-22438: Description: We have a customer that uses Spark as an engine for running SQL on a

[jira] [Created] (SPARK-22438) OutOfMemoryError on very small data sets

2017-11-03 Thread Morten Hornbech (JIRA)
Morten Hornbech created SPARK-22438: --- Summary: OutOfMemoryError on very small data sets Key: SPARK-22438 URL: https://issues.apache.org/jira/browse/SPARK-22438 Project: Spark Issue Type:

[jira] [Created] (SPARK-22437) jdbc write fails to set default mode

2017-11-03 Thread Adrian Bridgett (JIRA)
Adrian Bridgett created SPARK-22437: --- Summary: jdbc write fails to set default mode Key: SPARK-22437 URL: https://issues.apache.org/jira/browse/SPARK-22437 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-22436) New function strip() to remove all whitespace from string

2017-11-03 Thread Andreas Maier (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237446#comment-16237446 ] Andreas Maier commented on SPARK-22436: --- Python UDFs are very slow, aren't they? I believe a Spark

[jira] [Commented] (SPARK-22418) Add test cases for NULL Handling

2017-11-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22418?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237442#comment-16237442 ] Marco Gaido commented on SPARK-22418: - can I work on this? > Add test cases for NULL Handling >

[jira] [Commented] (SPARK-22436) New function strip() to remove all whitespace from string

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237420#comment-16237420 ] Sean Owen commented on SPARK-22436: --- Why does it need to be in Spark as opposed to a simple UDF? > New

[jira] [Updated] (SPARK-22435) Support processing array and map type using script

2017-11-03 Thread jin xing (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] jin xing updated SPARK-22435: - Priority: Major (was: Critical) > Support processing array and map type using script >

[jira] [Created] (SPARK-22436) New function strip() to remove all whitespace from string

2017-11-03 Thread Andreas Maier (JIRA)
Andreas Maier created SPARK-22436: - Summary: New function strip() to remove all whitespace from string Key: SPARK-22436 URL: https://issues.apache.org/jira/browse/SPARK-22436 Project: Spark

[jira] [Assigned] (SPARK-22435) Support processing array and map type using script

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22435: Assignee: Apache Spark > Support processing array and map type using script >

[jira] [Commented] (SPARK-22435) Support processing array and map type using script

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237407#comment-16237407 ] Apache Spark commented on SPARK-22435: -- User 'jinxing64' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22435) Support processing array and map type using script

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22435: Assignee: (was: Apache Spark) > Support processing array and map type using script >

[jira] [Created] (SPARK-22435) Support processing array and map type using script

2017-11-03 Thread jin xing (JIRA)
jin xing created SPARK-22435: Summary: Support processing array and map type using script Key: SPARK-22435 URL: https://issues.apache.org/jira/browse/SPARK-22435 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-22434) Spark structured streaming with HBase

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22434. --- Resolution: Invalid Fix Version/s: (was: 2.0.3) Questions are for the mailing list >

[jira] [Created] (SPARK-22434) Spark structured streaming with HBase

2017-11-03 Thread Harendra Singh (JIRA)
Harendra Singh created SPARK-22434: -- Summary: Spark structured streaming with HBase Key: SPARK-22434 URL: https://issues.apache.org/jira/browse/SPARK-22434 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-22420) Spark SQL return invalid json string for struct with date/datetime field

2017-11-03 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237368#comment-16237368 ] Marco Gaido commented on SPARK-22420: - I think this is related and will be resolved by SPARK-20202 >

[jira] [Commented] (SPARK-21668) Ability to run driver programs within a container

2017-11-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237329#comment-16237329 ] Apache Spark commented on SPARK-21668: -- User 'tashoyan' has created a pull request for this issue:

[jira] [Commented] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237282#comment-16237282 ] Sean Owen commented on SPARK-22429: --- [~tmgstev] it will have to be vs. master. Master and all branches

[jira] [Commented] (SPARK-22430) Unknown tag warnings when building R docs with Roxygen 6.0.1

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237268#comment-16237268 ] Sean Owen commented on SPARK-22430: --- I see it too, because I have Roxygen 6.0.1 locally. [~felixcheung]

[jira] [Updated] (SPARK-22427) StackOverFlowError when using FPGrowth

2017-11-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-22427: -- Description: code part: val path = jobConfig.getString("hdfspath") val vectordata =

[jira] [Commented] (SPARK-22423) Scala test source files like TestHiveSingleton.scala should be in scala source root

2017-11-03 Thread xubo245 (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237176#comment-16237176 ] xubo245 commented on SPARK-22423: - OK, I will fix it. > Scala test source files like

[jira] [Commented] (SPARK-22427) StackOverFlowError when using FPGrowth

2017-11-03 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237174#comment-16237174 ] yuhao yang commented on SPARK-22427: Could you please try to increase the stack size, E.g. with

[jira] [Updated] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22211: Target Version/s: 2.2.1, 2.3.0 > LimitPushDown optimization for FullOuterJoin generates wrong results >

[jira] [Commented] (SPARK-22211) LimitPushDown optimization for FullOuterJoin generates wrong results

2017-11-03 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237133#comment-16237133 ] Xiao Li commented on SPARK-22211: - Will submit a PR based on my previous PR