[jira] [Resolved] (SPARK-22017) watermark evaluation with multi-input stream operators is unspecified

2017-09-15 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-22017. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 19239

[jira] [Created] (SPARK-22034) CrossValidator's training and testing set with different set of labels, resulting in encoder transform error

2017-09-15 Thread AnChe Kuo (JIRA)
AnChe Kuo created SPARK-22034: - Summary: CrossValidator's training and testing set with different set of labels, resulting in encoder transform error Key: SPARK-22034 URL:

[jira] [Commented] (SPARK-22033) BufferHolder size checks should account for the specific VM array size limitations

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168650#comment-16168650 ] Sean Owen commented on SPARK-22033: --- Hm, good point. There may be other similar issues throughout the

[jira] [Commented] (SPARK-22033) BufferHolder size checks should account for the specific VM array size limitations

2017-09-15 Thread Vadim Semenov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168612#comment-16168612 ] Vadim Semenov commented on SPARK-22033: --- Leaving traces for others if they happen to hit the same

[jira] [Created] (SPARK-22033) BufferHolder size checks should account for the specific VM array size limitations

2017-09-15 Thread Vadim Semenov (JIRA)
Vadim Semenov created SPARK-22033: - Summary: BufferHolder size checks should account for the specific VM array size limitations Key: SPARK-22033 URL: https://issues.apache.org/jira/browse/SPARK-22033

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12297: Assignee: (was: Apache Spark) > Add work-around for Parquet/Hive int96 timestamp bug.

[jira] [Commented] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168410#comment-16168410 ] Apache Spark commented on SPARK-12297: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-12297: Assignee: Apache Spark > Add work-around for Parquet/Hive int96 timestamp bug. >

[jira] [Commented] (SPARK-21842) Support Kerberos ticket renewal and creation in Mesos

2017-09-15 Thread Arthur Rand (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168278#comment-16168278 ] Arthur Rand commented on SPARK-21842: - Hey [~kalvinnchau] I'm currently of the mind that using the

[jira] [Assigned] (SPARK-22032) Speed up StructType.fromInternal

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22032: Assignee: Apache Spark > Speed up StructType.fromInternal >

[jira] [Assigned] (SPARK-22032) Speed up StructType.fromInternal

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22032: Assignee: (was: Apache Spark) > Speed up StructType.fromInternal >

[jira] [Commented] (SPARK-22032) Speed up StructType.fromInternal

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168273#comment-16168273 ] Apache Spark commented on SPARK-22032: -- User 'maver1ck' has created a pull request for this issue:

[jira] [Created] (SPARK-22032) Speed up StructType.fromInternal

2017-09-15 Thread JIRA
Maciej Bryński created SPARK-22032: -- Summary: Speed up StructType.fromInternal Key: SPARK-22032 URL: https://issues.apache.org/jira/browse/SPARK-22032 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-22031) KMeans - Compute cost for a single vector

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-22031: -- Target Version/s: (was: 2.3.0) Labels: (was: newbie) Priority: Minor

[jira] [Commented] (SPARK-17025) Cannot persist PySpark ML Pipeline model that includes custom Transformer

2017-09-15 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168038#comment-16168038 ] Nicholas Chammas commented on SPARK-17025: -- I take that back. I won't be able to test this for

[jira] [Assigned] (SPARK-22030) GraphiteSink fails to re-connect to Graphite instances behind an ELB or any other auto-scaled LB

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22030: Assignee: (was: Apache Spark) > GraphiteSink fails to re-connect to Graphite

[jira] [Commented] (SPARK-22030) GraphiteSink fails to re-connect to Graphite instances behind an ELB or any other auto-scaled LB

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168030#comment-16168030 ] Apache Spark commented on SPARK-22030: -- User 'alexmnyc' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22030) GraphiteSink fails to re-connect to Graphite instances behind an ELB or any other auto-scaled LB

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22030: Assignee: Apache Spark > GraphiteSink fails to re-connect to Graphite instances behind an

[jira] [Updated] (SPARK-22031) KMeans - Compute cost for a single vector

2017-09-15 Thread Laurent Valdes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Laurent Valdes updated SPARK-22031: --- Summary: KMeans - Compute cost for a single vector (was: Compute cost for a single vector)

[jira] [Created] (SPARK-22031) Compute cost for a single vector

2017-09-15 Thread Laurent Valdes (JIRA)
Laurent Valdes created SPARK-22031: -- Summary: Compute cost for a single vector Key: SPARK-22031 URL: https://issues.apache.org/jira/browse/SPARK-22031 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-22030) GraphiteSink fails to re-connect to Graphite instances behind an ELB or any other auto-scaled LB

2017-09-15 Thread Alex Mikhailau (JIRA)
Alex Mikhailau created SPARK-22030: -- Summary: GraphiteSink fails to re-connect to Graphite instances behind an ELB or any other auto-scaled LB Key: SPARK-22030 URL:

[jira] [Commented] (SPARK-22029) Cache of _parse_datatype_json_string function

2017-09-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-22029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168017#comment-16168017 ] Maciej Bryński commented on SPARK-22029: I did a proof of concept with functools.lru_cache. But

[jira] [Created] (SPARK-22029) Cache of _parse_datatype_json_string function

2017-09-15 Thread JIRA
Maciej Bryński created SPARK-22029: -- Summary: Cache of _parse_datatype_json_string function Key: SPARK-22029 URL: https://issues.apache.org/jira/browse/SPARK-22029 Project: Spark Issue

[jira] [Updated] (SPARK-22024) [pySpark] Speeding up internal conversion for Spark SQL

2017-09-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-22024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-22024: --- Summary: [pySpark] Speeding up internal conversion for Spark SQL (was: [pySpark] Speeding

[jira] [Commented] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16168009#comment-16168009 ] Sean Owen commented on SPARK-22028: --- But that's a Java error or limit, even. What would Spark do? You

[jira] [Reopened] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Franz Wimmer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franz Wimmer reopened SPARK-22028: -- Sorry - the Error regarding the Hadoop binaries is normal for this system - I'm asking because of

[jira] [Issue Comment Deleted] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Franz Wimmer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franz Wimmer updated SPARK-22028: - Comment: was deleted (was: The Error with the Hadoop binaries is normal for this system - I'm

[jira] [Updated] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Franz Wimmer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franz Wimmer updated SPARK-22028: - Description: I have a strange environment variable in my Windows operating system: {code:none}

[jira] [Commented] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Franz Wimmer (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167970#comment-16167970 ] Franz Wimmer commented on SPARK-22028: -- The Error with the Hadoop binaries is normal for this system

[jira] [Resolved] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22028. --- Resolution: Not A Problem No, it indicates you don't have the win Hadoop binaries available. See the

[jira] [Created] (SPARK-22028) spark-submit trips over environment variables

2017-09-15 Thread Franz Wimmer (JIRA)
Franz Wimmer created SPARK-22028: Summary: spark-submit trips over environment variables Key: SPARK-22028 URL: https://issues.apache.org/jira/browse/SPARK-22028 Project: Spark Issue Type:

[jira] [Updated] (SPARK-22027) Explanation of default value of GBTRegressor's maxIter is missing in API doc

2017-09-15 Thread Kazunori Sakamoto (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazunori Sakamoto updated SPARK-22027: -- Labels: documentation (was: ) > Explanation of default value of GBTRegressor's

[jira] [Assigned] (SPARK-22027) Explanation of default value of GBTRegressor's maxIter is missing in API doc

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22027: Assignee: (was: Apache Spark) > Explanation of default value of GBTRegressor's

[jira] [Assigned] (SPARK-22027) Explanation of default value of GBTRegressor's maxIter is missing in API doc

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22027: Assignee: Apache Spark > Explanation of default value of GBTRegressor's maxIter is

[jira] [Commented] (SPARK-22027) Explanation of default value of GBTRegressor's maxIter is missing in API doc

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167944#comment-16167944 ] Apache Spark commented on SPARK-22027: -- User 'exKAZUu' has created a pull request for this issue:

[jira] [Created] (SPARK-22027) Explanation of default value of GBTRegressor's maxIter is missing in API doc

2017-09-15 Thread Kazunori Sakamoto (JIRA)
Kazunori Sakamoto created SPARK-22027: - Summary: Explanation of default value of GBTRegressor's maxIter is missing in API doc Key: SPARK-22027 URL: https://issues.apache.org/jira/browse/SPARK-22027

[jira] [Comment Edited] (SPARK-21996) Streaming ignores files with spaces in the file names

2017-09-15 Thread Xiayun Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167941#comment-16167941 ] Xiayun Sun edited comment on SPARK-21996 at 9/15/17 2:30 PM: - I can reproduce

[jira] [Commented] (SPARK-21996) Streaming ignores files with spaces in the file names

2017-09-15 Thread Xiayun Sun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167941#comment-16167941 ] Xiayun Sun commented on SPARK-21996: I can reproduce this issue for master branch, and found out it

[jira] [Commented] (SPARK-21996) Streaming ignores files with spaces in the file names

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167927#comment-16167927 ] Apache Spark commented on SPARK-21996: -- User 'xysun' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21996) Streaming ignores files with spaces in the file names

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21996: Assignee: Apache Spark > Streaming ignores files with spaces in the file names >

[jira] [Assigned] (SPARK-21996) Streaming ignores files with spaces in the file names

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21996: Assignee: (was: Apache Spark) > Streaming ignores files with spaces in the file names

[jira] [Created] (SPARK-22026) data source v2 write path

2017-09-15 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-22026: --- Summary: data source v2 write path Key: SPARK-22026 URL: https://issues.apache.org/jira/browse/SPARK-22026 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-22024) [pySpark] Speeding up fromInternal methods

2017-09-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-22024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-22024: --- Description: fromInternal methods of pySpark datatypes are bottleneck when using pySpark.

[jira] [Assigned] (SPARK-21958) Attempting to save large Word2Vec model hangs driver in constant GC.

2017-09-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath reassigned SPARK-21958: -- Assignee: Travis Hegner > Attempting to save large Word2Vec model hangs driver in

[jira] [Resolved] (SPARK-21958) Attempting to save large Word2Vec model hangs driver in constant GC.

2017-09-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-21958. Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19191

[jira] [Assigned] (SPARK-22025) Speeding up fromInternal for StructField

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22025: Assignee: (was: Apache Spark) > Speeding up fromInternal for StructField >

[jira] [Assigned] (SPARK-22025) Speeding up fromInternal for StructField

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22025: Assignee: Apache Spark > Speeding up fromInternal for StructField >

[jira] [Commented] (SPARK-22025) Speeding up fromInternal for StructField

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167846#comment-16167846 ] Apache Spark commented on SPARK-22025: -- User 'maver1ck' has created a pull request for this issue:

[jira] [Updated] (SPARK-22012) CLONE - Spark Streaming, Kafka receiver, "Failed to get records for ... after polling for 512"

2017-09-15 Thread Karan Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karan Singh updated SPARK-22012: Description: My Spark Streaming duration is 5 seconds (5000) and kafka is all at its default

[jira] [Comment Edited] (SPARK-19275) Spark Streaming, Kafka receiver, "Failed to get records for ... after polling for 512"

2017-09-15 Thread Karan Singh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16166112#comment-16166112 ] Karan Singh edited comment on SPARK-19275 at 9/15/17 1:02 PM: -- Hi Team , My

[jira] [Created] (SPARK-22025) Speeding up fromInternal for StructField

2017-09-15 Thread JIRA
Maciej Bryński created SPARK-22025: -- Summary: Speeding up fromInternal for StructField Key: SPARK-22025 URL: https://issues.apache.org/jira/browse/SPARK-22025 Project: Spark Issue Type:

[jira] [Updated] (SPARK-22024) [pySpark] Speeding up fromInternal methods

2017-09-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-22024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-22024: --- Summary: [pySpark] Speeding up fromInternal methods (was: Speeding up fromInternal methods)

[jira] [Updated] (SPARK-22010) Slow fromInternal conversion for TimestampType

2017-09-15 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-22010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej Bryński updated SPARK-22010: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-22024 > Slow fromInternal

[jira] [Created] (SPARK-22024) Speeding up fromInternal methods

2017-09-15 Thread JIRA
Maciej Bryński created SPARK-22024: -- Summary: Speeding up fromInternal methods Key: SPARK-22024 URL: https://issues.apache.org/jira/browse/SPARK-22024 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-7276) withColumn is very slow on dataframe with large number of columns

2017-09-15 Thread Barry Becker (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167814#comment-16167814 ] Barry Becker commented on SPARK-7276: - Isn't there still a problem with withColumn performance in

[jira] [Commented] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167806#comment-16167806 ] Nick Pentreath commented on SPARK-22021: Why a JavaScript function? I think this is not a good

[jira] [Comment Edited] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-09-15 Thread Jurgis Pods (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167788#comment-16167788 ] Jurgis Pods edited comment on SPARK-21994 at 9/15/17 12:13 PM: --- I have

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-09-15 Thread Jurgis Pods (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167788#comment-16167788 ] Jurgis Pods commented on SPARK-21994: - I have updated to CDH 5.12.1 and the problem persists. There

[jira] [Updated] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-09-15 Thread Jurgis Pods (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jurgis Pods updated SPARK-21994: Description: This seems to be a new bug introduced in Spark 2.2, since it did not occur under

[jira] [Updated] (SPARK-22023) Multi-column Spark SQL UDFs broken in Python 3

2017-09-15 Thread Oli Hall (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Oli Hall updated SPARK-22023: - Description: I've been testing some existing PySpark code after migrating to Python3, and there seems

[jira] [Created] (SPARK-22023) Multi-column Spark SQL UDFs broken in Python 3

2017-09-15 Thread Oli Hall (JIRA)
Oli Hall created SPARK-22023: Summary: Multi-column Spark SQL UDFs broken in Python 3 Key: SPARK-22023 URL: https://issues.apache.org/jira/browse/SPARK-22023 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22021: Assignee: (was: Apache Spark) > Add a feature transformation to accept a function and

[jira] [Commented] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167753#comment-16167753 ] Apache Spark commented on SPARK-22021: -- User 'narahari92' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22021: Assignee: Apache Spark > Add a feature transformation to accept a function and apply it

[jira] [Comment Edited] (SPARK-22019) JavaBean int type property

2017-09-15 Thread Jen-Ming Chung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167734#comment-16167734 ] Jen-Ming Chung edited comment on SPARK-22019 at 9/15/17 11:29 AM: -- The

[jira] [Commented] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Hosur Narahari (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167736#comment-16167736 ] Hosur Narahari commented on SPARK-22021: If I just apply this function, I can't use it in spark's

[jira] [Comment Edited] (SPARK-22019) JavaBean int type property

2017-09-15 Thread Jen-Ming Chung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167734#comment-16167734 ] Jen-Ming Chung edited comment on SPARK-22019 at 9/15/17 11:28 AM: -- The

[jira] [Commented] (SPARK-22019) JavaBean int type property

2017-09-15 Thread Jen-Ming Chung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167734#comment-16167734 ] Jen-Ming Chung commented on SPARK-22019: The alternative is giving the explicit schema instead

[jira] [Comment Edited] (SPARK-22019) JavaBean int type property

2017-09-15 Thread Jen-Ming Chung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167725#comment-16167725 ] Jen-Ming Chung edited comment on SPARK-22019 at 9/15/17 11:18 AM: -- Hi

[jira] [Commented] (SPARK-22019) JavaBean int type property

2017-09-15 Thread Jen-Ming Chung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167725#comment-16167725 ] Jen-Ming Chung commented on SPARK-22019: Hi [~client.test], The schema inferred after

[jira] [Created] (SPARK-22022) Unable to use Python Profiler with SparkSession

2017-09-15 Thread JIRA
Maciej Bryński created SPARK-22022: -- Summary: Unable to use Python Profiler with SparkSession Key: SPARK-22022 URL: https://issues.apache.org/jira/browse/SPARK-22022 Project: Spark Issue

[jira] [Commented] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167722#comment-16167722 ] Sean Owen commented on SPARK-22021: --- Why can't you just apply this function? Or implement Transformer.

[jira] [Created] (SPARK-22021) Add a feature transformation to accept a function and apply it on all rows of dataframe

2017-09-15 Thread Hosur Narahari (JIRA)
Hosur Narahari created SPARK-22021: -- Summary: Add a feature transformation to accept a function and apply it on all rows of dataframe Key: SPARK-22021 URL: https://issues.apache.org/jira/browse/SPARK-22021

[jira] [Assigned] (SPARK-21780) Simpler Dataset.sample API in R

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21780: Assignee: Apache Spark > Simpler Dataset.sample API in R >

[jira] [Commented] (SPARK-21780) Simpler Dataset.sample API in R

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167571#comment-16167571 ] Apache Spark commented on SPARK-21780: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-21780) Simpler Dataset.sample API in R

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21780: Assignee: (was: Apache Spark) > Simpler Dataset.sample API in R >

[jira] [Resolved] (SPARK-22020) Support session local timezone

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22020. --- Resolution: Duplicate > Support session local timezone > -- > >

[jira] [Resolved] (SPARK-20921) While reading from oracle database, it converts to wrong type.

2017-09-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-20921. --- Resolution: Duplicate > While reading from oracle database, it converts to wrong type. >

[jira] [Commented] (SPARK-21713) Replace LogicalPlan.isStreaming with OutputMode

2017-09-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167506#comment-16167506 ] Apache Spark commented on SPARK-21713: -- User 'joseph-torres' has created a pull request for this

[jira] [Resolved] (SPARK-21987) Spark 2.3 cannot read 2.2 event logs

2017-09-15 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21987. - Resolution: Fixed Assignee: Wenchen Fan Fix Version/s: 2.3.0 > Spark 2.3 cannot read 2.2

[jira] [Resolved] (SPARK-22002) Read JDBC table use custom schema support specify partial fields

2017-09-15 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22002. - Resolution: Fixed Assignee: Yuming Wang Fix Version/s: 2.3.0 > Read JDBC table use

[jira] [Commented] (SPARK-21994) Spark 2.2 can not read Parquet table created by itself

2017-09-15 Thread Jurgis Pods (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167407#comment-16167407 ] Jurgis Pods commented on SPARK-21994: - Thank you for testing. Which version of Hive are you using? It

[jira] [Created] (SPARK-22020) Support session local timezone

2017-09-15 Thread Navya Krishnappa (JIRA)
Navya Krishnappa created SPARK-22020: Summary: Support session local timezone Key: SPARK-22020 URL: https://issues.apache.org/jira/browse/SPARK-22020 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-21902) BlockManager.doPut will hide actually exception when exception thrown in finally block

2017-09-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-21902: Priority: Trivial (was: Major) > BlockManager.doPut will hide actually exception when exception

[jira] [Updated] (SPARK-21902) BlockManager.doPut will hide actually exception when exception thrown in finally block

2017-09-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-21902: Issue Type: Improvement (was: Wish) > BlockManager.doPut will hide actually exception when

[jira] [Assigned] (SPARK-21902) BlockManager.doPut will hide actually exception when exception thrown in finally block

2017-09-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao reassigned SPARK-21902: --- Assignee: zhoukang > BlockManager.doPut will hide actually exception when exception thrown

[jira] [Resolved] (SPARK-21902) BlockManager.doPut will hide actually exception when exception thrown in finally block

2017-09-15 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-21902. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19171

[jira] [Commented] (SPARK-20921) While reading from oracle database, it converts to wrong type.

2017-09-15 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16167373#comment-16167373 ] Yuming Wang commented on SPARK-20921: - Fixed by https://github.com/apache/spark/pull/18266. > While