[jira] [Comment Edited] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063416#comment-16063416 ] Franklyn Dsouza edited comment on SPARK-21199 at 6/26/17 5:16 PM: -

[jira] [Comment Edited] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063416#comment-16063416 ] Franklyn Dsouza edited comment on SPARK-21199 at 6/26/17 5:16 PM: -

[jira] [Comment Edited] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063416#comment-16063416 ] Franklyn Dsouza edited comment on SPARK-21199 at 6/26/17 5:15 PM: -

[jira] [Commented] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063416#comment-16063416 ] Franklyn Dsouza commented on SPARK-21199: - For this particular scenario I have a

[jira] [Comment Edited] (SPARK-12806) Support SQL expressions extracting values from VectorUDT

2017-06-25 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061718#comment-16061718 ] Franklyn Dsouza edited comment on SPARK-12806 at 6/25/17 11:32 PM:

[jira] [Comment Edited] (SPARK-12806) Support SQL expressions extracting values from VectorUDT

2017-06-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061718#comment-16061718 ] Franklyn Dsouza edited comment on SPARK-12806 at 6/24/17 2:54 PM: -

[jira] [Updated] (SPARK-21199) Its not possible to impute Vector types

2017-06-23 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-21199: Description: There are cases where nulls end up in vector columns in dataframes. Currently

[jira] [Created] (SPARK-21199) Its not possible to impute Vector types

2017-06-23 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-21199: --- Summary: Its not possible to impute Vector types Key: SPARK-21199 URL: https://issues.apache.org/jira/browse/SPARK-21199 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-12806) Support SQL expressions extracting values from VectorUDT

2017-06-23 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061718#comment-16061718 ] Franklyn Dsouza commented on SPARK-12806: - [~mengxr] can we get this fixed ? > S

[jira] [Created] (SPARK-19844) UDF in when control function is executed before the when clause is evaluated.

2017-03-06 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19844: --- Summary: UDF in when control function is executed before the when clause is evaluated. Key: SPARK-19844 URL: https://issues.apache.org/jira/browse/SPARK-19844 P

[jira] [Created] (SPARK-19440) Window in pyspark doesn't have attributes unboundedPreceding, unboundedFollowing and currentRow

2017-02-02 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19440: --- Summary: Window in pyspark doesn't have attributes unboundedPreceding, unboundedFollowing and currentRow Key: SPARK-19440 URL: https://issues.apache.org/jira/browse/SPARK-19

[jira] [Closed] (SPARK-19388) Reading an empty folder as parquet causes an Analysis Exception

2017-01-27 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza closed SPARK-19388. --- Resolution: Fixed > Reading an empty folder as parquet causes an Analysis Exception > ---

[jira] [Created] (SPARK-19388) Reading an empty folder as parquet causes an Analysis Exception

2017-01-27 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19388: --- Summary: Reading an empty folder as parquet causes an Analysis Exception Key: SPARK-19388 URL: https://issues.apache.org/jira/browse/SPARK-19388 Project: Spark

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns cause data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Summary: Nulls in non nullable columns cause data corruption in parquet (was: Nulls in non

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Summary: Nulls in non nullable columns causes data corruption in parquet (was: Nulls in no

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a non-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and i

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a non-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Priority: Critical (was: Major) > Nulls in non nullable columns causes data corruption in

[jira] [Commented] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15832048#comment-15832048 ] Franklyn Dsouza commented on SPARK-19299: - These issues also are very likely repr

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and i

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and i

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and i

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and i

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and i

[jira] [Created] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19299: --- Summary: Nulls in non nullable columns causes data corruption in parquet Key: SPARK-19299 URL: https://issues.apache.org/jira/browse/SPARK-19299 Project: Spark

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:37 PM:

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:29 PM:

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:22 PM:

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:24 PM:

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:21 PM:

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:20 PM:

[jira] [Commented] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15761741#comment-15761741 ] Franklyn Dsouza commented on SPARK-18589: - The sequence of steps that causes this

[jira] [Created] (SPARK-16629) UDTs can not be compared to DataTypes in dataframes.

2016-07-19 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-16629: --- Summary: UDTs can not be compared to DataTypes in dataframes. Key: SPARK-16629 URL: https://issues.apache.org/jira/browse/SPARK-16629 Project: Spark Is

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-09 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Shepherd: Davies Liu > UDFs do not work in Spark 2.0-preview built with scala 2.10 > --

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Description: I've built spark-2.0-preview (8f5a04b) with scala-2.10 using the following {co

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Description: I've built spark-2.0-preview (8f5a04b) with scala-2.10 using the following {co

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Description: I've built spark-2.0-preview (8f5a04b) with scala-2.10 using the following {c

[jira] [Created] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-15811: --- Summary: UDFs do not work in Spark 2.0-preview built with scala 2.10 Key: SPARK-15811 URL: https://issues.apache.org/jira/browse/SPARK-15811 Project: Spark

[jira] [Closed] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza closed SPARK-14117. --- Resolution: Fixed > write.partitionBy retains partitioning column when outputting Parquet > -

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to gene

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to gene

[jira] [Created] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-14117: --- Summary: write.partitionBy retains partitioning column when outputting Parquet Key: SPARK-14117 URL: https://issues.apache.org/jira/browse/SPARK-14117 Project:

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to gene

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to gene

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to gene

[jira] [Updated] (SPARK-13730) Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT

2016-03-08 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13730: Description: Basically I'm putting nulls into a non-nullable LongType column and doing a t

[jira] [Updated] (SPARK-13730) Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT

2016-03-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13730: Description: Basically I'm putting nulls into a non-nullable LongType column and doing a t

[jira] [Created] (SPARK-13730) Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT

2016-03-07 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-13730: --- Summary: Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT Key: SPARK-13730 URL: https://issues.apache.org/jira/browse/SPARK-13730 Project: Spa

[jira] [Updated] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Description: Unioning two DataFrames that contain UDTs fails with {quote} AnalysisExceptio

[jira] [Updated] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Description: Unioning two DataFrames that contain UDTs fails with {quote} AnalysisExceptio

[jira] [Updated] (SPARK-13410) unionAll AnalysisException with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Summary: unionAll AnalysisException with DataFrames containing UDT columns. (was: unionAll

[jira] [Updated] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Description: Unioning two DataFrames that contain UDTs fails with {quote} AnalysisExceptio

[jira] [Created] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-13410: --- Summary: unionAll throws error with DataFrames containing UDT columns. Key: SPARK-13410 URL: https://issues.apache.org/jira/browse/SPARK-13410 Project: Spark

[jira] [Commented] (SPARK-1834) NoSuchMethodError when invoking JavaPairRDD.reduce() in Java

2014-08-04 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14085567#comment-14085567 ] Franklyn Dsouza commented on SPARK-1834: There is no reduce function in JavaPairRD