[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623672#comment-16623672 ] Wenchen Fan commented on SPARK-22739: - Here it is: https://issues.apache.org/jira/browse/SPARK-24768 > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623665#comment-16623665 ] Aleksander Eskilson commented on SPARK-22739: - [~cloud_fan], could you perhaps link here the Spark issue (if any) and PR that committed built-in Avro support to Spark? I would like to take a look at how Avro support for Datasets that was created through the PR on this ticket, and an additional PR in Spark-Avro (see [#217|https://github.com/databricks/spark-avro/pull/217]), might be folded in to the new code. I would imagine that process would be including the same expressions this ticket would have included, and then the new AvroEncoder that was to be included in the Spark-Avro project. cc: [~marmbrus] > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16623151#comment-16623151 ] Wenchen Fan commented on SPARK-22739: - I'm closing this ticket, since avro is now a builtin data source. We can create another ticket to support avro records in Dataset. > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16478298#comment-16478298 ] Apache Spark commented on SPARK-22739: -- User 'bdrillard' has created a pull request for this issue: https://github.com/apache/spark/pull/21348 > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16336616#comment-16336616 ] Dongjoon Hyun commented on SPARK-22739: --- Thank you! > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16336565#comment-16336565 ] Sameer Agarwal commented on SPARK-22739: Agree, I'll re-target this for 2.4.0 for now – [~aeskilson] [~marmbrus] please let us know if you disagree. > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16336487#comment-16336487 ] Dongjoon Hyun commented on SPARK-22739: --- Since this is a new feature and the PR is still marked as WIP, can we change the target version to 2.4.0? > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Major > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v7.6.3#76005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16312281#comment-16312281 ] Sameer Agarwal commented on SPARK-22739: Raising priority for this. This would be great to have in 2.3! > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson >Priority: Critical > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16303985#comment-16303985 ] Apache Spark commented on SPARK-22739: -- User 'bdrillard' has created a pull request for this issue: https://github.com/apache/spark/pull/20085 > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16299232#comment-16299232 ] Michael Armbrust commented on SPARK-22739: -- Sounds good to me. I'm happy to provide pointers on the PR, just "@marmbrus". > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16299177#comment-16299177 ] Aleksander Eskilson commented on SPARK-22739: - Yes. I've got a branch with the new expressions added. I had been trying to find existing tests over the expressions in {{Objects}} for patterns to copy in unit tests for the new expressions, but was having difficulty finding any that seemed representative, so I could use some guidance. I'll prepare that branch for a PR to field suggestions if that sounds good. > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-22739) Additional Expression Support for Objects
[ https://issues.apache.org/jira/browse/SPARK-22739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16299167#comment-16299167 ] Michael Armbrust commented on SPARK-22739: -- Any progress on this? Branch cut is January 1st, and I'd love to include this. > Additional Expression Support for Objects > - > > Key: SPARK-22739 > URL: https://issues.apache.org/jira/browse/SPARK-22739 > Project: Spark > Issue Type: New Feature > Components: SQL >Affects Versions: 2.3.0 >Reporter: Aleksander Eskilson > > Some discussion in Spark-Avro [1] motivates additions and minor changes to > the {{Objects}} Expressions API [2]. The proposed changes include > * a generalized form of {{initializeJavaBean}} taking a sequence of > initialization expressions that can be applied to instances of varying objects > * an object cast that performs a simple Java type cast against a value > * making {{ExternalMapToCatalyst}} public, for use in outside libraries > These changes would facilitate the writing of custom encoders for varying > objects that cannot already be readily converted to a statically typed > dataset by a JavaBean encoder (e.g. Avro). > [1] -- > https://github.com/databricks/spark-avro/pull/217#issuecomment-342599110 > [2] -- > > https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/objects/objects.scala -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org