[jira] [Updated] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length
[ https://issues.apache.org/jira/browse/SPARK-30993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-30993: - Fix Version/s: 2.4.6 > GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql > type has fixed length > --- > > Key: SPARK-30993 > URL: https://issues.apache.org/jira/browse/SPARK-30993 > Project: Spark > Issue Type: Bug > Components: SQL >Affects Versions: 2.3.0, 2.3.1, 2.3.2, 2.3.3, 2.3.4, 2.4.5, 3.0.0 >Reporter: Jungtaek Lim >Assignee: Jungtaek Lim >Priority: Major > Labels: correctness > Fix For: 3.0.0, 2.4.6 > > > This is reported by user mailing list, though the mail thread is regarding > suspect of the behavior of mapGroupsWithState. > [https://lists.apache.org/thread.html/r08b44a7afac4e4c971633d30b4e5d11bd7c0d6e28180e03b874ea58b%40%3Cuser.spark.apache.org%3E] > The actual culprit is, there're a couple of methods which don't handle UDT > and it makes GenerateUnsafeRowJoiner to generate incorrect code. > Specifically, the issue occurs when the sql type of UDT has fixed length - > GenerateUnsafeRowJoiner has the logic to update the offset position for all > variable-length data, and due to this bug, UDT field with fixed length is > being treated as variable-length data and its value is modified. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length
[ https://issues.apache.org/jira/browse/SPARK-30993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-30993: -- Affects Version/s: 2.3.0 2.3.1 2.3.2 2.3.3 > GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql > type has fixed length > --- > > Key: SPARK-30993 > URL: https://issues.apache.org/jira/browse/SPARK-30993 > Project: Spark > Issue Type: Bug > Components: SQL >Affects Versions: 2.3.0, 2.3.1, 2.3.2, 2.3.3, 2.3.4, 2.4.5, 3.0.0 >Reporter: Jungtaek Lim >Assignee: Jungtaek Lim >Priority: Major > Labels: correctness > Fix For: 3.0.0 > > > This is reported by user mailing list, though the mail thread is regarding > suspect of the behavior of mapGroupsWithState. > [https://lists.apache.org/thread.html/r08b44a7afac4e4c971633d30b4e5d11bd7c0d6e28180e03b874ea58b%40%3Cuser.spark.apache.org%3E] > The actual culprit is, there're a couple of methods which don't handle UDT > and it makes GenerateUnsafeRowJoiner to generate incorrect code. > Specifically, the issue occurs when the sql type of UDT has fixed length - > GenerateUnsafeRowJoiner has the logic to update the offset position for all > variable-length data, and due to this bug, UDT field with fixed length is > being treated as variable-length data and its value is modified. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length
[ https://issues.apache.org/jira/browse/SPARK-30993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-30993: -- Labels: correctness (was: ) > GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql > type has fixed length > --- > > Key: SPARK-30993 > URL: https://issues.apache.org/jira/browse/SPARK-30993 > Project: Spark > Issue Type: Bug > Components: SQL >Affects Versions: 2.3.4, 2.4.5, 3.0.0 >Reporter: Jungtaek Lim >Assignee: Jungtaek Lim >Priority: Major > Labels: correctness > Fix For: 3.0.0 > > > This is reported by user mailing list, though the mail thread is regarding > suspect of the behavior of mapGroupsWithState. > [https://lists.apache.org/thread.html/r08b44a7afac4e4c971633d30b4e5d11bd7c0d6e28180e03b874ea58b%40%3Cuser.spark.apache.org%3E] > The actual culprit is, there're a couple of methods which don't handle UDT > and it makes GenerateUnsafeRowJoiner to generate incorrect code. > Specifically, the issue occurs when the sql type of UDT has fixed length - > GenerateUnsafeRowJoiner has the logic to update the offset position for all > variable-length data, and due to this bug, UDT field with fixed length is > being treated as variable-length data and its value is modified. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length
[ https://issues.apache.org/jira/browse/SPARK-30993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-30993: - Affects Version/s: 2.3.4 2.4.5 > GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql > type has fixed length > --- > > Key: SPARK-30993 > URL: https://issues.apache.org/jira/browse/SPARK-30993 > Project: Spark > Issue Type: Bug > Components: SQL >Affects Versions: 2.3.4, 2.4.5, 3.0.0 >Reporter: Jungtaek Lim >Priority: Major > > This is reported by user mailing list, though the mail thread is regarding > suspect of the behavior of mapGroupsWithState. > [https://lists.apache.org/thread.html/r08b44a7afac4e4c971633d30b4e5d11bd7c0d6e28180e03b874ea58b%40%3Cuser.spark.apache.org%3E] > The actual culprit is, there're a couple of methods which don't handle UDT > and it makes GenerateUnsafeRowJoiner to generate incorrect code. > Specifically, the issue occurs when the sql type of UDT has fixed length - > GenerateUnsafeRowJoiner has the logic to update the offset position for all > variable-length data, and due to this bug, UDT field with fixed length is > being treated as variable-length data and its value is modified. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-30993) GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length
[ https://issues.apache.org/jira/browse/SPARK-30993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-30993: - Summary: GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql type has fixed length (was: GenerateUnsafeRowJoiner incorrectly modifies the value if the datatype is UDF and its sql type has fixed length) > GenerateUnsafeRowJoiner corrupts the value if the datatype is UDF and its sql > type has fixed length > --- > > Key: SPARK-30993 > URL: https://issues.apache.org/jira/browse/SPARK-30993 > Project: Spark > Issue Type: Bug > Components: SQL >Affects Versions: 3.0.0 >Reporter: Jungtaek Lim >Priority: Major > > This is reported by user mailing list, though the mail thread is regarding > suspect of the behavior of mapGroupsWithState. > [https://lists.apache.org/thread.html/r08b44a7afac4e4c971633d30b4e5d11bd7c0d6e28180e03b874ea58b%40%3Cuser.spark.apache.org%3E] > The actual culprit is, there're a couple of methods which don't handle UDT > and it makes GenerateUnsafeRowJoiner to generate incorrect code. > Specifically, the issue occurs when the sql type of UDT has fixed length - > GenerateUnsafeRowJoiner has the logic to update the offset position for all > variable-length data, and due to this bug, UDT field with fixed length is > being treated as variable-length data and its value is modified. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org