[jira] [Commented] (SPARK-38809) Implement option to skip null values in symmetric hash impl of stream-stream joins

2022-04-06 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-38809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17518483#comment-17518483
 ] 

Apache Spark commented on SPARK-38809:
--

User 'anishshri-db' has created a pull request for this issue:
https://github.com/apache/spark/pull/36090

> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
> --
>
> Key: SPARK-38809
> URL: https://issues.apache.org/jira/browse/SPARK-38809
> Project: Spark
>  Issue Type: Bug
>  Components: Structured Streaming
>Affects Versions: 3.2.1
>Reporter: Anish Shrigondekar
>Priority: Major
>
> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
>  * In the symmetric has join state manager, we can receive entries with null 
> values for a key and that caused the `removeByValue` and get iterators to 
> fail and run into the NullPointerException.
>  * This is possible if the state recovered is written from a old spark 
> version or its corrupted on disk. Since we don't have a utility to query this 
> state, we would like to provide a conf option to skip nulls for the symmetric 
> hash impl in stream stream joins.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-38809) Implement option to skip null values in symmetric hash impl of stream-stream joins

2022-04-06 Thread Apache Spark (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-38809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17518481#comment-17518481
 ] 

Apache Spark commented on SPARK-38809:
--

User 'anishshri-db' has created a pull request for this issue:
https://github.com/apache/spark/pull/36090

> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
> --
>
> Key: SPARK-38809
> URL: https://issues.apache.org/jira/browse/SPARK-38809
> Project: Spark
>  Issue Type: Bug
>  Components: Structured Streaming
>Affects Versions: 3.2.1
>Reporter: Anish Shrigondekar
>Priority: Major
>
> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
>  * In the symmetric has join state manager, we can receive entries with null 
> values for a key and that caused the `removeByValue` and get iterators to 
> fail and run into the NullPointerException.
>  * This is possible if the state recovered is written from a old spark 
> version or its corrupted on disk. Since we don't have a utility to query this 
> state, we would like to provide a conf option to skip nulls for the symmetric 
> hash impl in stream stream joins.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-38809) Implement option to skip null values in symmetric hash impl of stream-stream joins

2022-04-06 Thread Anish Shrigondekar (Jira)


[ 
https://issues.apache.org/jira/browse/SPARK-38809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17518344#comment-17518344
 ] 

Anish Shrigondekar commented on SPARK-38809:


Working on this PR and will send the change out soon

> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
> --
>
> Key: SPARK-38809
> URL: https://issues.apache.org/jira/browse/SPARK-38809
> Project: Spark
>  Issue Type: Bug
>  Components: Structured Streaming
>Affects Versions: 3.2.1
>Reporter: Anish Shrigondekar
>Priority: Major
>
> Implement option to skip null values in symmetric hash impl of stream-stream 
> joins
>  * In the symmetric has join state manager, we can receive entries with null 
> values for a key and that caused the `removeByValue` and get iterators to 
> fail and run into the NullPointerException.
>  * This is possible if the state recovered is written from a old spark 
> version or its corrupted on disk. Since we don't have a utility to query this 
> state, we would like to provide a conf option to skip nulls for the symmetric 
> hash impl in stream stream joins.
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org