[ https://issues.apache.org/jira/browse/SPARK-42034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17686914#comment-17686914 ]
Nick Hryhoriev commented on SPARK-42034: ---------------------------------------- I can do PR, but I need someone to target me to some doc or piece of code to understand how this part of spark works. I really do not understand how rdd execution is linked to SQL query listeners. > QueryExecutionListener and Observation API, df.observe do not work with > `foreach` action. > ----------------------------------------------------------------------------------------- > > Key: SPARK-42034 > URL: https://issues.apache.org/jira/browse/SPARK-42034 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 3.1.3, 3.2.2, 3.3.1 > Environment: I test it locally and on YARN in cluster mode. > Spark 3.3.1 and 3.2.2 and 3.1.1. > Yarn 2.9.2 and 3.2.1. > Reporter: Nick Hryhoriev > Priority: Major > Labels: sql-api > > Observation API, {{observe}} dataframe transformation, and custom > QueryExecutionListener. > Do not work with {{foreach}} or {{foreachPartition actions.}} > {{This is due to }}QueryExecutionListener functions do not trigger on queries > whose action is {{foreach}} or {{{}foreachPartition{}}}. > But the Spark GUI SQL tab sees this query as SQL query and shows its query > plans and etc. > here is the code to reproduce it: > https://gist.github.com/GrigorievNick/e7cf9ec5584b417d9719e2812722e6d3 -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org