[ https://issues.apache.org/jira/browse/SPARK-42988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17760829#comment-17760829 ]
Wujunzhe edited comment on SPARK-42988 at 8/31/23 9:45 AM: ----------------------------------------------------------- h3. I have the same question. Here's the question I posed on stackflow [link title|[apache spark - Why are spark3 dynamic partitions slow to write to hive - Stack Overflow|https://stackoverflow.com/questions/76997680/why-are-spark3-dynamic-partitions-slow-to-write-to-hive]] This is one of my tasks, the code is almost the same adjusted part of the log printing, respectively, using {{spark2}} (left) and {{spark3}} run (right), in the case of the same parameters, {{spark3}} each job running speed are significantly better than {{{}spark3{}}}, but the total running time {{spark3}} spent 1.2h, while {{spark2}} only spent 44 min. Why this phenomenon occurs? What is this extra time used for?[The top part is the eventTime forspark2 r, and the bottom part is for a spark3] !image-2023-08-31-17-42-31-348.png! was (Author: JIRAUSER299051): h3. I have the same question. this is my question that report on stackflow [link title|[apache spark - Why are spark3 dynamic partitions slow to write to hive - Stack Overflow|https://stackoverflow.com/questions/76997680/why-are-spark3-dynamic-partitions-slow-to-write-to-hive]] This is one of my tasks, the code is almost the same adjusted part of the log printing, respectively, using {{spark2}} (left) and {{spark3}} run (right), in the case of the same parameters, {{spark3}} each job running speed are significantly better than {{{}spark3{}}}, but the total running time {{spark3}} spent 1.2h, while {{spark2}} only spent 44 min. Why this phenomenon occurs? What is this extra time used for?[The top part is the eventTime forspark2 r, and the bottom part is for a spark3] !image-2023-08-31-17-42-31-348.png! > Spark Sql insert into hive table dynamic partitions slow > --------------------------------------------------------- > > Key: SPARK-42988 > URL: https://issues.apache.org/jira/browse/SPARK-42988 > Project: Spark > Issue Type: Question > Components: SQL > Affects Versions: 3.3.0 > Reporter: thomasgx > Priority: Major > Attachments: image-2023-08-31-17-42-31-348.png > > -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org