[ https://issues.apache.org/jira/browse/SPARK-35365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342981#comment-17342981 ]
Yuming Wang commented on SPARK-35365:
-------------------------------------

{noformat}
-- 2.4
org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveReferences    46101271113 / 47150195238    1415 / 2368
-- 3.1
org.apache.spark.sql.catalyst.analysis.Analyzer$ResolveReferences    113500482546 / 115500482546    2300 / 4124
{noformat}

> spark3.1.1 use too long time to analyze table fields
> ----------------------------------------------------
>
>                 Key: SPARK-35365
>                 URL: https://issues.apache.org/jira/browse/SPARK-35365
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.1.1
>            Reporter: yao
>            Priority: Major
>         Attachments: spark2.4report, spark3.1.1_report_originalsql, spark3.11report
>
>
> I have a large SQL query that joins a few wide tables and contains complex
> logic. When I run it on Spark 2.4, the analysis phase takes 20 minutes; on
> Spark 3.1.1 it takes about 40 minutes.
> I need to set spark.sql.analyzer.maxIterations=1000 on Spark 3.1.1,
> or spark.sql.optimizer.maxIterations=1000 on Spark 2.4.
> There are no other special settings.
> In the Spark UI, no job is generated and no executor has active tasks. With
> the log level set to debug, I can see that the query is still in the analysis
> phase, resolving field references.
> This phase takes too long.
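
To make the ResolveReferences figures in the comment easier to compare, here is a minimal Python sketch that turns them into ratios. It rests on assumptions not stated in the thread: that the two column pairs mean "effective time / total time" and "effective runs / total runs" (the layout Spark's rule-timing dump normally uses), and that the times are in nanoseconds. The names `REPORTED`, `summarize` and `compare` are illustrative only.

```python
# Figures copied verbatim from the {noformat} block in the comment.
# ASSUMPTION: columns are "effective time / total time" and
# "effective runs / total runs", and times are in nanoseconds.
REPORTED = {
    "2.4": {"effective_ns": 46101271113, "total_ns": 47150195238,
            "effective_runs": 1415, "total_runs": 2368},
    "3.1": {"effective_ns": 113500482546, "total_ns": 115500482546,
            "effective_runs": 2300, "total_runs": 4124},
}


def summarize(metrics):
    """Convert raw counters into seconds and average milliseconds per run."""
    return {
        "total_seconds": metrics["total_ns"] / 1e9,
        "ms_per_run": metrics["total_ns"] / metrics["total_runs"] / 1e6,
    }


def compare(old, new):
    """Return how many times larger the new counters are than the old ones."""
    return {
        "time_ratio": new["total_ns"] / old["total_ns"],
        "runs_ratio": new["total_runs"] / old["total_runs"],
    }


if __name__ == "__main__":
    for version, metrics in REPORTED.items():
        s = summarize(metrics)
        print(f"{version}: {s['total_seconds']:.1f} s total, "
              f"{s['ms_per_run']:.1f} ms per run")
    r = compare(REPORTED["2.4"], REPORTED["3.1"])
    print(f"3.1 vs 2.4: {r['time_ratio']:.2f}x total time, "
          f"{r['runs_ratio']:.2f}x runs")
```

Under those assumptions, the rule is invoked roughly 1.74 times as often in 3.1 as in 2.4 and accounts for roughly 2.45 times as much total time, so both the number of runs and the cost per run appear to have grown.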