Hi JJ,

Considering that the same query takes only 3 minutes on Spark SQL, some query options may be set incorrectly. Try running "set;" in impala-shell and checking all query options, e.g.:

BATCH_SIZE: [0]
DISABLE_CODEGEN: [0]
RUNTIME_FILTER_MODE: GLOBAL
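If it helps, the output of "set;" can be pasted into a small script to list the options that differ from their defaults. This is only a rough sketch: it assumes each option is printed as "NAME: [value]" when it is at its default and as "NAME: value" when it has been set explicitly, as in the sample lines above. The function names parse_query_options and non_default_options are made up for illustration and are not part of Impala.

```python
import re

# Assumed line format (taken from the sample above, not from an Impala spec):
#   "BATCH_SIZE: [0]"              -> value in brackets, treated as a default
#   "RUNTIME_FILTER_MODE: GLOBAL"  -> no brackets, treated as explicitly set
OPTION_LINE = re.compile(r"^\s*([A-Z][A-Z0-9_]*):\s*(\[(.*)\]|(.*?))\s*$")


def parse_query_options(set_output):
    """Return {option_name: (value, is_default)} from pasted "set;" output."""
    options = {}
    for line in set_output.splitlines():
        match = OPTION_LINE.match(line)
        if not match:
            continue  # skip headers, blank lines, and anything unrecognized
        name = match.group(1)
        bracketed = match.group(3)
        if bracketed is not None:
            options[name] = (bracketed, True)
        else:
            options[name] = (match.group(4).strip(), False)
    return options


def non_default_options(options):
    """Keep only the options whose value was not shown in brackets."""
    return {
        name: value
        for name, (value, is_default) in options.items()
        if not is_default
    }


# Sample input: the three lines quoted in this message.
sample = """
BATCH_SIZE: [0]
DISABLE_CODEGEN: [0]
RUNTIME_FILTER_MODE: GLOBAL
"""

options = parse_query_options(sample)
non_defaults = non_default_options(options)
print(non_defaults)
```

Any option reported here is worth comparing against the settings used for the Spark SQL run before looking deeper into the profile.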
Just a guess, thanks.

On 27/10/2017 10:25, 俊杰陈 wrote:

The profile file is damaged. Here is a screenshot of the exec summary:
[inline image: exec summary screenshot]

2017-10-27 10:04 GMT+08:00 俊杰陈 <cjjnj...@gmail.com>:

Hi Devs,

I ran into a performance issue with a big table join. The query takes more than 3 hours on Impala but only 3 minutes on Spark SQL on the same 5-node cluster. While the query is running, the left scanner and the exchange node are very slow. Did I miss some key arguments? You can see the profile file in the attachment.
[inline image]

--
Thanks & Best Regards

--
Regards,
Hongxu.