Maybe you could try “--conf spark.sql.statistics.fallBackToHdfs=true"
On 2019/05/11 01:54:27, V0lleyBallJunki3 wrote:
> Hello,>
>I have set spark.sql.autoBroadcastJoinThreshold=1GB and I am running the>
> spark job. However, my application is failing with:>
>
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)>
> at>
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)>
>
> at>
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)>
>
> at java.lang.reflect.Method.invoke(Method.java:498)>
> at>
> org.apache.spark.deploy.yarn.ApplicationMaster$$anon$2.run(ApplicationMaster.scala:678)>
>
> Caused by: org.apache.spark.SparkException: Cannot broadcast the table that>
> is larger than 8GB: 8 GB>
> at>
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1$$anonfun$apply$1.apply(BroadcastExchangeExec.scala:103)>
>
> at>
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1$$anonfun$apply$1.apply(BroadcastExchangeExec.scala:76)>
>
> at>
> org.apache.spark.sql.execution.SQLExecution$$anonfun$withExecutionId$1.apply(SQLExecution.scala:101)>
>
> at>
> org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:125)>
>
> at>
> org.apache.spark.sql.execution.SQLExecution$.withExecutionId(SQLExecution.scala:98)>
>
> at>
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1.apply(BroadcastExchangeExec.scala:75)>
>
> at>
> org.apache.spark.sql.execution.exchange.BroadcastExchangeExec$$anonfun$relationFuture$1.apply(BroadcastExchangeExec.scala:75)>
>
> at>
> scala.concurrent.impl.Future$PromiseCompletingRunnable.liftedTree1$1(Future.scala:24)>
>
> at>
> scala.concurrent.impl.Future$PromiseCompletingRunnable.run(Future.scala:24)>
> at>
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)>
>
> at>
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)>
>
> at java.lang.Thread.run(Thread.java:748)>
>
> When I am running with a limit 1 GB how can I hit the 8 GB limit? I made>
> sure in the Spark History Server as well by printing out the value of>
> spark.sql.autoBroadcastJoinThreshold that the value is correctly set and>
> explain plan also shows that it is trying to do a Broadcast Join. Any ideas?
> >
>
>
>
> -->
> Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/>
>
> ->
> To unsubscribe e-mail: user-unsubscr...@spark.apache.org>
>
>
-
To unsubscribe e-mail: user-unsubscr...@spark.apache.org