Yarn client is much like Spark client mode, except that the
executors are running in Yarn containers managed by the Yarn
resource manager on the cluster instead of as Spark workers managed
by the Spark master. The driver executes as a local client in your
local JVM. It communicates with the workers on the cluster.
Transformations are scheduled on the cluster by the driver's logic.
Actions involve communication between local driver and remote
cluster executors. So, there is some additional network overhead,
especially if the driver is not co-located on the cluster. In
yarn-cluster mode -- in contrast, the driver is executed as a thread
in a Yarn application master on the cluster. In either case, the assembly JAR must be available to the application on the cluster. Best to copy it to HDFS and specify its location by exporting its location as SPARK_JAR. Kevin Markey On 06/19/2014 11:22 AM, Koert Kuipers
wrote:
|
- trying to understand yarn-client mode Koert Kuipers
- Re: trying to understand yarn-client mode Kevin Markey
- Re: trying to understand yarn-client mode DB Tsai
- Re: trying to understand yarn-client mode Koert Kuipers
- Re: trying to understand yarn-client mode DB Tsai
- Re: trying to understand yarn-client ... Koert Kuipers
- Re: trying to understand yarn-client mode Marcelo Vanzin
- Re: trying to understand yarn-client mode Koert Kuipers
- Re: trying to understand yarn-client mode Marcelo Vanzin