Yes, the driver has to be able to accept incoming connections. All the
executors connect back to the driver to send heartbeats, map statuses, and
metrics. This is critical, and I don't know of a way around it. You could
look into something like spark-jobserver
(https://github.com/spark-jobserver/spark-jobserver), which could run
outside the firewall. Then, from inside the firewall, you can make REST
calls to the server.
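
To illustrate the REST approach, here is a rough Python sketch of a job
submission made from inside the firewall. It only opens an outgoing HTTP
connection, so nothing needs to connect back to your box. I am assuming the
job server's POST /jobs endpoint with appName and classPath query
parameters, as described in the spark-jobserver README; the host name, port,
app name, class name, and job config in the usage note are placeholders, so
check them against your own deployment.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_job_request(server_url, app_name, class_path, job_config):
    """Build (but do not send) a POST request asking the job server to run a job.

    Assumes the endpoint shape POST /jobs?appName=...&classPath=... with the
    job configuration passed as the request body.
    """
    query = urlencode({"appName": app_name, "classPath": class_path})
    return Request(
        url=f"{server_url.rstrip('/')}/jobs?{query}",
        data=job_config.encode("utf-8"),
        method="POST",
    )


def submit_job(server_url, app_name, class_path, job_config):
    """Send the request and return the server's response body as text."""
    request = build_job_request(server_url, app_name, class_path, job_config)
    with urlopen(request) as response:
        return response.read().decode("utf-8")
```

With a job jar already uploaded to the server under the app name "test",
a call would look something like
submit_job("http://jobserver.example.com:8090", "test",
"spark.jobserver.WordCountExample", "input.string = a b c a b see").
The driver then runs next to the job server, outside the firewall, where
the executors can reach it.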

On Thu, Feb 5, 2015 at 5:03 PM, Kane Kim <kane.ist...@gmail.com> wrote:

> I submit a Spark job from a machine behind a firewall, and I can't open
> any incoming connections to that box. Does the driver absolutely need to
> accept incoming connections? Is there any workaround for that case?
>
> Thanks.
>
