Re: How to broadcast a variable read from a file in yarn-cluster mode?

Sandy Ryza Thu, 05 Feb 2015 17:08:12 -0800

Hi Jon,

You'll need to put the file on HDFS (or whatever distributed filesystem
you're running on) and load it from there.


-Sandy

On Thu, Feb 5, 2015 at 3:18 PM, YaoPau <jonrgr...@gmail.com> wrote:

> I have a file "badFullIPs.csv" of bad IP addresses used for filtering.  In
> yarn-client mode, I simply read it off the edge node, transform it, and
> then
> broadcast it:
>
>       val badIPs = fromFile(edgeDir + "badfullIPs.csv")
>       val badIPsLines = badIPs.getLines
>       val badIpSet = badIPsLines.toSet
>       val badIPsBC = sc.broadcast(badIpSet)
>       badIPs.close
>
> How can I accomplish this in yarn-cluster mode?
>
> Jon
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/How-to-broadcast-a-variable-read-from-a-file-in-yarn-cluster-mode-tp21524.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
>
>

Re: How to broadcast a variable read from a file in yarn-cluster mode?

Reply via email to