You can easily add a function (say setup_pig) to this script and call it
from the existing setup_cluster function:
<https://github.com/apache/spark/blob/master/ec2/spark_ec2.py#L649>
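
A rough sketch of what such a function could look like, untested against a
real cluster. The Pig version, the download URL, the /root/pig install
directory and the run_remote callable are all my assumptions for
illustration; none of them come from spark_ec2.py itself.

```python
import shlex

# Assumed values for illustration only; adjust to the release and mirror you use.
PIG_VERSION = "0.14.0"
PIG_TARBALL_URL = (
    "https://archive.apache.org/dist/pig/pig-%s/pig-%s.tar.gz"
    % (PIG_VERSION, PIG_VERSION)
)


def pig_install_commands(url=PIG_TARBALL_URL, install_dir="/root/pig"):
    """Return the shell commands that download and unpack Pig into install_dir."""
    quoted_dir = shlex.quote(install_dir)
    return [
        "mkdir -p %s" % quoted_dir,
        # Stream the tarball straight into tar, dropping the top-level folder.
        "curl -fsSL %s | tar xz -C %s --strip-components=1"
        % (shlex.quote(url), quoted_dir),
    ]


def setup_pig(master, run_remote, install_dir="/root/pig"):
    """Install Pig on the master node.

    run_remote(host, command) is a hypothetical callable that runs one shell
    command on the given host over SSH.
    """
    for command in pig_install_commands(install_dir=install_dir):
        run_remote(master, command)
```

Inside setup_cluster you would then call setup_pig with the master's hostname
and a small wrapper around whatever SSH helper the script already uses; check
the helper's exact signature in your checkout before wiring it in.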

Thanks
Best Regards

On Thu, Feb 26, 2015 at 7:08 AM, Sameer Tilak <ssti...@live.com> wrote:

>  Hi,
>
> I was looking at the documentation for deploying a Spark cluster on EC2.
> http://spark.apache.org/docs/latest/ec2-scripts.html
>
> We are using Pig to build the data pipeline and then use MLlib for
> analytics. I was wondering if anyone has experience including
> additional tools/services such as Pig/Hadoop in the above deployment
> script?
>
>
