Hi, I was looking at the documentation for deploying Spark cluster on EC2. http://spark.apache.org/docs/latest/ec2-scripts.html We are using Pig to build the data pipeline and then use MLLib for analytics. I was wondering if someone has any experience to include additional tools/services such as Pig/Hadoop in the above deployment script?
- Spark cluster set up on EC2 customization Sameer Tilak
- Re: Spark cluster set up on EC2 customization Akhil Das
- RE: Spark cluster set up on EC2 customization Sameer Tilak