Error starting EC2 cluster

2014-05-04 Thread Aliaksei Litouka
I am using Spark 0.9.1. When I'm trying to start an EC2 cluster with the spark-ec2 script, an error occurs and the following message is issued: AttributeError: 'module' object has no attribute 'check_output'. By this time, the EC2 instances are up and running, but Spark doesn't seem to be installed on th

Re: Error starting EC2 cluster

2014-05-16 Thread Aliaksei Litouka
Well... the reason was an out-of-date version of Python (2.6.6) on the machine where I ran the script. If anyone else experiences this issue, just update your Python. On Sun, May 4, 2014 at 7:51 PM, Aliaksei Litouka wrote: > I am using Spark 0.9.1. When I'm trying to start an EC2 clus
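A note on why updating Python helps: `subprocess.check_output` was added in Python 2.7, so on Python 2.6.6 any call to it raises exactly this AttributeError. The sketch below is a rough illustration, not taken from the spark-ec2 script itself; the helper names `has_check_output` and `check_output_compat` are made up for this example. It shows how to detect the missing function and fall back to `subprocess.Popen`.

```python
import subprocess
import sys


def has_check_output():
    """True when this interpreter provides subprocess.check_output (Python >= 2.7)."""
    return hasattr(subprocess, "check_output")


def check_output_compat(cmd):
    """Return the stdout of cmd as bytes.

    Hypothetical helper for illustration: uses subprocess.check_output when
    it exists, otherwise emulates it with Popen (the Python 2.6 situation).
    """
    if has_check_output():
        return subprocess.check_output(cmd)
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE)
    out, _ = proc.communicate()
    if proc.returncode != 0:
        # Mirror check_output's behaviour on a non-zero exit status.
        raise subprocess.CalledProcessError(proc.returncode, cmd)
    return out


if __name__ == "__main__":
    print("Python version: %s" % sys.version.split()[0])
    print("check_output available: %s" % has_check_output())
```

Running this with the interpreter that launches the script shows quickly whether the local Python is too old; upgrading Python, as described above, remains the simpler fix.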

How to specify executor memory in EC2 ?

2014-06-10 Thread Aliaksei Litouka
I am testing my application in an EC2 cluster of m3.medium machines. By default, only 512 MB of memory on each machine is used. I want to increase this amount and I'm trying to do it by passing the --executor-memory 2G option to the spark-submit script, but it doesn't seem to work - each machine uses only

An attempt to implement dbscan algorithm on top of Spark

2014-06-12 Thread Aliaksei Litouka
find it useful. Or maybe someone will want to join development. The application is available at https://github.com/alitouka/spark_dbscan Any questions, comments, suggestions, as well as criticism are welcome :) Best regards, Aliaksei Litouka

Re: How to specify executor memory in EC2 ?

2014-06-12 Thread Aliaksei Litouka
et it to 512, > and is overriding the application’s settings. Take a look in there and > delete that line if possible. > > Matei > > On Jun 10, 2014, at 2:38 PM, Aliaksei Litouka > wrote: > > > I am testing my application in an EC2 cluster of m3.medium machines. By

Re: How to specify executor memory in EC2 ?

2014-06-12 Thread Aliaksei Litouka
> Are you launching this using our EC2 scripts? Or have you set up a cluster > by hand? > > Matei > > On Jun 12, 2014, at 2:32 PM, Aliaksei Litouka > wrote: > > spark-env.sh doesn't seem to contain any settings related to memory size > :( I will continue searching fo

Re: An attempt to implement dbscan algorithm on top of Spark

2014-06-12 Thread Aliaksei Litouka
RDD[Double,Double] as well, instead of just a file. > > val data = IOHelper.readDataset(sc, "/path/to/my/data.csv") > > And other distance measures of course. > > Thanks, > Vipul > > > > > On Jun 12, 2014, at 2:31 PM, Aliaksei Litouka > wrote: >

Re: How to specify executor memory in EC2 ?

2014-06-13 Thread Aliaksei Litouka
> that's pretty odd. > > > On Thu, Jun 12, 2014 at 6:02 PM, Aliaksei Litouka < > aliaksei.lito...@gmail.com> wrote: > >> Yes, I am launching a cluster with the spark_ec2 script. I checked >> /root/spark/conf/spark-env.sh on the master node and on slaves and