Re: use additional ebs volumes for hsdf storage with spark-ec2

Daniel Mahler Thu, 30 Oct 2014 21:58:04 -0700

Thanks Akhil. I tried changing /root/ephemeral-hdfs/conf/hdfs-site.xml to
have


  <property>
    <name>dfs.data.dir</name>

<value>/vol,/vol0,/vol1,/vol2,/vol3,/vol4,/vol5,/vol6,/vol7,/mnt/ephemeral-hdfs/data,/mnt2/ephemeral-hdfs/data</value>
  </property>

and then running

/root/ephemeral-hdfs/bin/stop-all.sh
copy-dir  /root/ephemeral-hdfs/conf/
/root/ephemeral-hdfs/bin/start-all.sh

to try and make sure the new configurations taks on the entire cluster.
I then ran spark to write to the local hdfs.
It failed after filling the original /mnt* mounted drives,,
without writing anything to the attached /vol* drives.

I also tried completely stopping and restarting the cluster,
but restarting resets /root/ephemeral-hdfs/conf/hdfs-site.xml to the
default state.

thanks
Daniel



On Thu, Oct 30, 2014 at 1:56 AM, Akhil Das <ak...@sigmoidanalytics.com>
wrote:

> I think you can check in the core-site.xml or hdfs-site.xml file under
> /root/ephemeral-hdfs/etc/hadoop/ where you can see data node dir property
> which will be a comma separated list of volumes.
>
> Thanks
> Best Regards
>
> On Thu, Oct 30, 2014 at 5:21 AM, Daniel Mahler <dmah...@gmail.com> wrote:
>
>> I started my ec2 spark cluster with
>>
>>     ./ec2/spark---ebs-vol-{size=100,num=8,type=gp2} -t m3.xlarge -s 10
>> launch mycluster
>>
>> I see the additional volumes attached but they do not seem to be set up
>> for hdfs.
>> How can I check if they are being utilized on all workers,
>> and how can I get all workers to utilize the extra volumes for hdfs.
>> I do not have experience using hadoop directly, only through spark.
>>
>> thanks
>> Daniel
>>
>
>

Re: use additional ebs volumes for hsdf storage with spark-ec2

Reply via email to