I am looking at using HOD (Hadoop On Demand) to manage a production
cluster. After reading the documentation It seems that HOD is missing
some things that would need to be carefully set in a production
cluster.

Rack Locality:
HOD uses the -N 5 option and starts a cluster of N nodes. There seems
to be no way to pass specific options to them individually. How can I
make sure the set of servers selected selected will end up in
different racks?

data node blacklist/white list
These are listed in a file can that file be generated?

hadoop-env
Can I set my these options from HOD or do I have to build them into
the hadoop tar.

JMX settings
Can I set my these options from HOD or do I have to build them into
the hadoop tar.

Upgrade with non symmetric configurations:

Old servers > /mnt/drive1  /mnt/drive2
New Servers > /mnt/drive1  /mnt/drive2  /mnt/drive3

Can HOD ship out different configuration files to different nodes? As
new nodes are joining the cluster for an upgrade they may have
different configurations then the old one.

>From reading the docs it seems like HOD is great for building on
demand clusters, but may not be ideal for managing a single permanent
long term cluster.

Accidental cluster destruction.
Sounds silly but might the wrong command take out a cluster in one
swipe. Possibly block this feature.

Any thoughts?

Reply via email to