hello,
i am developing a tool that will do some analysis tasks using hadoop
map/reduce on a cluster
the tool user interfaces will be run on the client windows system and
should run the analysis tasks as map/reduce jobs on a hadoop cluster
(configured by the user).
my question is how to run hadoop jobs on a cluster from a client machine
(other than the master) from inside java code.
I know that I should have a hadoop installation on the client that
should be configured to point to the cluster's master, but I am not sure
how to do it.
another necessity for my tool would be to copy files from the local
client file system to the HDFS on the cluster. I am also not sure if I
can access the HDFS of the cluster from a client machine using java code.
hope anybody could give me some hints
thanks,
Deyaa