hello,

i am developing a tool that will do some analysis tasks using hadoop map/reduce on a cluster

the tool user interfaces will be run on the client windows system and should run the analysis tasks as map/reduce jobs on a hadoop cluster (configured by the user).

my question is how to run hadoop jobs on a cluster from a client machine (other than the master) from inside java code. I know that I should have a hadoop installation on the client that should be configured to point to the cluster's master, but I am not sure how to do it.

another necessity for my tool would be to copy files from the local client file system to the HDFS on the cluster. I am also not sure if I can access the HDFS of the cluster from a client machine using java code.

hope anybody could give me some hints

thanks,

Deyaa

Reply via email to