Hi All,

I am porting a machine learning application on Hadoop using MapReduce. The
architecture of the application goes like this: 
1. run a number of server processes which take around 2-3 minutes to start
and then remain as daemon waiting for a client to call for a connection.
During the startup these server processes get trained on the trainng
dataset.

2. A client is then run which connects to servers and process or test any
data that it wants to. The client is basically our job, which we will be
converted to the mapreduce model of hadoop.

Now, since each server takes a good amount of time to start, needless to say
that we want each of these server processes to be pre-running on all the
tasktrackers(all nodes) so that when a mapreduce(client) task come to that
node, the servers are already running and the client just uses them for its
purpose. The server process keeps on running waiting for another map task
that may be assigned to that node.


That means, a server process is started on each node once and it waits for a
connection by a client. When clients( implemeted as map reduce) come to that
node they connect to the server, do they their processing and leave(or
finish).

Can you please tell me how should I go about starting the server on each
node. If I am not clear, please ask any questions. Any help in this regard
will be greatly appreciated.

Thank You!
Akhil

-- 
View this message in context: 
http://www.nabble.com/Implementing-CLient-Server-architecture-using-MapReduce-tp23916757p23916757.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.

Reply via email to