Thanks, Harsh. I have another question, though. You mentioned that:
The client needs access to "the DataNodes (for actually writing the previous files to DFS for the JobTracker to pick up)".

What do you mean by "previous files"? It seems that, if I were designing Hadoop from scratch, I wouldn't want to force the client to communicate with DataNodes at all, since those can be added and removed during a job.

Jay Vyas
MMSB
UCHC

On Apr 21, 2012, at 1:14 AM, Harsh J <ha...@cloudera.com> wrote:

> the
> DataNodes (for actually writing the previous files to DFS for the
> JobTracker to pick up)