> Since the hadoop MR streaming job requires the file to be processed to be > present in HDFS, > I was thinking whether can it get directly from mongodb instead of me > manually fetching it > and placing it in a directory before submitting the hadoop job?
Hadoop M/R can get data directly from Cassandra. See CqlInputFormat. ~mck