Hi,

I'm having some trouble implementing the following. I have a large input
list of documents that I want to compute statistics on. For each document
there are 12 different stats to work out.

So my input file is a text file with one document filepath on each line. The
documents are stored on a remote server. I want to fetch each document and
calculate certain stats from it.

My problem is with the output.

I want my output to be similar to this:

docname1,stat1,stat2,stat3,....stat12
docname2,stat1,stat2,stat3,....stat12
docname3,stat1,stat2,stat3,....stat12
.
.
.
docnameN,stat1,stat2,stat3,....stat12

I can fetch the document in my map code and perform my stats calculation on
it, but I don't know how to return more than one value for a key, the key in
this case being the document name.
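
To make the question concrete, here is a minimal sketch of the shape I have
in mind, written in Python purely for illustration. It assumes that packing
all 12 stats into one comma-joined string is acceptable, so the map step
still emits a single value per key. The names compute_stats, map_document
and format_row are hypothetical, and the stats are placeholders rather than
my real calculations.

```python
def compute_stats(text):
    """Hypothetical stand-in for the 12 real statistics."""
    words = text.split()
    # Placeholder values only: two simple counts padded out to 12 fields.
    return [len(text), len(words)] + [0] * 10


def map_document(docname, text):
    """Emit one (key, value) pair; the value packs every stat into one string."""
    stats = compute_stats(text)
    return docname, ",".join(str(s) for s in stats)


def format_row(key, value):
    """Build one output line in the docname,stat1,...,stat12 layout."""
    return f"{key},{value}"


if __name__ == "__main__":
    key, value = map_document("docname1", "some example document text")
    print(format_row(key, value))
```

Whether a single delimited value like this is the right approach, or whether
there is a proper way to emit several values for one key, is what I'm unsure
about.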

Cheers,
John
