Extend TextInputFormat and change the behavior any way you like.

You can look at org.apache.hadoop.mapred.KeyValueTextInputFormat for an
example. 


On 9/19/07 6:58 AM, "贺皓(He Hao)" <[EMAIL PROTECTED]> wrote:

> Hi Everyone,
>   I'm a newer to hadoop. I want to write a mapreduce program to implement the
> inverted index. My question is which input format should I use? It seems that
> the TextInputFormat's key is the offset of each line in the document. How can
> I get the document name(as the document id) that the map function processes?
>    
>   Thanks!
>    
>   He Hao
> 
>        
> ---------------------------------
>  @yahoo.cn 新域名、无限量,快来抢注!

Reply via email to