Hi Морозов,

It's a directory containing one or more Hadoop MapFiles, which store 
key/value pairs. Hadoop's Text class is the key and Nutch's Content class is 
the value. The easiest way to process the files is with the Hadoop libraries.

http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/protocol/Content.java?view=markup
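
If you want to check from outside Nutch what a given file holds, here is a minimal sketch in Python that parses only the header of a Hadoop SequenceFile (a MapFile is a directory with a `data` and an `index` file, both of which are SequenceFiles). It assumes the version 6 header layout and does not decode any records; the function names are my own, not part of Hadoop or Nutch, and the sample bytes are a synthetic header rather than a real segment file.

```python
import io
import struct


def read_exact(stream, n):
    """Read exactly n bytes, failing loudly on a truncated file."""
    data = stream.read(n)
    if len(data) != n:
        raise ValueError("unexpected end of file")
    return data


def read_vint(stream):
    """Decode a non-negative Hadoop variable-length int (WritableUtils layout)."""
    first = struct.unpack("b", read_exact(stream, 1))[0]  # signed byte
    if first >= 0:
        return first  # 0..127 fit in a single byte
    if -120 <= first <= -113:
        extra = -112 - first  # number of big-endian bytes that follow
        return int.from_bytes(read_exact(stream, extra), "big")
    raise ValueError("negative vint is not valid as a string length")


def read_text(stream):
    """Read a string written as vint length followed by UTF-8 bytes."""
    return read_exact(stream, read_vint(stream)).decode("utf-8")


def read_sequence_file_header(stream):
    """Return the key/value class names and compression flags from the header."""
    magic = read_exact(stream, 4)
    if magic[:3] != b"SEQ":
        raise ValueError("not a Hadoop SequenceFile")
    version = magic[3]
    if version != 6:
        # Assumption: only the version 6 layout is handled in this sketch.
        raise ValueError("unsupported SequenceFile version: %d" % version)
    key_class = read_text(stream)
    value_class = read_text(stream)
    compressed = read_exact(stream, 1) != b"\x00"
    block_compressed = read_exact(stream, 1) != b"\x00"
    codec = read_text(stream) if compressed else None
    return {
        "version": version,
        "key_class": key_class,
        "value_class": value_class,
        "compressed": compressed,
        "block_compressed": block_compressed,
        "codec": codec,
    }


def make_sample_header():
    """Build a synthetic, uncompressed header for demonstration purposes."""
    key = b"org.apache.hadoop.io.Text"
    value = b"org.apache.nutch.protocol.Content"
    return (b"SEQ\x06" + bytes([len(key)]) + key
            + bytes([len(value)]) + value + b"\x00\x00")


if __name__ == "__main__":
    print(read_sequence_file_header(io.BytesIO(make_sample_header())))
```

To try it on a real segment, open the `data` file in binary mode and pass the file object to `read_sequence_file_header`; in a typical layout that file sits under something like `content/part-00000/data`, but check your own segment directory. Decoding the records themselves means reproducing the serialization in Content.java linked above, which is where using Hadoop directly saves the most work.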

Cheers,
Markus
 
 
-----Original message-----
> From:Морозов Евгений <ant...@yandex.ru>
> Sent: Sat 27-Oct-2012 18:32
> To: user@nutch.apache.org
> Subject: Format of "content" file in segments?
> 
> Where can I find the format of the content file in a segment directory?
> Either source code or documentation. I'm looking at reading it with a
> program external to nutch.
> 
> regards, keanta
> 
