Hi Морозов, It's a directory containing Hadoop map file(s) that stores key/value pairs. Hadoop Text class is the key and Nutch' Content class is the value. You would need Hadoop to easily process the files
http://svn.apache.org/viewvc/nutch/trunk/src/java/org/apache/nutch/protocol/Content.java?view=markup Cheers, Markus -----Original message----- > From:Морозов Евгений <ant...@yandex.ru> > Sent: Sat 27-Oct-2012 18:32 > To: user@nutch.apache.org > Subject: Format of "content" file in segments? > > Where can I find the format of the content file in a segment directory? > Either source code or documentation. I'm looking at reading it with a > program external to nutch. > > regards, keanta >