Re: Help getting started

2012-04-22 Thread Markus Jelsma
h it.

> * How do I pause and then resume crawling? How do I check the status of a crawl?

You cannot pause or resume a crawl safely. You can view the status in the Hadoop web GUI.

> Thanks,
> Ben
>
> --
> View this message in context:
> http://lu

Re: Help getting started

2012-04-22 Thread benmccann
>> ent and the parsed content. A generated segment contains one K/V database with URLs to crawl. The other DBs are created by the fetcher and the parser.
>>
>> > * What is the format of the output and how do I consume the output of the

Re: Help getting started

2012-04-22 Thread Markus Jelsma
//lucene.472066.n3.nabble.com/Help-getting-started-tp3929132p3929132.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Help getting started

2012-04-21 Thread benmccann
s the format of the output and how do I consume the output of the crawl to do something useful with it?
* How do I pause and then resume crawling? How do I check the status of a crawl?

Thanks,
Ben

--
View this message in context:
http://lucene.472066.n3.nabble.com/Help-getting-started-tp3929132p39