Hi,

When I was using nutch 0.7, I found the bin/nutch fetchlist -dumpurls
command to be very useful. However, I have not been able to find an
equivalent command in nutch 0.8.x.

Essentially all I want to do is dump all urls stored in a certain segment
(or group of segments) into a text file.

In nutch 0.7.x I would call a command like this:

*$ bin/nutch org.apache.nutch.pagedb.FetchListEntry -dumpurls $s1 >foo.txt

*Any suggestions for how this can be accomplished in nutch 0.8.x are very
much appreciated.

Thanks,
Bryan
-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to