One trick would be to search on a URL, explain link shows what segments it belongs to, say 1200604211450.
Then using segread command (this works for 0.7.2) bin/nutch segread -dumpsort -nocontent segments/1200604211450 That shows text, parse data for a URL. Thanks P -----Original Message----- From: Dennis Kubes [mailto:[EMAIL PROTECTED] Sent: Wednesday, April 26, 2006 1:42 AM To: [email protected] Subject: How to get Text and Parse data for URL Can somebody direct me on how to get the stored text and parse metadata for a given url? Dennis ------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid0709&bid&3057&dat1642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
