[Nutch-general] Re: How to get Text and Parse data for URL

Dennis Kubes Tue, 25 Apr 2006 13:41:58 -0700

That got me started. I think that I am not fully understanding the rolethe segments directory and its contents play. It looks like it holdsparse text and parse data in map files, but what is the content folder(also a map file)? And is the segments contents used once the index iscreated?


Dennis Kubes



Doug Cutting wrote:

NutchBean.getContent() and NutchBean.getParseData() do this, butrequire a HitDetails instance. In the non-distributed case, the onlyrequired field of the HitDetails for these calls is "url". In thedistributed case, the "segment" field must also be provided, so thatthe request can be routed to a node serving that segment. These areimplemented by FetchedSegments.java and DistributedSearch.java.
Doug

Dennis Kubes wrote:
Can somebody direct me on how to get the stored text and parsemetadata for a given url?
Dennis



-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

[Nutch-general] Re: How to get Text and Parse data for URL

Reply via email to