Truly I am just not understanding the concept of a segment.

Dennis Kubes wrote:
That got me started. I think I do not fully understand the role that the segments directory and its contents play. It looks like it holds parse text and parse data in map files, but what is the content folder (also a map file)? And are the segment contents still used once the index is created?

Dennis Kubes


Doug Cutting wrote:
NutchBean.getContent() and NutchBean.getParseData() do this, but require a HitDetails instance. In the non-distributed case, the only required field of the HitDetails for these calls is "url". In the distributed case, the "segment" field must also be provided, so that the request can be routed to a node serving that segment. These are implemented by FetchedSegments.java and DistributedSearch.java.

Doug
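
To picture the routing Doug describes, here is a toy model in Python. It is not the Nutch API: ToySegment, local_lookup, and distributed_lookup are hypothetical names, a hit's details are modeled as a plain dict, and scanning every segment in the local case is an assumption made only for illustration. The point is that a local lookup can get by with "url" alone, while a distributed lookup needs "segment" to pick the node to ask.

```python
from typing import Dict, List, Optional

# Hypothetical stand-in for a hit's details: field name -> value.
HitFields = Dict[str, str]


class ToySegment:
    """Toy stand-in for one segment: stored parse text keyed by URL."""

    def __init__(self, name: str, parse_text: Dict[str, str]) -> None:
        self.name = name
        self.parse_text = parse_text

    def get_parse_text(self, url: str) -> Optional[str]:
        return self.parse_text.get(url)


def local_lookup(segments: List[ToySegment], details: HitFields) -> Optional[str]:
    """Non-distributed case: only the "url" field is consulted.

    Assumption: every segment is on hand, so we simply scan them.
    """
    url = details["url"]
    for segment in segments:
        text = segment.get_parse_text(url)
        if text is not None:
            return text
    return None


def distributed_lookup(nodes: Dict[str, ToySegment], details: HitFields) -> Optional[str]:
    """Distributed case: "segment" selects the node that serves the data."""
    if "segment" not in details:
        raise KeyError("distributed lookup needs the 'segment' field to route the request")
    node = nodes[details["segment"]]
    return node.get_parse_text(details["url"])


if __name__ == "__main__":
    # Made-up segment name and URL, purely for demonstration.
    seg = ToySegment("segment-a", {"http://example.com/": "stored parse text"})
    print(local_lookup([seg], {"url": "http://example.com/"}))
    print(distributed_lookup({"segment-a": seg},
                             {"url": "http://example.com/", "segment": "segment-a"}))
```

In this sketch, leaving "segment" out of the details in the distributed case raises an error instead of guessing which node to ask.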

Dennis Kubes wrote:
Can somebody direct me on how to get the stored text and parse metadata for a given url?

Dennis


_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general