Michael Böckling wrote: > Hi! > > Hi, > I know there is a MultiSearcher class, but it seems that Nutch is using a > very different index layout than Lucene, or am I wrong here? Nutch uses lucene as an inverted index. Lucene does not have an index structure. You create the structure (I mean the fields) using lucene. Nutch stores some default fields in the index as well as extra fields from index plugins. You can check out the structure of the index from the wiki : http://wiki.apache.org/nutch/IndexStructure
What you should do is to compare the structure nutch uses with the structure you use, and somehow combine the two. In most of the fields, you sould converge to the nutch version. Other than that, once index the index is created from nutch, it is lucene stuff. You can merge the indexes or run a MultiSearcher, or open seperate DistributedSearch$Clients and combine the results from seperate indexes on the fly. However there is an issue about summaries. Do you intend to use them? > My end goal is > a list of results with the most relevant hits from both indexes at the top > positions. > > How would you go about this? > Thanks a lot for your input! > > Regards, > > Michael > > ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
