Hello, we all know that Lucene supports, among others, boolean queries. Even though Nutch is built on Lucene, boolean clauses are removed by Nutch filters so boolean queries end up as "flat" queries where terms are implicitly connected by an OR operator, as far as I can see.
Is there any simple way to turn off the filtering so a boolean query remains as such after it is submitted to Nutch? Just in case a simple way doesn't exist, Ravi Chintakunta suggests the following workaround: "We have to modify the analyzer and add more plugins to Nutch to use the Lucene's query syntax. Or we have to directly use Lucene's Query Parser. I tried the second approach by modifying org.apache.nutch.searcher.IndexSearcher and that seems to work." Can anyone please elaborate on what Ravi actually means by "modifying org.apache.nutch.searcher.IndexSearcher"? Which methods are supposed to be modified and how? It would be really nice to know how to do this. I believe many other Nutch users would also benefit from an answer to this question. Thanks so much, Cristina ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
