Andrzej Bialecki wrote:
>
> [EMAIL PROTECTED] wrote:
>
> >Is there a way to match k out of n phrases in a query? We can write
> >this ourselves but I thought to ask if anyone has already implemented
> >it.
> >
> >E.g., if the query is
> >
> >"italian restaurants" "New York" Manhattan Broadway
> >
> >but no document matches all of these, we would like to get documents
> >that match 3 out of 4 of the query phrases, e.g.,
> >
> >"italian restaurants" "New York" Manhattan
> >
> >or
> >
> >"italian restaurants" Manhattan Broadway
> >
>
> What you describe is an OR query. It is currently not supported, but (as
> I wrote before) the underlying index supports this, so it's just a
> matter of extending the query syntax parser / translator.

It's not exactly an "or" query. It would be equivalent to an "or" if
there were only two terms involved.

Example query:

A B C D E

Matching documents:

A B C D E    score = 5
A B D E      score = 4
A C D E      score = 4
A C E        score = 3
A D          score = 2
etc.

We will probably end up implementing such a query for our Q&A project.

--
Drago

>
> --
> Best regards,
> Andrzej Bialecki
>
> -------------------------------------------------
> Software Architect, System Integration Specialist
> CEN/ISSS EC Workshop, ECIMF project chair
> EU FP6 E-Commerce Expert/Evaluator
> -------------------------------------------------
> FreeBSD developer (http://www.freebsd.org)
>
> -------------------------------------------------------
> The SF.Net email is sponsored by EclipseCon 2004
> Premiere Conference on Open Tools Development and Integration
> See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
> http://www.eclipsecon.org/osdn
> _______________________________________________
> Nutch-developers mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
>

--
Dragomir R. Radev                                  [EMAIL PROTECTED]
Assistant Professor of Information, Electrical Engineering and
Computer Science, and Linguistics, the University of Michigan, Ann Arbor
Phone: 734-615-5225 Fax: 734-764-2475
http://www.si.umich.edu/~radev
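
For illustration only, the scoring in the A B C D E example can be sketched in a few lines of Python. This is a plain in-memory sketch, not Nutch's query parser or index API: the names `tokenize`, `count_matches`, and `k_of_n_search` are hypothetical, phrases are matched as contiguous word sequences, and the score is simply the number of query phrases a document contains.

```python
import re
from typing import Dict, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def contains_phrase(tokens: List[str], phrase_tokens: List[str]) -> bool:
    """True if phrase_tokens occurs as a contiguous run inside tokens."""
    n = len(phrase_tokens)
    if n == 0:
        return False
    return any(tokens[i:i + n] == phrase_tokens
               for i in range(len(tokens) - n + 1))


def count_matches(phrases: List[str], text: str) -> int:
    """Score = number of query phrases that appear in the text."""
    tokens = tokenize(text)
    return sum(1 for p in phrases if contains_phrase(tokens, tokenize(p)))


def k_of_n_search(phrases: List[str], docs: Dict[str, str],
                  k: int) -> List[Tuple[str, int]]:
    """Return (doc_id, score) pairs with score >= k, best score first.

    Ties are broken by doc_id so the ordering is deterministic.
    """
    scored = [(doc_id, count_matches(phrases, text))
              for doc_id, text in docs.items()]
    hits = [(doc_id, score) for doc_id, score in scored if score >= k]
    hits.sort(key=lambda pair: (-pair[1], pair[0]))
    return hits


if __name__ == "__main__":
    # Hypothetical toy collection mirroring the example in the message.
    docs = {
        "d1": "A B C D E",
        "d2": "A B D E",
        "d3": "A C D E",
        "d4": "A C E",
        "d5": "A D",
    }
    for doc_id, score in k_of_n_search(["A", "B", "C", "D", "E"], docs, k=3):
        print(doc_id, "score =", score)
```

With k = 1 this behaves like a plain "or" query ranked by match count, and with k = n it behaves like an "and" query; the values in between are the case asked about in the original question.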
