Andrzej Bialecki wrote:
> 
> [EMAIL PROTECTED] wrote:
> 
> >Is there a way to match k out of n phrases in a query? We can write
> >this ourselves but I thought to ask if anyone has already implemented
> >it. 
> >
> >E.g., if the query is
> >
> >"italian restaurants" "New York" Manhattan Broadway
> >
> >but no document matches all of these, we would like to get documents
> >that match 3 out of 4 of the query phrases, e.g.,
> >
> >"italian restaurants" "New York" Manhattan
> >
> >or
> >
> >"italian restaurants" Manhattan Broadway
> >  
> >
> 
> What you describe is an OR query. It is currently not supported, but (as 
> I wrote before) the underlying index supports this, so it's just a 
> matter of extending the query syntax parser / translator.

It's not exactly an "or" query. It would be equivalent to an "or" if
there were only two terms involved.

Example query: 

A B C D E

Matching documents:

A B C D E   score = 5
A B D E     score = 4
A C D E     score = 4
A C E       score = 3
A D         score = 2

etc.

We will probably end up implementing such a query for our Q&A project.

--
Drago

> 
> -- 
> Best regards,
> Andrzej Bialecki
> 
> -------------------------------------------------
> Software Architect, System Integration Specialist
> CEN/ISSS EC Workshop, ECIMF project chair
> EU FP6 E-Commerce Expert/Evaluator
> -------------------------------------------------
> FreeBSD developer (http://www.freebsd.org)
> 
> 
> 
> 
> -------------------------------------------------------
> The SF.Net email is sponsored by EclipseCon 2004
> Premiere Conference on Open Tools Development and Integration
> See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
> http://www.eclipsecon.org/osdn
> _______________________________________________
> Nutch-developers mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
> 


-- 
Dragomir R. Radev                                         [EMAIL PROTECTED]
Assistant Professor of Information, Electrical Engineering and
Computer Science, and Linguistics, the University of Michigan, Ann Arbor
Phone: 734-615-5225   Fax: 734-764-2475    http://www.si.umich.edu/~radev


-------------------------------------------------------
The SF.Net email is sponsored by EclipseCon 2004
Premiere Conference on Open Tools Development and Integration
See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
http://www.eclipsecon.org/osdn
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to