Philosophy(??) question

2004-01-12 Thread Scott Smith
I have some documents I'm indexing which have multiple languages in them (i.e., some fields in the document are always English; other fields may be other languages). Now, I understand why a query against a certain field must use the same analyzer as was used when that field was indexed (stemming,

Query question

2004-01-12 Thread Scott Smith
I have two fields, call them FieldA and FieldB. I have a set of words I'm looking for in FieldA, call them A1 and A2. I have a different set of words for FieldB, call them B1 and B2. Now I want a hit list which contains items that have at least one A item in FieldA and one B item in FieldB. In
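
The requirement described above (at least one A term in FieldA and at least one B term in FieldB) amounts to an AND of two OR groups. A minimal sketch, assuming Lucene's standard query parser syntax, where `+` marks a required clause and parentheses group alternatives. The helper names `build_query` and `matches` are hypothetical and not part of Lucene; `matches` only illustrates the intended semantics on plain Python data.

```python
def build_query(field_terms):
    """Build a query string that requires at least one listed term per field.

    field_terms maps a field name to the list of acceptable terms for it.
    """
    clauses = []
    for field, terms in field_terms.items():
        # Terms inside the parentheses are optional alternatives (OR).
        alternatives = " ".join(f"{field}:{term}" for term in terms)
        # The leading "+" makes the whole group required (AND across fields).
        clauses.append(f"+({alternatives})")
    return " ".join(clauses)


def matches(doc, field_terms):
    """Return True if doc has at least one listed term in every listed field.

    doc maps a field name to the set of tokens indexed for that field.
    """
    return all(
        any(term in doc.get(field, set()) for term in terms)
        for field, terms in field_terms.items()
    )


wanted = {"FieldA": ["A1", "A2"], "FieldB": ["B1", "B2"]}
query = build_query(wanted)
```

Here `query` is the string `+(FieldA:A1 FieldA:A2) +(FieldB:B1 FieldB:B2)`, which could be handed to the query parser; a document with A1 in FieldA but no B term in FieldB would not match.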

Re: merged search of document

2004-01-12 Thread Thomas Scheffler
Thomas Scheffler said: > Erik Hatcher said: >> On Jan 12, 2004, at 8:21 AM, Thomas Scheffler wrote: >>> OK, I've looked inside QueryParser and it seems to be the right place to do that. But it's rather complicated to transform a query to another, since QueryParserTokenManager as

Re: merged search of document

2004-01-12 Thread Thomas Scheffler
Erik Hatcher said: > On Jan 12, 2004, at 8:21 AM, Thomas Scheffler wrote: >> OK, I've looked inside QueryParser and it seems to be the right place to do that. But it's rather complicated to transform a query to another, since QueryParserTokenManager as an extreme example is not quite

Re: merged search of document

2004-01-12 Thread Erik Hatcher
On Jan 12, 2004, at 8:21 AM, Thomas Scheffler wrote: OK, I've looked inside QueryParser and it seems to be the right place to do that. But it's rather complicated to transform a query to another, since QueryParserTokenManager as an extreme example is not quite understandable and needs a huge tim

Re: merged search of document

2004-01-12 Thread Thomas Scheffler
Erik Hatcher said: > On Jan 7, 2004, at 4:18 PM, Dror Matalon wrote: >> Actually I would guess that performance should be fine. I would look at the code generated by the standard analyzer, http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/analysis/standard/package-summary.html

Re: Lucene based projects...?

2004-01-12 Thread Erik Hatcher
On Jan 12, 2004, at 6:24 AM, [EMAIL PROTECTED] wrote: who knows other software projects (like Nutch) which are based on and built around Lucene? I think it can be quite interesting and helpful for new people to see and learn from examples... This is the purpose of the "Powered by" section on Lucen

Getting top n most frequent words ?

2004-01-12 Thread Ralph
Hi, does Lucene have functionality to get the top n most frequent words from a given text / stream / token stream etc.? With frequencies? Ralf
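
For illustration, the kind of result asked about (the top n words of a text together with their frequencies) can be sketched outside Lucene with a plain counter. This is not a Lucene API; the function name `top_words` and the simple lowercase alphanumeric tokenization are assumptions, and a real analyzer would tokenize differently.

```python
import re
from collections import Counter


def top_words(text, n):
    """Return the n most frequent words in text as (word, count) pairs."""
    # Assumed tokenization: lowercase the text, keep runs of letters and digits.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    # Counter.most_common sorts by descending count.
    return Counter(tokens).most_common(n)


result = top_words("the cat and the hat and the bat", 2)
```

With the sample sentence, `result` is `[("the", 3), ("and", 2)]`.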

Lucene based projects...?

2004-01-12 Thread ambiesense
Hello group, who knows other software projects (like Nutch) which are based on and built around Lucene? I think it can be quite interesting and helpful for new people to see and learn from examples... Cheers, Ralf