I have some documents I'm indexing which have multiple languages in them
(i.e., some fields in the document are always English; other fields may be
other languages). Now, I understand why a query against a certain field
must use the same analyzer as was used when that field was indexed
(stemming,
I have two fields, call them FieldA and FieldB. I have a set of words I'm
looking for in FieldA, call them A1 and A2. I have a different set of words
for FieldB, call them B1 and B2. Now I want a hit list which contains items
that have at least one A item in FieldA and one B item in FieldB. In
Thomas Scheffler wrote:
>
> Erik Hatcher wrote:
>> On Jan 12, 2004, at 8:21 AM, Thomas Scheffler wrote:
>>> OK, I've looked inside QueryParser and it seems to be the right place to
>>> do that. But it's rather complicated to transform one query into another,
>>> since QueryParserTokenManager as
Erik Hatcher wrote:
> On Jan 12, 2004, at 8:21 AM, Thomas Scheffler wrote:
>> OK, I've looked inside QueryParser and it seems to be the right place to
>> do that. But it's rather complicated to transform one query into another,
>> since QueryParserTokenManager as an extreme example is not quite
>
On Jan 12, 2004, at 8:21 AM, Thomas Scheffler wrote:
OK, I've looked inside QueryParser and it seems to be the right place to
do that. But it's rather complicated to transform one query into another,
since QueryParserTokenManager as an extreme example is not quite
understandable and needs a huge tim
Erik Hatcher wrote:
> On Jan 7, 2004, at 4:18 PM, Dror Matalon wrote:
>> Actually I would guess that performance should be fine. I would look at
>> the code generated by the standard analyzer,
>> http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/analysis/standard/package-summary.html
On Jan 12, 2004, at 6:24 AM, [EMAIL PROTECTED] wrote:
> Who knows of other software projects (like Nutch) that are based on and
> built around Lucene? I think it can be quite interesting and helpful for
> new people to see and learn from examples...
This is the purpose of the "Powered by" section on Lucen
Hi,
Does Lucene have functionality to get the top n most frequent words from a
given text, stream, token stream, etc., along with their frequencies?
Ralf
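
For reference, the counting itself can be sketched in plain Java without
any Lucene classes. This is only an illustration of the idea, not a Lucene
API: the class name TopTerms, the method topN, and the tokenization rule
(lowercase, split on anything that is not a letter or digit) are all
assumptions made for this example. A real setup would more likely feed the
tokens produced by an analyzer into the same kind of counting map.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: counts tokens in a plain string.
class TopTerms {
    // Returns up to n (token, count) entries, most frequent first.
    // Ties are broken alphabetically so the order is deterministic.
    static List<Map.Entry<String, Integer>> topN(String text, int n) {
        Map<String, Integer> counts = new HashMap<>();
        // Assumed tokenization: lowercase, split on non-letter/non-digit runs.
        for (String token : text.toLowerCase().split("[^\\p{L}\\p{N}]+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        return entries.subList(0, Math.min(n, entries.size()));
    }
}
```

Calling TopTerms.topN(text, 10) would then give the ten most frequent
tokens with their counts. Sorting the whole map is fine for a single
document; for very large inputs a bounded priority queue would avoid
sorting every distinct term.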
Hello group,
Who knows of other software projects (like Nutch) that are based on and built
around Lucene? I think it can be quite interesting and helpful for new people
to see and learn from examples...
Cheers,
Ralf