One way to address Umesh's concern is to boost terms
you *do* know enough about to assign to a specific field.
But the observation that
"That said, Best solution depends on your requirement"
is right on.....

Best
Erick

On Mon, Sep 22, 2008 at 5:29 AM, Umesh Prasad <[EMAIL PROTECTED]> wrote:

> Hi,
> Having an extra indexed but unstored field is equivalent to having a bag of
> words. So the search results quality will be affected.
> Consider an Example:
>
> Text : ----  President  of USA--
> Other Fields ..
>
> Text : --
> Occupation: President of USA
>
> In both cases searchable-mash = BAG of WORDs, will have President of USA
> hence will score almost same, which would be undesirable.
>
>
> Another solution is to learn the field name of each term in the
> unstructured
> query and then form the query programmatically.
> You will have to write 2 additional subsystems.
> 1. Field Learning System
> 2. Customized Query Tokenizer and Query Parser
>
> That said, Best solution depends on your requirement.
>
> Thanks
> Umesh
> On Mon, Sep 22, 2008 at 2:18 PM, Dino Korah <[EMAIL PROTECTED]> wrote:
>
> > I would think, with the current capabilities of lucene, denormalisation
> is
> > the solution. Create an extra indexed but not stored field called
> > "searchable-mash" which will hold the values from all fields with added
> > words to connect the data like "Male named George Bush whoes occupation
> is
> > President of USA ... Etc" so that you can run that generic query on that
> > field.
> >
> > So you pass "searchable-mash: George bush and president" to query parser.
> >
> > You will pay a penalty here, of bigger index and slower indexing.
> >
> > -----Original Message-----
> > From: Anshul jain [mailto:[EMAIL PROTECTED]
> > Sent: 21 September 2008 20:27
> > To: java-user@lucene.apache.org
> > Subject: Multi Field search without Multifieldqueryparser
> >
> > Hi!
> >
> > I've a lucene document structured like:
> > Field: Text
> > name: George Bush
> > Sex: Male
> > Occupation: President of USA
> >
> > Now I can have two types of queries:
> > Structured query:
> > name: George Bush AND Occupation: President
> >
> > Unstructured Query:
> > George Bush AND President.
> >
> > After parsing it will become, value: George bush and president.
> > "value" is some default field that has to defined during parsing.
> >
> > But as you can see that this unstructured query would not work because of
> > the structure of the lucene document. Now what I want to do is that when
> an
> > user gives an Unstructured query Lucene should search in all fields.
> (Multi
> > field query parser is an option but we have to define all the fields
> first,
> > and it can be expensive as the query can get really big).
> >
> > I would really appreciate if you can help me out with this.
> >
> > Regards,
> > Anshul Jain
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> >
>
>
> --
> Thanking you
>
> Regards
> Umesh Prasad
>

Reply via email to