Re: Fixing query-time multi-word synonym issue

2013-03-28 Thread Otis Gospodnetic
Ha! Right, it's not about synonyms, it's about the QP doing the wrong thing. At this point I'm not even 100% sure which QP is the problem. I think it's the Lucene one, but also the SolrQueryParser, is this correct? But isn't there a flex parser in Lucene contrib somewhere? What is the state of

Re: Fixing query-time multi-word synonym issue

2013-01-26 Thread Michael McCandless
On Sat, Jan 26, 2013 at 11:16 AM, Jack Krupansky wrote: > Yeah, I suspected that it was going to be expert/cryptic. I think the real > point, from my September proposal, was that we need a common piece of code > that has all that expert smarts to reconstruct the "graph" and then can > generate the

Re: Fixing query-time multi-word synonym issue

2013-01-26 Thread Jack Krupansky
ky -Original Message- From: Michael McCandless Sent: Saturday, January 26, 2013 6:31 AM To: dev@lucene.apache.org Subject: Re: Fixing query-time multi-word synonym issue On Fri, Jan 25, 2013 at 6:28 PM, Jack Krupansky wrote: Is there a decent writeup on PositionLengthAttribute?

Re: Fixing query-time multi-word synonym issue

2013-01-26 Thread Michael McCandless
On Fri, Jan 25, 2013 at 6:28 PM, Jack Krupansky wrote: > Is there a decent writeup on PositionLengthAttribute? I mean, the Javadoc > says "The positionLength determines how many positions this token spans", > which doesn't sound very relevant to multi-term synonyms that span multiple > positions.

Re: Fixing query-time multi-word synonym issue

2013-01-25 Thread Jack Krupansky
, January 25, 2013 4:41 PM To: dev@lucene.apache.org Subject: Re: Fixing query-time multi-word synonym issue PositionLengthAttribute is sufficient to "express" the true graph, but SynonymFilter has not been fully fixed to properly set it. Specifically, it cannot "create new positions
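
To make the point about PositionLengthAttribute concrete, here is a minimal sketch in plain Python (not Lucene code). It models each token as a hypothetical (term, position increment, position length) triple and lists the term sequences a consumer could read off the stream. The two token layouts are assumptions chosen for illustration; they are not a claim about what SynonymFilter actually emits.

```python
from collections import defaultdict

def paths(tokens):
    """List every term sequence through a token graph.

    Each token is (term, position_increment, position_length), loosely
    mirroring Lucene's PositionIncrementAttribute and PositionLengthAttribute.
    """
    edges = defaultdict(list)  # start node -> [(term, end node)]
    pos = -1
    last = 0
    for term, pos_inc, pos_len in tokens:
        pos += pos_inc
        edges[pos].append((term, pos + pos_len))
        last = max(last, pos + pos_len)

    def walk(node):
        # Reaching the final node completes one path.
        if node == last:
            return [[]]
        return [[term] + rest
                for term, end in edges[node]
                for rest in walk(end)]

    return [" ".join(p) for p in walk(0)]

# Assumed "flat" layout: both synonyms stacked on the same two positions,
# every position length left at 1.
FLAT = [("heart", 1, 1), ("myocardial", 0, 1),
        ("attack", 1, 1), ("infarction", 0, 1)]

# Assumed graph layout: an extra position is created so that each phrase
# keeps its own path from start to end.
GRAPH = [("heart", 1, 1), ("myocardial", 0, 2),
         ("attack", 1, 2), ("infarction", 1, 1)]
```

With the flat layout, paths(FLAT) also yields the crossed phrases "heart infarction" and "myocardial attack", because nothing records which second word follows which first word. With position lengths set and a new position created, paths(GRAPH) yields only "heart attack" and "myocardial infarction".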

Re: Fixing query-time multi-word synonym issue

2013-01-25 Thread Michael McCandless
single input string. > > Yes, the position information is preserved, but there is no "path" attribute > to be able to tell that "heart" was before "attack" as opposed to before > "infarction". > > > -- Jack Krupansky > > -Origina

Re: Fixing query-time multi-word synonym issue

2013-01-25 Thread Jack Krupansky
Krupansky -Original Message- From: Robert Muir Sent: Friday, January 25, 2013 9:47 AM To: dev@lucene.apache.org Subject: Re: Fixing query-time multi-word synonym issue On Fri, Jan 25, 2013 at 9:19 AM, Jack Krupansky wrote: Here's an example query with q.op=AND: causes of hea

Re: Fixing query-time multi-word synonym issue

2013-01-25 Thread Robert Muir
On Fri, Jan 25, 2013 at 9:19 AM, Jack Krupansky wrote: > Here's an example query with q.op=AND: > >causes of heart attack > > And I have this synonym definition: > >heart attack, myocardial infarction > > So, what is the alleged query parser fix so that the query is treated as: > >caus
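
The example above can be reproduced with a small plain-Python sketch (not Lucene or Solr code). It assumes the query-parser behaviour at issue is splitting the query on whitespace before analysis, so that a synonym rule spanning two words never sees both words together. The SYNONYMS table and all function names are hypothetical, and only one direction of the rule is modelled.

```python
# One direction of the rule "heart attack, myocardial infarction".
SYNONYMS = {("heart", "attack"): [("myocardial", "infarction")]}

def expand(tokens):
    """Return the synonym alternatives matched inside one run of tokens."""
    found = []
    for i in range(len(tokens)):
        for phrase, alternatives in SYNONYMS.items():
            if tuple(tokens[i:i + len(phrase)]) == phrase:
                found.extend(alternatives)
    return found

def analyze_whole(query):
    """Analyze the full query text in one pass (index-time style)."""
    return expand(query.split())

def analyze_per_term(query):
    """Analyze each whitespace-separated chunk on its own, as a parser
    that pre-splits on whitespace would."""
    hits = []
    for chunk in query.split():
        hits.extend(expand([chunk]))
    return hits
```

For "causes of heart attack", analyze_whole finds the ("myocardial", "infarction") alternative, while analyze_per_term finds nothing, since "heart" and "attack" are each analyzed in isolation.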

Re: Fixing query-time multi-word synonym issue

2013-01-25 Thread Jack Krupansky
tack", but that alone doesn't address the synonym filter misbehavior. -- Jack Krupansky -Original Message- From: Robert Muir Sent: Friday, January 25, 2013 3:46 AM To: dev@lucene.apache.org Subject: Re: Fixing query-time multi-word synonym issue On Fri, Jan 25, 2013 at 12:48 A

Re: Fixing query-time multi-word synonym issue

2013-01-25 Thread Robert Muir
On Fri, Jan 25, 2013 at 12:48 AM, Jack Krupansky wrote: > Otis, this is precisely why nothing will get done any time soon on the > multi-term synonym issue - there isn't even common agreement that there is a > problem, let alone common agreement on the specifics of the problem, let > alone common

Re: Fixing query-time multi-word synonym issue

2013-01-24 Thread Varun Thacker
pect general community agreement on a path forward > for multi-term synonyms any time soon. > > -- Jack Krupansky > > -Original Message- From: Robert Muir > Sent: Thursday, January 24, 2013 8:17 PM > To: dev@lucene.apache.org > Subject: Re: Fixing query-time multi-word

Re: Fixing query-time multi-word synonym issue

2013-01-24 Thread Jack Krupansky
Muir Sent: Thursday, January 24, 2013 8:17 PM To: dev@lucene.apache.org Subject: Re: Fixing query-time multi-word synonym issue On Thu, Jan 24, 2013 at 8:10 PM, Otis Gospodnetic wrote: Funny, I expected everyone would jump on this thread considering so many people hit this multi-word synonym

Re: Fixing query-time multi-word synonym issue

2013-01-24 Thread Robert Muir
On Thu, Jan 24, 2013 at 8:10 PM, Otis Gospodnetic wrote: > Funny, I expected everyone would jump on this thread considering so many > people hit this multi-word synonym issue... > Probably because it's not a synonyms problem: it's just a bug in a specific queryparser. If you don't use that querypar

Re: Fixing query-time multi-word synonym issue

2013-01-24 Thread Otis Gospodnetic
Funny, I expected everyone would jump on this thread considering so many people hit this multi-word synonym issue... Mikhail, I looked at the slides and while I understand that use case well, I don't see where query-time multi-word synonyms come into play. Please enlighten me :) Thanks! Otis Sol

Re: Fixing query-time multi-word synonym issue

2013-01-22 Thread Mikhail Khludnev
FWIW, multi-word synonyms are a side benefit of the query parsing approach implemented by my team. Here is how it looks: https://docs.google.com/a/griddynamics.com/presentation/pub?id=1oifLFI0MiA3ZyXZWisHJVRK13P8cki5yCABvABPObKw&start=false&loop=false&delayms=3000#slide=id.g1006de00_2_34 "fee people" fr

Fixing query-time multi-word synonym issue

2013-01-22 Thread Otis Gospodnetic
Hello, I'm looking for some guidance around solving the infamous index-time vs. query-time multi-word synonym problem. I'm looking for help with understanding the pieces and effort involved, and I'm also on the lookout for any potential "man, it will take you forever, you'll have to do major Lucene s