Re: Bad fieldNorm when using morphologic synonyms

Robert Muir Fri, 06 Dec 2013 05:58:24 -0800

termvectors have nothing to do with any of this.

please, fix your analyzer first. if you want to add a synonym, it
should be position increment of zero.


i bet exact phrase queries aren't working correctly either.

On Fri, Dec 6, 2013 at 12:50 AM, Isaac Hebsh <isaac.he...@gmail.com> wrote:
> 1) positions look all right (for me).
> 2) fieldNorm is determined by the size of the termVector, isn't it? the
> termVector size isn't affected by the positions.
>
>
> On Fri, Dec 6, 2013 at 10:46 AM, Robert Muir <rcm...@gmail.com> wrote:
>
>> Your analyzer needs to set positionIncrement correctly: sounds like its
>> broken.
>>
>> On Thu, Dec 5, 2013 at 1:53 PM, Isaac Hebsh <isaac.he...@gmail.com> wrote:
>> > Hi,
>> > we implemented a morphologic analyzer, which stems words on index time.
>> > For some reasons, we index both the original word and the stem (on the
>> same
>> > position, of course).
>> > The stemming is done on a specific language, so other languages are not
>> > stemmed at all.
>> >
>> > Because of that, two documents with the same amount of terms, may have
>> > different termVector size. document which contains many words that being
>> > stemmed, will have a double sized termVector. This behaviour affects the
>> > relevance score in a BAD way. the fieldNorm of these documents reduces
>> > thier score. This is NOT the wanted behaviour in our case.
>> >
>> > We are looking for a way to "mark" the stemmed words (on index time, of
>> > course) so they won't affect the fieldNorm. Do such a way exist?
>> >
>> > Do you have another idea?
>>

Re: Bad fieldNorm when using morphologic synonyms

Reply via email to