Example for position end and positionLength of SGF.
query: natural forest
WT text start end positionLength type position
natural 0 7 1 word 1
forest 8 14 1 word 2
...
SPF text start end positionLength type position
natural 0 7 1 word 1
natural forest 0 14 2 shingle 2
forest 8 14 1 word 3
SGF text start end positionLength type position
natural 0 7 1 word 1
naturwald 0 14 1 SYNONYM 2
forêt naturelle 0 14 1 SYNONYM 2
natürlicher wald 0 14 1 SYNONYM 2
natural forest 0 14 1 shingle 2
forest 8 14 1 word 3
SPF text start end positionLength type position
natural 0 7 1 word 1
naturwald 0 9 1 SYNONYM 2
"forêt naturelle" 0 17 2 SYNONYM 2
"natürlicher wald" 0 18 2 SYNONYM 2
"natural forest" 0 16 2 shingle 2
forest 8 14 1 word 3
SGF (SynonymsGraphFilter) has for all SYNONYM's the same position end and
positionLength.
I suppose that it is not correct?
Regards
Bernd
Am 09.02.2017 um 18:39 schrieb Michael McCandless:
> On Thu, Feb 9, 2017 at 2:40 AM, Bernd Fehling
> <[email protected]> wrote:
>> I tried SynonymGraphFilter with my setup and it works right away.
>> It payed of that I did some modifications on my filters while
>> testing 6.3 with my setup.
>
> Good!
>
>> I only replaced SynonymFilter with SynonymGraphFilter and did not
>> use FlattenGraphFilter, pretty simple. So I can confirm that, up
>> to this point, SynonymGraphFilter is a full replacement for
>> SynonymFilter. At least for search-time synonym handling.
>>
>> But this also means there is still some work with the attributes, right?
>> Position looks good, type and start are no problem anyway, but
>> the end position is still wrong and the positionLength for multi-word
>> synonyms.
>
> Can you give an example or make a small test case?
> PositionLengthAttribute is supposed to be correct coming out of
> SynonymGraphFilter.
>
>> One thing I noticed was that the originating token which "produces"
>> synonyms comes out last from SynonymGraphFilter, after the
>> "produced" synonyms.
>> I will have a look inside with debugger but I guess this is due
>> to output buffering of SynonymGraphFilter?
>
> Yeah they do come out in a different order, which token filters are
> allowed to do in general for all tokens leaving from the same position
> ...
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
--
*************************************************************
Bernd Fehling Bielefeld University Library
Dipl.-Inform. (FH) LibTec - Library Technology
Universitätsstr. 25 and Knowledge Management
33615 Bielefeld
Tel. +49 521 106-4060 bernd.fehling(at)uni-bielefeld.de
BASE - Bielefeld Academic Search Engine - www.base-search.net
*************************************************************
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]