Re: Questions for SynonymGraphFilter and WordDelimiterGraphFilter

2019-01-08 Thread Wei
bump..

On Mon, Jan 7, 2019 at 11:53 AM Wei  wrote:

> Thanks Thomas. You mentioned "Also there is no need for the
> FlattenGraphFilter", that's quite interesting because the Solr
> documentation says it's mandatory for indexing:
> https://lucene.apache.org/solr/guide/7_6/filter-descriptions.html. Is
> there any more explanation for this?
>
> Best regards,
> Wei
>
>
> On Mon, Jan 7, 2019 at 7:56 AM Thomas Aglassinger <
> t.aglassin...@netconomy.net> wrote:
>
>> Hi Wei,
>>
>> here's a fairly simple field type we currently use in a project that
>> seems to do the job with graph synonyms. Maybe this helps as a starting
>> point for you:
>>
>> > positionIncrementGap="100">
>> 
>> 
>> > managed="de" />
>> > managed="de" />
>> > preserveOriginal="1"
>> generateWordParts="1" generateNumberParts="1"
>> catenateWords="1"
>> catenateNumbers="1" catenateAll="0"
>> splitOnCaseChange="1" />
>> 
>> 
>> 
>> 
>> 
>> 
>>
>> As you can see we use the same filters for both indexing and query, so
>> this might have some impact on positional queries but so far it seems
>> negligible for the short synonyms we use in practice. Also there is no need
>> for the FlattenGraphFilter.
>>
>> The WhitespaceTokenizerFactory ensures that you can define synonyms with
>> hyphens like mac-book -> macbook.
>>
>> Best regards, Thomas.
>>
>>
>> On 05.01.19, 02:11, "Wei"  wrote:
>>
>> Hello,
>>
>> We are upgrading to Solr 7.6.0 and noticed that SynonymFilter and
>> WordDelimiterFilter have been deprecated. Solr doc recommends to use
>> SynonymGraphFilter and WordDelimiterGraphFilter instead
>> I guess the StopFilter mess up the SynonymGraphFilter output? Not sure
>> if  it's a solr defect or there is a guideline that StopFilter should
>> not be put after graph filters.
>>
>> Thanks in advance for you input.
>>
>>
>> Thanks,
>>
>> Wei
>>
>>
>>


Re: Questions for SynonymGraphFilter and WordDelimiterGraphFilter

2019-01-07 Thread Wei
Thanks Thomas. You mentioned "Also there is no need for the
FlattenGraphFilter", that's quite interesting because the Solr
documentation says it's mandatory for indexing:
https://lucene.apache.org/solr/guide/7_6/filter-descriptions.html. Is there
any more explanation for this?

Best regards,
Wei


On Mon, Jan 7, 2019 at 7:56 AM Thomas Aglassinger <
t.aglassin...@netconomy.net> wrote:

> Hi Wei,
>
> here's a fairly simple field type we currently use in a project that seems
> to do the job with graph synonyms. Maybe this helps as a starting point for
> you:
>
>  positionIncrementGap="100">
> 
> 
>  managed="de" />
>  />
>  preserveOriginal="1"
> generateWordParts="1" generateNumberParts="1"
> catenateWords="1"
> catenateNumbers="1" catenateAll="0"
> splitOnCaseChange="1" />
> 
> 
> 
> 
> 
> 
>
> As you can see we use the same filters for both indexing and query, so
> this might have some impact on positional queries but so far it seems
> negligible for the short synonyms we use in practice. Also there is no need
> for the FlattenGraphFilter.
>
> The WhitespaceTokenizerFactory ensures that you can define synonyms with
> hyphens like mac-book -> macbook.
>
> Best regards, Thomas.
>
>
> On 05.01.19, 02:11, "Wei"  wrote:
>
> Hello,
>
> We are upgrading to Solr 7.6.0 and noticed that SynonymFilter and
> WordDelimiterFilter have been deprecated. Solr doc recommends to use
> SynonymGraphFilter and WordDelimiterGraphFilter instead
> I guess the StopFilter mess up the SynonymGraphFilter output? Not sure
> if  it's a solr defect or there is a guideline that StopFilter should
> not be put after graph filters.
>
> Thanks in advance for you input.
>
>
> Thanks,
>
> Wei
>
>
>


Re: Questions for SynonymGraphFilter and WordDelimiterGraphFilter

2019-01-07 Thread Thomas Aglassinger
Hi Wei,

here's a fairly simple field type we currently use in a project that seems to 
do the job with graph synonyms. Maybe this helps as a starting point for you:














As you can see we use the same filters for both indexing and query, so this 
might have some impact on positional queries but so far it seems negligible for 
the short synonyms we use in practice. Also there is no need for the 
FlattenGraphFilter.

The WhitespaceTokenizerFactory ensures that you can define synonyms with 
hyphens like mac-book -> macbook.

Best regards, Thomas.


On 05.01.19, 02:11, "Wei"  wrote:

Hello,

We are upgrading to Solr 7.6.0 and noticed that SynonymFilter and
WordDelimiterFilter have been deprecated. Solr doc recommends to use
SynonymGraphFilter and WordDelimiterGraphFilter instead 
I guess the StopFilter mess up the SynonymGraphFilter output? Not sure
if  it's a solr defect or there is a guideline that StopFilter should
not be put after graph filters.

Thanks in advance for you input.


Thanks,

Wei