Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-14 Thread Mary Holstege
cts:stem will show the alternative stems, but basic stemming will only use the 
first stem given.
Stemmed search matching depends on matching stem to stem. In basic stemming, 
that means matching on the first stem; in advanced stemming that means matching 
on any of the stems. So, consider your words here:

cts:stem($word,"fr") =>
mourir = mourir
meurt = mourir
mourant = mourir, mourant
mourrait = mourir

Since "mourir" is the first stem for all these, they will all match each other 
under basic stemming.

disparu = disparu, disparaître
disparues = disparu, disparaître
disparaître = disparaître

Since the first stem "disparu" does not match the first stem "disparaître", 
"disparaître" will not match "disparu" under basic stemming although it would 
under advanced stemming.

marche = marche, marcher
marcher = marcher

Since the first stem  "marche" does not match the first stem "marcher", 
"marche" will not match "marcher" under basic stemming although it would under 
advanced stemming.

With respect to "baux" -> "bau"; "bau" is actually a word in French with the 
plural of "baux", although perhaps an obscure word. But even so, in general the 
stemming is a combination of dictionary information and algorithms, and you 
will occasionally turn up cases where you get something that isn't actually a 
word as the stem. But that doesn't really matter: what matters is whether the 
stems match. If a particular word is being stemmed in a way that causes trouble 
for your application, you can always add it to your custom dictionary to force 
a different result.

In general I would say that if basic stemming is not giving you what you want 
in terms of search recall, use advanced stemming. The need is partly dependent 
on the characteristics of the language, and partly on the needs of your 
application. I think French in particular is a language with a lot of words 
that have the same surface forms but different underlying stems, and where the 
shorter stem (which is generally the first) may not be the high probability 
choice, so advanced stemming could make a big difference for some applications.

//Mary


On 04/14/2016 12:58 PM, Gontla Praveen wrote:
Hi Mary,

While testing found more when only basic stemming is enabled.

For example the term "mourir" with basic stemming enabled returns me 
meurt,mourant, mourrait, mourir

let $text:= marcher avec la bau rupture de baux septembre 
1997, bail marche cette disparues situation bau fait disparaître la 
justification. Les services fournis disparu par la demanderesse l'ont été dans 
l'attente d'une rémunération,
return
cts:highlight($text,cts:query(
mourir
case-insensitive
diacritic-insensitive
punctuation-insensitive
  ),{$cts:text})

Why does not the same happens for the term disparu  or marche?

Why advanced stemming required for these terms? Is it anything specific to 
French language ?

Also, when i did check for stems of cts:stem("baux","fr") i get bau,bail where 
bau doesnt have any meaning in french.

Since only basic stemming is enabled at my DB level i am seeing documents 
contains baux or bau but not bail.

Can you tell me why this difference in bahaviour on french stems.

Thanks,
Praveen.


___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-14 Thread Gontla Praveen
Hi Mary,

While testing found more when only basic stemming is enabled.

For example the term "mourir" with basic stemming enabled returns me meurt,
mourant, mourrait, mourir

let $text:= marcher avec la bau rupture de baux
septembre 1997, bail marche cette disparues situation bau fait disparaître
la justification. Les services fournis disparu par la demanderesse l'ont
été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:query(
mourir
case-insensitive

diacritic-insensitive

punctuation-insensitive
  ),{$cts:text})

Why does not the same happens for the term disparu  or marche?

Why advanced stemming required for these terms? Is it anything specific to
French language ?

Also, when i did check for stems of cts:stem("baux","fr") i get bau,bail
where bau doesnt have any meaning in french.

Since only basic stemming is enabled at my DB level i am seeing documents
contains baux or bau but not bail.

Can you tell me why this difference in bahaviour on french stems.

Thanks,
Praveen.

On Tue, Apr 12, 2016 at 10:45 AM, Mary Holstege  wrote:

> On Tue, 12 Apr 2016 07:10:46 -0700, Gontla Praveen <
> praveenkumargontla...@gmail.com> wrote:
>
> Hi Mary,
>>
>> Why an advanced stemming need to be enabled any specific reason for that?
>>
>
> Not everyone needs or wants advanced stemming: it does more work (so,
> slightly slower) with larger indexes.
> For some languages, the slight increase in recall is not worth it for many
> use cases.
>
>
>> What will be difference between using basic stemming and advanced
>> stemming ?
>>
>
> Basic stemming only indexes the preferred stem for each token (typically,
> the shortest one). Advanced stemming indexes all possible stems.
>
> Completing the picture:
> * decompounding is like advanced stemming, but with additional indexing
> for components of compounds. This principally applies to German and
> languages like that that create long noun clusters as single words.
> * you can also turn stemming off entirely; principally useful where you
> searching over non-linguistic content
>
> //Mary
>
>
>> Thanks,
>> Praveen.
>>
>> On Thu, Mar 31, 2016 at 12:58 PM, Mary Holstege <
>> mary.holst...@marklogic.com
>>
>>> wrote:
>>>
>>
>>
>>> Do you have advanced stemming enabled? With basic stemming only the first
>>> stem returned from cts:stem indexed and used for matching in search.
>>>
>>> //Mary
>>>
>>>
>>> On 03/31/2016 03:00 AM, Debin, Infant Jerald (LNG-CON) wrote:
>>>
>>> Hi Team,
>>>
>>>
>>>
>>> For the term French term *“disparu”* corresponding French stemmed word
>>> *“disparaître”* is not getting recognized when performing search.
>>>
>>>
>>>
>>> *Example:*
>>>
>>>
>>>
>>> *Query:*
>>>
>>>
>>>
>>> let $text:= avec la rupture de septembre 1997, cette
>>> disparues situation fait disparaître la justification. Les services
>>> fournis
>>> disparu par la demanderesse l'ont été dans l'attente d'une
>>> rémunération,
>>>
>>> return
>>>
>>> cts:highlight($text,cts:query(
>>>
>>> >> xml:lang="fr">disparu
>>>
>>> case-insensitive
>>>
>>>
>>> diacritic-insensitive
>>>
>>>
>>> punctuation-insensitive
>>>
>>>   ),{$cts:text})
>>>
>>>
>>>
>>> *Result:*
>>>
>>>
>>>
>>> Disparaître is not getting recognized and highlighted as below,
>>>
>>>
>>>
>>> avec la rupture de septembre 1997, cette
>>> disparues situation fait disparaître la justification. Les
>>> services fournis disparu par la demanderesse l'ont été dans
>>> l'attente d'une rémunération,
>>>
>>>
>>>
>>> Below is the result of cts:stem,
>>>
>>>
>>>
>>> cts:stem("disparu","fr")
>>>
>>>
>>>
>>> disparu
>>>
>>> disparaître
>>>
>>>
>>>
>>> Please let us know on this issue.
>>>
>>>
>>>
>>> Thanks and Regards,
>>>
>>>
>>>
>>> Debin
>>>
>>> Mob: +91-9789826001
>>>
>>>
>>>
>>>
>>> ___
>>> General mailing listgene...@developer.marklogic.com
>>> Manage your subscription at:
>>> http://developer.marklogic.com/mailman/listinfo/general
>>>
>>>
>>>
>>> ___
>>> General mailing list
>>> General@developer.marklogic.com
>>> Manage your subscription at:
>>> http://developer.marklogic.com/mailman/listinfo/general
>>>
>>>
>>>
>
> --
> Using Opera's revolutionary email client: http://www.opera.com/mail/
>
___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-12 Thread Mary Holstege
On Tue, 12 Apr 2016 07:10:46 -0700, Gontla Praveen  
 wrote:

> Hi Mary,
>
> Why an advanced stemming need to be enabled any specific reason for that?

Not everyone needs or wants advanced stemming: it does more work (so,  
slightly slower) with larger indexes.
For some languages, the slight increase in recall is not worth it for many  
use cases.

>
> What will be difference between using basic stemming and advanced  
> stemming ?

Basic stemming only indexes the preferred stem for each token (typically,  
the shortest one). Advanced stemming indexes all possible stems.

Completing the picture:
* decompounding is like advanced stemming, but with additional indexing  
for components of compounds. This principally applies to German and  
languages like that that create long noun clusters as single words.
* you can also turn stemming off entirely; principally useful where you  
searching over non-linguistic content

//Mary

>
> Thanks,
> Praveen.
>
> On Thu, Mar 31, 2016 at 12:58 PM, Mary Holstege  
> > wrote:
>
>>
>> Do you have advanced stemming enabled? With basic stemming only the  
>> first
>> stem returned from cts:stem indexed and used for matching in search.
>>
>> //Mary
>>
>>
>> On 03/31/2016 03:00 AM, Debin, Infant Jerald (LNG-CON) wrote:
>>
>> Hi Team,
>>
>>
>>
>> For the term French term *“disparu”* corresponding French stemmed word
>> *“disparaître”* is not getting recognized when performing search.
>>
>>
>>
>> *Example:*
>>
>>
>>
>> *Query:*
>>
>>
>>
>> let $text:= avec la rupture de septembre 1997, cette
>> disparues situation fait disparaître la justification. Les services  
>> fournis
>> disparu par la demanderesse l'ont été dans l'attente d'une
>> rémunération,
>>
>> return
>>
>> cts:highlight($text,cts:query(
>>
>> > xml:lang="fr">disparu
>>
>> case-insensitive
>>
>>
>> diacritic-insensitive
>>
>>
>> punctuation-insensitive
>>
>>   ),{$cts:text})
>>
>>
>>
>> *Result:*
>>
>>
>>
>> Disparaître is not getting recognized and highlighted as below,
>>
>>
>>
>> avec la rupture de septembre 1997, cette  
>> disparues situation fait disparaître la justification. Les  
>> services fournis disparu par la demanderesse l'ont été dans  
>> l'attente d'une rémunération,
>>
>>
>>
>> Below is the result of cts:stem,
>>
>>
>>
>> cts:stem("disparu","fr")
>>
>>
>>
>> disparu
>>
>> disparaître
>>
>>
>>
>> Please let us know on this issue.
>>
>>
>>
>> Thanks and Regards,
>>
>>
>>
>> Debin
>>
>> Mob: +91-9789826001
>>
>>
>>
>>
>> ___
>> General mailing listgene...@developer.marklogic.com
>> Manage your subscription at:  
>> http://developer.marklogic.com/mailman/listinfo/general
>>
>>
>>
>> ___
>> General mailing list
>> General@developer.marklogic.com
>> Manage your subscription at:
>> http://developer.marklogic.com/mailman/listinfo/general
>>
>>


-- 
Using Opera's revolutionary email client: http://www.opera.com/mail/
___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-12 Thread Gontla Praveen
Found the answer myself in docs. Please ignore my email.

On Tue, Apr 12, 2016 at 10:10 AM, Gontla Praveen <
praveenkumargontla...@gmail.com> wrote:

> Hi Mary,
>
> Why an advanced stemming need to be enabled any specific reason for that?
>
> What will be difference between using basic stemming and advanced stemming
> ?
>
> Thanks,
> Praveen.
>
> On Thu, Mar 31, 2016 at 12:58 PM, Mary Holstege <
> mary.holst...@marklogic.com> wrote:
>
>>
>> Do you have advanced stemming enabled? With basic stemming only the first
>> stem returned from cts:stem indexed and used for matching in search.
>>
>> //Mary
>>
>>
>> On 03/31/2016 03:00 AM, Debin, Infant Jerald (LNG-CON) wrote:
>>
>> Hi Team,
>>
>>
>>
>> For the term French term *“disparu”* corresponding French stemmed word
>> *“disparaître”* is not getting recognized when performing search.
>>
>>
>>
>> *Example:*
>>
>>
>>
>> *Query:*
>>
>>
>>
>> let $text:= avec la rupture de septembre 1997, cette
>> disparues situation fait disparaître la justification. Les services fournis
>> disparu par la demanderesse l'ont été dans l'attente d'une
>> rémunération,
>>
>> return
>>
>> cts:highlight($text,cts:query(
>>
>> disparu
>> 
>>
>> case-insensitive
>>
>>
>> diacritic-insensitive
>>
>>
>> punctuation-insensitive
>>
>>   ),{$cts:text})
>>
>>
>>
>> *Result:*
>>
>>
>>
>> Disparaître is not getting recognized and highlighted as below,
>>
>>
>>
>> avec la rupture de septembre 1997, cette 
>> disparues situation fait disparaître la justification. Les services 
>> fournis disparu par la demanderesse l'ont été dans l'attente d'une 
>> rémunération,
>>
>>
>>
>> Below is the result of cts:stem,
>>
>>
>>
>> cts:stem("disparu","fr")
>>
>>
>>
>> disparu
>>
>> disparaître
>>
>>
>>
>> Please let us know on this issue.
>>
>>
>>
>> Thanks and Regards,
>>
>>
>>
>> Debin
>>
>> Mob: +91-9789826001
>>
>>
>>
>>
>> ___
>> General mailing listgene...@developer.marklogic.com
>> Manage your subscription at: 
>> http://developer.marklogic.com/mailman/listinfo/general
>>
>>
>>
>> ___
>> General mailing list
>> General@developer.marklogic.com
>> Manage your subscription at:
>> http://developer.marklogic.com/mailman/listinfo/general
>>
>>
>
___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-04-12 Thread Gontla Praveen
Hi Mary,

Why an advanced stemming need to be enabled any specific reason for that?

What will be difference between using basic stemming and advanced stemming ?

Thanks,
Praveen.

On Thu, Mar 31, 2016 at 12:58 PM, Mary Holstege  wrote:

>
> Do you have advanced stemming enabled? With basic stemming only the first
> stem returned from cts:stem indexed and used for matching in search.
>
> //Mary
>
>
> On 03/31/2016 03:00 AM, Debin, Infant Jerald (LNG-CON) wrote:
>
> Hi Team,
>
>
>
> For the term French term *“disparu”* corresponding French stemmed word
> *“disparaître”* is not getting recognized when performing search.
>
>
>
> *Example:*
>
>
>
> *Query:*
>
>
>
> let $text:= avec la rupture de septembre 1997, cette
> disparues situation fait disparaître la justification. Les services fournis
> disparu par la demanderesse l'ont été dans l'attente d'une
> rémunération,
>
> return
>
> cts:highlight($text,cts:query(
>
> disparu
>
> case-insensitive
>
>
> diacritic-insensitive
>
>
> punctuation-insensitive
>
>   ),{$cts:text})
>
>
>
> *Result:*
>
>
>
> Disparaître is not getting recognized and highlighted as below,
>
>
>
> avec la rupture de septembre 1997, cette 
> disparues situation fait disparaître la justification. Les services 
> fournis disparu par la demanderesse l'ont été dans l'attente d'une 
> rémunération,
>
>
>
> Below is the result of cts:stem,
>
>
>
> cts:stem("disparu","fr")
>
>
>
> disparu
>
> disparaître
>
>
>
> Please let us know on this issue.
>
>
>
> Thanks and Regards,
>
>
>
> Debin
>
> Mob: +91-9789826001
>
>
>
>
> ___
> General mailing listgene...@developer.marklogic.com
> Manage your subscription at: 
> http://developer.marklogic.com/mailman/listinfo/general
>
>
>
> ___
> General mailing list
> General@developer.marklogic.com
> Manage your subscription at:
> http://developer.marklogic.com/mailman/listinfo/general
>
>
___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-03-31 Thread Mary Holstege

Do you have advanced stemming enabled? With basic stemming only the first stem 
returned from cts:stem indexed and used for matching in search.

//Mary

On 03/31/2016 03:00 AM, Debin, Infant Jerald (LNG-CON) wrote:
Hi Team,

For the term French term “disparu” corresponding French stemmed word  
“disparaître” is not getting recognized when performing search.

Example:

Query:

let $text:= avec la rupture de septembre 1997, cette 
disparues situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:query(
disparu
case-insensitive
diacritic-insensitive
punctuation-insensitive
  ),{$cts:text})

Result:

Disparaître is not getting recognized and highlighted as below,


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Below is the result of cts:stem,

cts:stem("disparu","fr")

disparu
disparaître

Please let us know on this issue.

Thanks and Regards,

Debin
Mob: +91-9789826001




___
General mailing list
General@developer.marklogic.com
Manage your subscription at:
http://developer.marklogic.com/mailman/listinfo/general


___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-03-31 Thread Geert Josten
Hi Debin,

That is because ‘consign’ is a stem of all three the words you wanted 
highlighted, whereas ‘disparu’ wasn’t a stem of ‘disparaître’. I can imagine 
MarkLogic applies cts:stem on search terms, but only includes the shortest or 
first stem. Applying cts:stem to your search terms manually will make sure all 
stems are considered..

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Date: Thursday, March 31, 2016 at 2:28 PM
To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Greet,

It is working perfectly for English in below example without having stem 
introduced in the query,

let $text:= He asked the accused what he knew of the 
consigned goods. The accused told Mr. Osmond that he was familiar ...
... own goods as opposed to consign reimbursing the owner of the consigning 
goods.
return
cts:highlight($text,cts:query(
consign
case-insensitive
diacritic-insensitive
punctuation-insensitive
  ),{$cts:text})


Result : He asked the accused what he knew of the 
consigned goods. The accused told Mr. Osmond that he was familiar ...
... own goods as opposed to consign reimbursing the owner of the 
consigning goods.


All the stem words are recognized and highlighted for English without having 
cts:stem .

But French words it is not working as expected.

Please advise.

Thanks and Regards,

Debin
Mob: +91-9789826001

From: 
general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>
 [mailto:general-boun...@developer.marklogic.com] On Behalf Of Geert Josten
Sent: Thursday, March 31, 2016 7:47 AM
To: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Debin,

How about applying cts:stem on search terms first?

let $text:= avec la rupture de septembre 1997, cette 
disparues situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:word-query(cts:stem("disparu", "fr"), 
("lang=fr")),{$cts:text})

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Date: Thursday, March 31, 2016 at 1:14 PM
To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Geert,

I understand stem for disparaître is not disparu.

But stem for “disparu” is “disparaître”. The same is not getting recognized and 
highlighted.

We need “disparaître” to be highlighted when we give “disparu”.


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Please advise.

Thanks and Regards,

Debin
Mob: +91-9789826001

From:general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>
 [mailto:general-boun...@developer.marklogic.com] On Behalf Of Geert Josten
Sent: Thursday, March 31, 2016 7:00 AM
To: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Debin,

The issue is that the stem for disparaître is not disparu, but disparaître. If 
you highlight with disparaître instead, you should see all highlights you are 
looking for:

for $word in ("disparu", "disparues", "disparaître")
return ("==", $word, "", cts:stem("disparu","fr"))

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<

Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-03-31 Thread Geert Josten
Hi Debin,

How about applying cts:stem on search terms first?

let $text:= avec la rupture de septembre 1997, cette 
disparues situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:word-query(cts:stem("disparu", "fr"), 
("lang=fr")),{$cts:text})

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Date: Thursday, March 31, 2016 at 1:14 PM
To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Geert,

I understand stem for disparaître is not disparu.

But stem for “disparu” is “disparaître”. The same is not getting recognized and 
highlighted.

We need “disparaître” to be highlighted when we give “disparu”.


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Please advise.

Thanks and Regards,

Debin
Mob: +91-9789826001

From: 
general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>
 [mailto:general-boun...@developer.marklogic.com] On Behalf Of Geert Josten
Sent: Thursday, March 31, 2016 7:00 AM
To: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Debin,

The issue is that the stem for disparaître is not disparu, but disparaître. If 
you highlight with disparaître instead, you should see all highlights you are 
looking for:

for $word in ("disparu", "disparues", "disparaître")
return ("==", $word, "", cts:stem("disparu","fr"))

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Date: Thursday, March 31, 2016 at 12:00 PM
To: "general@developer.marklogic.com<mailto:general@developer.marklogic.com>" 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Subject: [MarkLogic Dev General] French stemmed word is not getting recognized 
and highlighted

Hi Team,

For the term French term “disparu” corresponding French stemmed word  
“disparaître” is not getting recognized when performing search.

Example:

Query:

let $text:= avec la rupture de septembre 1997, cette 
disparues situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:query(
disparu
case-insensitive
diacritic-insensitive
punctuation-insensitive
  ),{$cts:text})

Result:

Disparaître is not getting recognized and highlighted as below,


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Below is the result of cts:stem,

cts:stem("disparu","fr")

disparu
disparaître

Please let us know on this issue.

Thanks and Regards,

Debin
Mob: +91-9789826001

___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-03-31 Thread Debin, Infant Jerald (LNG-CON)
Hi Geert,

I understand stem for disparaître is not disparu.

But stem for "disparu" is "disparaître". The same is not getting recognized and 
highlighted.

We need "disparaître" to be highlighted when we give "disparu".


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Please advise.

Thanks and Regards,

Debin
Mob: +91-9789826001

From: general-boun...@developer.marklogic.com 
[mailto:general-boun...@developer.marklogic.com] On Behalf Of Geert Josten
Sent: Thursday, March 31, 2016 7:00 AM
To: MarkLogic Developer Discussion
Subject: Re: [MarkLogic Dev General] French stemmed word is not getting 
recognized and highlighted

Hi Debin,

The issue is that the stem for disparaître is not disparu, but disparaître. If 
you highlight with disparaître instead, you should see all highlights you are 
looking for:

for $word in ("disparu", "disparues", "disparaître")
return ("==", $word, "", cts:stem("disparu","fr"))

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Date: Thursday, March 31, 2016 at 12:00 PM
To: "general@developer.marklogic.com<mailto:general@developer.marklogic.com>" 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Subject: [MarkLogic Dev General] French stemmed word is not getting recognized 
and highlighted

Hi Team,

For the term French term "disparu" corresponding French stemmed word  
"disparaître" is not getting recognized when performing search.

Example:

Query:

let $text:= avec la rupture de septembre 1997, cette 
disparues situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:query(
disparu
case-insensitive
diacritic-insensitive
punctuation-insensitive
  ),{$cts:text})

Result:

Disparaître is not getting recognized and highlighted as below,


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Below is the result of cts:stem,

cts:stem("disparu","fr")

disparu
disparaître

Please let us know on this issue.

Thanks and Regards,

Debin
Mob: +91-9789826001

___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] French stemmed word is not getting recognized and highlighted

2016-03-31 Thread Geert Josten
Hi Debin,

The issue is that the stem for disparaître is not disparu, but disparaître. If 
you highlight with disparaître instead, you should see all highlights you are 
looking for:

for $word in ("disparu", "disparues", "disparaître")
return ("==", $word, "", cts:stem("disparu","fr"))

Cheers,
Geert

From: 
<general-boun...@developer.marklogic.com<mailto:general-boun...@developer.marklogic.com>>
 on behalf of "Debin, Infant Jerald (LNG-CON)" 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>
Reply-To: MarkLogic Developer Discussion 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Date: Thursday, March 31, 2016 at 12:00 PM
To: "general@developer.marklogic.com<mailto:general@developer.marklogic.com>" 
<general@developer.marklogic.com<mailto:general@developer.marklogic.com>>
Subject: [MarkLogic Dev General] French stemmed word is not getting recognized 
and highlighted

Hi Team,

For the term French term “disparu” corresponding French stemmed word  
“disparaître” is not getting recognized when performing search.

Example:

Query:

let $text:= avec la rupture de septembre 1997, cette 
disparues situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une rémunération,
return
cts:highlight($text,cts:query(
disparu
case-insensitive
diacritic-insensitive
punctuation-insensitive
  ),{$cts:text})

Result:

Disparaître is not getting recognized and highlighted as below,


avec la rupture de septembre 1997, cette disparues 
situation fait disparaître la justification. Les services fournis 
disparu par la demanderesse l'ont été dans l'attente d'une 
rémunération,

Below is the result of cts:stem,

cts:stem("disparu","fr")

disparu
disparaître

Please let us know on this issue.

Thanks and Regards,

Debin
Mob: +91-9789826001

___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general