Re: [MarkLogic Dev General] Wildcard search in marklogic

2016-08-24 Thread Mary Holstege

Three things:
(1) The trailing wildcard index contains exactly entries for words that  
begin with certain 3+ character sequences, so "murd*" would be a single  
term lookup that will match murd, murdy, murds, etc.
(2) The three character wildcard index is actually capable of  
distinguishing a 3 character sequence at the start of a word from one at  
the end or in the middle of the word and query resolution will take care  
of this.
(3) Wildcard query formulation is complex and subject to a lot of  
heuristics. Turning on a codepoint word lexicon can be useful as well,  
particularly for complex wildcard patterns. Positions will matter for  
phrases and wildcards within element or proximity query contexts.  
xdmp:plan is a good tool for telling you exactly how a particular wildcard  
(or any other) query will be resolved.

//Mary


On Wed, 24 Aug 2016 06:38:50 -0700, Debin, Infant Jerald (LNG-CON)  
 wrote:

> Hi Team,
>
> I am analyzing on the wildcard character search supported by ML.  
> Currently ML supports '*'- zero or more non space character match and  
> '?' - exactly one non space character match for wild character search.
>
> Based on the character index setting, ie. Three character, Two Character  
> and One Character the term with the wildcard character will be searched  
> accordingly.
>
> For Ex: I have set for Three character search and I perform search for  
> murd*. The index will be set as mur* and urd* and I will get results  
> with words that start with murd and urd as well.
>
> But I want to do a wildcard search which takes zero or one non space  
> character match
>
> For Ex: If I give murd*, it should match for murd or murd(y), or murd(s)  
> etc.
>
> Please let know if there is any option in ML to get the above mentioned  
> results.
>
> Thanks.
>


-- 
Using Opera's revolutionary email client: http://www.opera.com/mail/
___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] Wildcard search in marklogic

2016-08-24 Thread Debin, Infant Jerald (LNG-CON)
Hi,

For murd?, if Three character search is enabled in ML It will again get indexed 
as mur? And urd? Isn’t it?   So it will be treated as (murd or mur? or urd? ). 
Let me know if my understanding is correct.

What I actually need I don’t want it to get indexed based on characters but I 
still need to search by making use of wild character.

And what I expect if I give murd? It should return results with only murd(one 
nonspace character).

Let me know if this is feasible in ML.

Thanks,
Debin

From: general-boun...@developer.marklogic.com 
[mailto:general-boun...@developer.marklogic.com] On Behalf Of Andreas Hubmer
Sent: Wednesday, August 24, 2016 9:45 AM
To: MarkLogic Developer Discussion
Cc: DEVASIGAMANI, EDWIN JOHN (LNG-CON)
Subject: Re: [MarkLogic Dev General] Wildcard search in marklogic

Hi,

You could simply do an OR search for "murd" and "murd?".

Andreas

2016-08-24 15:38 GMT+02:00 Debin, Infant Jerald (LNG-CON) 
<infantjerald.de...@lexisnexis.com<mailto:infantjerald.de...@lexisnexis.com>>:
Hi Team,

I am analyzing on the wildcard character search supported by ML. Currently ML 
supports ‘*’- zero or more non space character match and ‘?’ - exactly one non 
space character match for wild character search.

Based on the character index setting, ie. Three character, Two Character and 
One Character the term with the wildcard character will be searched accordingly.

For Ex: I have set for Three character search and I perform search for murd*. 
The index will be set as mur* and urd* and I will get results with words that 
start with murd and urd as well.

But I want to do a wildcard search which takes zero or one non space character 
match

For Ex: If I give murd*, it should match for murd or murd(y), or murd(s) etc.

Please let know if there is any option in ML to get the above mentioned results.

Thanks.


___
General mailing list
General@developer.marklogic.com<mailto:General@developer.marklogic.com>
Manage your subscription at:
http://developer.marklogic.com/mailman/listinfo/general



--
Andreas Hubmer
Senior IT Consultant

___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general


Re: [MarkLogic Dev General] Wildcard search in marklogic

2016-08-24 Thread Andreas Hubmer
Hi,

You could simply do an OR search for "murd" and "murd?".

Andreas

2016-08-24 15:38 GMT+02:00 Debin, Infant Jerald (LNG-CON) <
infantjerald.de...@lexisnexis.com>:

> Hi Team,
>
>
>
> I am analyzing on the wildcard character search supported by ML. Currently
> ML supports ‘*’- zero or more non space character match and ‘?’ - exactly
> one non space character match for wild character search.
>
>
>
> Based on the character index setting, ie. Three character, Two Character
> and One Character the term with the wildcard character will be searched
> accordingly.
>
>
>
> For Ex: I have set for Three character search and I perform search for
> murd*. The index will be set as mur* and urd* and I will get results with
> words that start with murd and urd as well.
>
>
>
> But I want to do a wildcard search which takes zero or one non space
> character match
>
>
>
> For Ex: If I give murd*, it should match for murd or murd(y), or murd(s)
> etc.
>
>
>
> Please let know if there is any option in ML to get the above mentioned
> results.
>
>
>
> Thanks.
>
>
>
> ___
> General mailing list
> General@developer.marklogic.com
> Manage your subscription at:
> http://developer.marklogic.com/mailman/listinfo/general
>
>


-- 
Andreas Hubmer
Senior IT Consultant
___
General mailing list
General@developer.marklogic.com
Manage your subscription at: 
http://developer.marklogic.com/mailman/listinfo/general