The upgrade did not correct the problem.

 

  _____  

From: general-boun...@developer.marklogic.com
[mailto:general-boun...@developer.marklogic.com] On Behalf Of Tim Meagher
Sent: Tuesday, June 23, 2009 11:16 AM
To: 'General Mark Logic Developer Discussion'
Subject: [MarkLogic Dev General] RE: Can someone explain what the
wildcardsearch problem was that was fixed in 4.0.5?

 

Hi Doug, et al,

 

Somehow I missed your post regarding this subject until a colleague pointed
it out to me, so I recovered your post from markmail and included it below.

 

Yes, I do realize that the two search strings are mutually exclusive. I
should have made that clearer in my post.

 

I tried the query with stemming enabled and disabled, but that does not fix
the wildcard problem.  The main problem is that supplying a wildcarded query
of "Personality*" returns results that do not include any form of the word
"Personality".  For example, I get "Abstract" as one of the results.  I
expect to get only one result, that being "Personality".  I've noticed this
anomaly before and kind of chalked it up to problems related to an
unfiltered search, but even so I haven't fully understood why this is
happening.

 

Case-insensitive searches work fine and changing the request to all lower
case does not solve the wildcard problem.

 

The support staff at MarkLogic opened up a support case for this issue and
recommended that I upgrade form 4.0.4 to 4.0.5 as a known wildcard bug was
found in 4.0.3, so I'm curious to see the results after the upgrade today.

 

Thank you!

 

Tim Meagher

 

 

Tim,

 

The particular issue that was being discussed in that thread involved a

bug only in cts:element-attribute-value-queries with wildcards, so I

don't think it applies to your situation.

 

A couple of notes on your situation:

 

*           

 

            Just to make sure, you realize that your two search strings are

mutually exclusive, right?  The search for "Personality*" will return

only results where the Title is a single word that starts with

"Personality".  The search for "Personality *" will return only results

where the Title is multiple words, the first of which is "Personality".

To match both of those, you would need to search for "Personality* *".

*           

 

            I'm not sure (maybe there's somebody else on the list who has a

better understanding and could explain it) how using a trailing wildcard

and stemming in the same search acts.  I'd assume you should also be

getting results for "Personalities*", and maybe "Personalize*" or

similar words (don't know how much variation the stemming engine

allows).

*           

 

            Even though you explicitly specified case-insensitive, have you

tried putting the search string in all lower-case?  Just an idea to try

to narrow down the scope.

 

Otherwise, could you perhaps include some examples of results that are

being included that shouldn't be?

 

Doug Glidden 

 

 

  _____  

From: Tim Meagher [mailto:t...@aaom.net] 
Sent: Thursday, June 18, 2009 11:14 AM
To: 'General Mark Logic Developer Discussion'
Subject: Can someone explain what the wildcard search problem was that was
fixed in 4.0.5?

 

Hi Folks,

 

I noticed a comment by Danny Sokolsky at one point.  I'm running into some
interesting results performing a wildcarded lexicon search using
cts:element-values() with a cts:element-value-query() subsearch:

cts:element-values(xs:QName("Title"), (),

  ("item-frequency", "item-order", "descending"),

  cts:and-query((

    cts:element-value-query(xs:QName("Title"), string("Personality*"),

      ("case-insensitive", "diacritic-insensitive",
"punctuation-insensitive",

        "stemmed", "wildcarded")),

    cts:directory-query("/Root/SubDir/", "infinity") 

  ))

)

 

Some of the results do not contain "Personality".  If I add a space before
the asterisk in the search ("Personality *"), then the results only include
titles containing "Personality".

 

Note that I have set my fragment root so that there should be only one
occurrence of the element "Title" within a fragment.

 

Thank you!

 

Tim Meagher - AAOM Consulting

 

_______________________________________________
General mailing list
General@developer.marklogic.com
http://xqzone.com/mailman/listinfo/general

Reply via email to