(Sorry for cross-posting; I think this is important information to disseminate.)

Executive Summary: you probably need to increase your query slop. A lot.


We recently had a feedback ticket that a title search with a hyphen wasn't working properly. This is especially curious because we solved a bunch of problems with hyphen searching AND WROTE TESTS in the process, and all the existing hyphen tests pass. Tests like "hyphens with no spaces before or after, 3 significant terms, 2 stopwords" pass.

Our metadata contains:
record A with title:   Red-rose chain.
record B with title:   Prisoner in a red-rose chain.

A title search:  prisoner in a red-rose chain  returns no results

Further exploration (the following are all title searches):
red-rose chain  ==>  record A only
"red rose" chain ==>  record A only
"red rose chain" ==> record A only
"red-rose chain" ==> record A only
red rose chain ==>  records A and B
red "rose chain" ==>  records A and B  (!!)
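
To make the slop idea concrete, here is a toy sketch of sloppy phrase matching over token positions. It is a simplified model, not Lucene's actual scorer, and the positions are hypothetical: they assume the index keeps position gaps where the stopwords "in" and "a" were removed while the query side does not. They are not the real output of our analysis chain.

```python
from itertools import product

def phrase_matches(doc_positions, query_terms, slop):
    """Simplified sloppy phrase match (a model, not Lucene's implementation).

    doc_positions: dict mapping term -> list of positions in the indexed field
    query_terms:   list of (term, position_in_query) pairs
    slop:          how far relative positions may drift and still match
    """
    candidates = []
    for term, _ in query_terms:
        if term not in doc_positions:
            return False  # a phrase term is missing from the document
        candidates.append(doc_positions[term])

    # Try every combination of document positions for the query terms.
    for combo in product(*candidates):
        offsets = [doc_pos - q_pos
                   for doc_pos, (_, q_pos) in zip(combo, query_terms)]
        # With slop 0 every term must sit at the same relative offset.
        if max(offsets) - min(offsets) <= slop:
            return True
    return False

# Hypothetical positions for "Prisoner in a red-rose chain":
# the index leaves gaps at positions 1 and 2 for the removed stopwords.
indexed = {"prisoner": [0], "red": [3], "rose": [4], "chain": [5]}
# Hypothetical query-side positions with no gaps for the stopwords.
query = [("prisoner", 0), ("red", 1), ("rose", 2), ("chain", 3)]

print(phrase_matches(indexed, query, slop=0))  # False
print(phrase_matches(indexed, query, slop=2))  # True
```

In this toy case the two-position gap left by the stopwords is exactly what a slop of 0 cannot absorb.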

For more details and more about the solution, see  
http://discovery-grindstone.blogspot.com/2010/11/solr-and-hyphenated-words.html
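
As a rough illustration of what "increase your query slop" can look like in a request, here is a sketch that assumes a dismax request handler, where the qs parameter sets query phrase slop. The host, path, field name, and slop value are all placeholders, not our production settings.

```python
from urllib.parse import urlencode

# All values below are illustrative assumptions, not real settings.
params = {
    "defType": "dismax",                  # assumes the dismax query parser
    "q": "prisoner in a red-rose chain",
    "qf": "title",                        # hypothetical field name
    "qs": 3,                              # query phrase slop; tune for your data
}

# Hypothetical Solr location; adjust to your own installation.
url = "http://localhost:8983/solr/select?" + urlencode(params)
print(url)
```

The same parameter can instead be set as a default in the request handler configuration so that every query picks it up.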

- Naomi Dushay
Senior Developer
Stanford University Libraries
