Hi all

sorry for the long post

We are switching from indexdata's zebra to solr for a new book
archival/preservation project with multiple languages, so expect more
questions soon (sorry for that)
The features of solr are pretty cool and more or less overwhelming!

But there is one thing I found after a little test with wildcards.

I'm using the latest svn build and didn't change anything except the
schema.xml
Solr Specification Version: 1.3.0.2009.08.20.07.53.52
Solr Implementation Version: 1.4-dev 806060 - ait015 - 2009-08-20 07:53:52
Lucene Specification Version: 2.9-dev
Lucene Implementation Version: 2.9-dev 804692 - 2009-08-16 09:33:41

I have a text_ws field with this schema config:

<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
   <analyzer>
      <charFilter class="solr.MappingCharFilterFactory"
mapping="mapping-ISOLatin1Accent.txt"/>
      <filter class="solr.LowerCaseFilterFactory"/>
      <tokenizer class="solr.WhitespaceTokenizerFactory"/>
   </analyzer>
</fieldType>
...
and I added a dynamic field for everything since I'm not sure what field
we will use...

<dynamicField name="*"  type="text_ws"    indexed="true"  stored="true"
multiValued="true"/>
...


So I <add>ed this content:
...
<field name="PhysicalDescription">
   X, 143, XIV S.:
   124 feine Farbendrucktafeln mit über 600 Abbildungen;
   24,5 cm.
</field>
...

since it's German, and I couldn't find a tokenizer for German compound
words (any help appreciated) I wanted to search for 'Farb*'

The final row of the query analyzer in the admin section told me:
farb*
for the content:
x,      143,    xiv     s.:     124     feine   farbendrucktafeln       mit     
uber    600     abbildungen;
24,5    cm.

so everything seems to be ok, everything in lower case

Now, for the rest service:
http://localhost:8983/solr/select/?q=PhysicalDescription:Farb*&debugQuery=true
<str name="rawquerystring">PhysicalDescription:Farb*</str>
<str name="querystring">PhysicalDescription:Farb*</str>
<str name="parsedquery">PhysicalDescription:Farb*</str>
<str name="parsedquery_toString">PhysicalDescription:Farb*</str>

Since Farb* has a capital letter, nothing is found.
When using farb* as query, I get the result.

Where can I add/change a query anaylizer that "lower cases" wildcard
searches?

thanks, best wishes,
Alexander

Reply via email to