Lance Norskog created SOLR-4007:
-----------------------------------

             Summary: Morfologik dictionaries not available in Solr field type
                 Key: SOLR-4007
                 URL: https://issues.apache.org/jira/browse/SOLR-4007
             Project: Solr
          Issue Type: Bug
          Components: Schema and Analysis
    Affects Versions: 4.1
            Reporter: Lance Norskog
            Priority: Minor


The Polish Morfologik type does not find its dictionaries when used in Solr. To 
demonstrate:

1) Add this to example/solr/collection1/conf/schema.xml:
{noformat}
    <!-- Polish -->
    <fieldType name="text_pl" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.StandardTokenizerFactory"/>
        <filter class="solr.MorfologikFilterFactory" dictionary="MORFOLOGIK" />
      </analyzer>
    </fieldType>
{noformat}

2) Add this to example/solr/collection1/conf/solrconfig.xml:

{noformat}
  <lib dir="../../../../lucene/build/analysis/morfologik/" regex=".*\.jar" />
  <lib dir="../../../contrib/analysis-extras/lib" regex=".*\.jar" />
  <lib dir="../../../dist/" regex="apache-solr-analysis-extras-\d.*\.jar" />
{noformat}

3) Test 'text_pl' in the analysis page. You will get an exception.
{noformat}
Oct 28, 2012 8:27:19 PM org.apache.solr.core.SolrCore execute
INFO: [collection1] webapp=/solr path=/analysis/field 
params={analysis.showmatch=true&analysis.query=&wt=json&analysis.fieldvalue=blah+blah&analysis.fieldtype=text_pl}
 status=500 QTime=26 
Oct 28, 2012 8:27:19 PM org.apache.solr.common.SolrException log
SEVERE: null:java.lang.RuntimeException: Default dictionary resource for 
language 'plnot found.
        at morfologik.stemming.Dictionary.getForLanguage(Dictionary.java:163)
        at morfologik.stemming.PolishStemmer.<init>(PolishStemmer.java:64)
        at 
org.apache.lucene.analysis.morfologik.MorfologikFilter.<init>(MorfologikFilter.java:70)
        at 
org.apache.lucene.analysis.morfologik.MorfologikFilterFactory.create(MorfologikFilterFactory.java:63)
        at 
org.apache.solr.handler.AnalysisRequestHandlerBase.analyzeValue(AnalysisRequestHandlerBase.java:125)
        at 
org.apache.solr.handler.FieldAnalysisRequestHandler.analyzeValues(FieldAnalysisRequestHandler.java:220)
        at 
org.apache.solr.handler.FieldAnalysisRequestHandler.handleAnalysisRequest(FieldAnalysisRequestHandler.java:181)
        at 
org.apache.solr.handler.FieldAnalysisRequestHandler.doAnalysis(FieldAnalysisRequestHandler.java:100)
        at 

[...........]

Caused by: java.io.IOException: Could not locate resource: 
morfologik/dictionaries/pl.dict
        at morfologik.util.ResourceUtils.openInputStream(ResourceUtils.java:56)
        at morfologik.stemming.Dictionary.getForLanguage(Dictionary.java:156)
        ... 38 more

{noformat}

{{morfologik-polish-1.5.3.jar}} has {{morfologik/dictionaries/pl.dict}}.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to