-christian created NUTCH-1607:
---------------------------------

             Summary: Make improper multiValued field configurable
                 Key: NUTCH-1607
                 URL: https://issues.apache.org/jira/browse/NUTCH-1607
             Project: Nutch
          Issue Type: Bug
          Components: indexer
    Affects Versions: 1.7
            Reporter: -christian


After upgrading to 1.7, some errors came up when committing to Solr:

Nutch is indexing a specific URL where it parses a field that, according to 
the schema.xml for Solr, is a single-valued field, e.g. description.

For that particular site the description is declared twice, which is beyond 
our control, and this leads to an error when committing those documents to 
Solr. Is there any workaround for that, e.g. telling Nutch to take only the 
first occurrence and ignore the rest? 
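
To illustrate the behaviour I am asking for, here is a minimal sketch in plain 
Java. It does not use any Nutch or Solr classes: the document is modelled as a 
simple map from field name to a list of values, and the class name 
FirstValueOnly, the method keepFirst and the set of single-valued field names 
are all hypothetical, chosen only for this example. In Nutch itself such logic 
would presumably sit somewhere in the indexing step, but that is an assumption 
on my part.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, not part of Nutch: keeps only the first value of
// fields that the Solr schema declares as single-valued.
final class FirstValueOnly {

    // Returns a copy of doc in which every field named in singleValued
    // keeps only its first value; all other fields are copied unchanged.
    static Map<String, List<String>> keepFirst(Map<String, List<String>> doc,
                                               Set<String> singleValued) {
        Map<String, List<String>> out = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> e : doc.entrySet()) {
            List<String> values = e.getValue();
            if (singleValued.contains(e.getKey()) && values.size() > 1) {
                // Take the first occurrence and ignore the rest.
                out.put(e.getKey(), new ArrayList<>(values.subList(0, 1)));
            } else {
                out.put(e.getKey(), new ArrayList<>(values));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, List<String>> doc = new LinkedHashMap<>();
        // The page declares its description twice.
        doc.put("description", List.of("first description", "second description"));
        // A genuinely multi-valued field that must stay untouched.
        doc.put("tag", List.of("a", "b"));
        System.out.println(keepFirst(doc, Set.of("description")));
    }
}
```

Making the set of affected field names configurable, rather than hard-coding 
it, is what the summary of this issue refers to.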

Maybe this is a regression from 
https://issues.apache.org/jira/browse/NUTCH-1560

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
