[ https://issues.apache.org/jira/browse/NUTCH-2130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel resolved NUTCH-2130. ------------------------------------ Resolution: Fixed Committed to 2.x, r1714655. Thanks! > copyField rawcontent creates error within schema.xml > ---------------------------------------------------- > > Key: NUTCH-2130 > URL: https://issues.apache.org/jira/browse/NUTCH-2130 > Project: Nutch > Issue Type: Bug > Affects Versions: 2.3.1 > Reporter: Lewis John McGibbney > Assignee: Lewis John McGibbney > Fix For: 2.3.1 > > Attachments: NUTCH-2130.patch > > > The presence of the rawcontent copyField within the Nutch Solr schema.xml is > creating problems for users when attempting to index NutchDocuments into Solr. > The rawcontent field is produced by the > [index-html|https://github.com/apache/nutch/tree/2.x/src/plugin/index-html] > plugin however in committing this feature we have forgotten to add the field > definition to schema.xml before applying the copyField instruction. > There are two ways to resolve this > * remove rawcontent from copyField, or > * add rawcontent as a field prior to it's copyFields defintiion. > I propose to do the latter and will submit a patch ASAP unless someone else > is able to do so. > > This was explained on [this > thread|http://www.mail-archive.com/user%40nutch.apache.org/msg13885.html] -- This message was sent by Atlassian JIRA (v6.3.4#6332)