[ https://issues.apache.org/jira/browse/NUTCH-760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12766213#action_12766213 ]
Andrzej Bialecki commented on NUTCH-760: ----------------------------------------- Thanks David, this is a good start. We also need to address the searching part, i.e. SolrSearchBean, where Nutch hardcodes the same field names. > Allow field mapping from nutch to solr index > -------------------------------------------- > > Key: NUTCH-760 > URL: https://issues.apache.org/jira/browse/NUTCH-760 > Project: Nutch > Issue Type: Improvement > Components: indexer > Reporter: David Stuart > Attachments: solrindex_schema.patch, solrindex_schema.patch > > > I am using nutch to crawl sites and have combined it > with solr pushing the nutch index using the solrindex command. I have > set it up as specified on the wiki using the copyField url to id in the > schema. Whilst this works fine it is stuff's up my inputs from other > sources in solr (e.g. using the solr data import handler) as they have > both id's and url's. I have patch that implements a nutch xml schema > defining what basic nutch fields map to in your solr push. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.