[ https://issues.apache.org/jira/browse/NUTCH-956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alexis updated NUTCH-956:
-------------------------

    Attachment: solr.patch

Here are the changes:
- Avoid multiple values for the id field. (NUTCH-819)
- Allow multiple values for the tag field. Add a tld (Top Level Domain) field.
- Get the content type from the WebPage object's member. Otherwise, you will see NullPointerExceptions.
- Compare strings with equals. That's pretty random, but it avoids some surprises.

> solrindex issues
> ----------------
>
>                 Key: NUTCH-956
>                 URL: https://issues.apache.org/jira/browse/NUTCH-956
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 2.0
>            Reporter: Alexis
>         Attachments: solr.patch
>
>
> I ran into a few caveats with the solrindex command while trying to index documents.
> Please refer to
> http://techvineyard.blogspot.com/2010/12/build-nutch-20.html#solrindex, which
> describes my tests.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
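
For readers wondering about the string-comparison item in the change list, here is a minimal Java sketch of the kind of surprise it presumably refers to: two strings with the same characters can still be different objects, so a reference comparison may fail where a value comparison succeeds. The class and method names are hypothetical and are not taken from solr.patch or from the Nutch code base.

```java
import java.util.Objects;

// Hypothetical illustration only; none of these names come from solr.patch.
public class StringCompareSketch {

    // Compares object identity: true only if both variables point to the same object.
    static boolean sameByReference(String a, String b) {
        return a == b;
    }

    // Compares character content and tolerates null on either side.
    static boolean sameByValue(String a, String b) {
        return Objects.equals(a, b);
    }

    public static void main(String[] args) {
        String literal = "text/html";
        // new String(...) always creates a distinct object with the same characters.
        String built = new String("text/html");

        System.out.println(sameByReference(literal, built)); // prints "false"
        System.out.println(sameByValue(literal, built));     // prints "true"
    }
}
```

Strings read from a data store or parsed from a response are typically separate objects, like `built` above, so comparing them by value is the safer default.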