[ https://issues.apache.org/jira/browse/NUTCH-1890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215231#comment-14215231 ]
ASF GitHub Bot commented on NUTCH-1890: --------------------------------------- GitHub user chrismattmann opened a pull request: https://github.com/apache/nutch/pull/5 Fix for NUTCH-1890 You can merge this pull request into a Git repository by running: $ git pull https://github.com/chrismattmann/nutch trunk Alternatively you can review and apply these changes as the patch at: https://github.com/apache/nutch/pull/5.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #5 ---- commit 70047ee4b0b09b2bbf344a3b96f8ab043c98678f Author: Chris Mattmann <mattm...@apache.org> Date: 2014-11-17T21:35:15Z Fix for NUTCH-1890: add copyfields and default text field catch all to schema.xml ---- > Major Typo in Documentation > --------------------------- > > Key: NUTCH-1890 > URL: https://issues.apache.org/jira/browse/NUTCH-1890 > Project: Nutch > Issue Type: Bug > Components: documentation > Affects Versions: 1.9 > Environment: web url: https://wiki.apache.org/nutch/NutchTutorial > Reporter: Boadu Akoto Charles Jnr > Assignee: Chris A. Mattmann > Labels: bug, docuentation, ommission > Original Estimate: 1h > Remaining Estimate: 1h > > Problematic Page: https://wiki.apache.org/nutch/NutchTutorial > 1. Duplicated Text > In section "6. Integrate Solr with Nutch" the following line is asked to be > commented from: > <!-- <filter class="solr. > EnglishPorterFilterFactory" protected="protwords.txt"/> --> > to > <!-- <filter class="solr. > EnglishPorterFilterFactory" protected="protwords.txt"/> --> > but I think it should rather read from: > <filter class="solr. > EnglishPorterFilterFactory" protected="protwords.txt"/> > to > <!-- <filter class="solr. > EnglishPorterFilterFactory" protected="protwords.txt"/> --> > 2. Addition of extra step > After going through the recommended steps in Section 6 to integrate with > solr, I got an error. The error read 'field text not defined'. This error is > so because apparently in my solrconfig.xml, I had defined 'text' as my > default field but it was not defined the schema.xml that I imported from the > nutch conf folder. > I propose that either the schema.xml in the nutch conf folder be shipped with > the 'text' field already defined or an extra step be added to Section 6 that > reads: > Add the following line under the definition of 'content' field: > <field name="text" type="text" stored="true" indexed="true"/> > or better till steps be added to allow the user to change the default field > in solrconfig.xml from 'text' to 'content' whichever solution seems the most > appropriate. -- This message was sent by Atlassian JIRA (v6.3.4#6332)