[ 
https://issues.apache.org/jira/browse/NUTCH-1005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13197876#comment-13197876
 ] 

Julien Nioche commented on NUTCH-1005:
--------------------------------------

yep, you've corrected the typo yourself.

bq. That still works well with the indexfilter when testing via indexchecker.

That's because of the way you generate the field in your indexer, i.e. 
parse.getData().getMeta(heading), which means that it gets the value from 
either the parse or the content metadata. I was not saying that your code did 
not work, just that it would be conceptually more correct to put it in the 
parse metadata, because it is obtained during the parse. My other point was 
that it would be better to use the generic indexer from NUTCH-1264. Could you 
please give it a try?
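
A minimal sketch of the lookup order described above, assuming getMeta checks 
the parse metadata first and falls back to the content metadata. The class 
MetaLookupSketch, the nested ParseDataSketch, and the "h1" key are hypothetical 
stand-ins built on plain maps. They are not Nutch's actual classes.

```java
import java.util.HashMap;
import java.util.Map;

public class MetaLookupSketch {

    // Hypothetical stand-in for a parse result that carries two metadata sets.
    static final class ParseDataSketch {
        final Map<String, String> parseMeta = new HashMap<>();
        final Map<String, String> contentMeta = new HashMap<>();

        // Assumed lookup order: parse metadata first, then content metadata.
        String getMeta(String name) {
            String value = parseMeta.get(name);
            return value != null ? value : contentMeta.get(name);
        }
    }

    public static void main(String[] args) {
        ParseDataSketch data = new ParseDataSketch();

        // A heading stored in the content metadata is still found by getMeta,
        // which is why an indexing filter would keep working either way.
        data.contentMeta.put("h1", "Stored in content metadata");
        System.out.println(data.getMeta("h1"));

        // Once the same key exists in the parse metadata, that value wins.
        data.parseMeta.put("h1", "Stored in parse metadata");
        System.out.println(data.getMeta("h1"));
    }
}
```

Under this assumption the indexing side reads the same value regardless of 
where the parse filter stored it, so moving the field to the parse metadata 
would be a conceptual cleanup, not a behavioural change.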
                
> Index headings plugin
> ---------------------
>
>                 Key: NUTCH-1005
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1005
>             Project: Nutch
>          Issue Type: New Feature
>          Components: indexer, parser
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>             Fix For: 1.5
>
>         Attachments: HeadingsIndexingFilter.java, HeadingsParseFilter.java, 
> NUTCH-1005-1.4-2.patch, NUTCH-1005-1.4-3.patch, NUTCH-1005-1.5-4.patch
>
>
> Very simple plugin for extracting and indexing a comma separated list of 
> headings via the headings configuration directive.


        
