[ 
https://issues.apache.org/jira/browse/NUTCH-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276667#comment-17276667
 ] 

Hudson commented on NUTCH-1403:
-------------------------------

SUCCESS: Integrated in Jenkins build Nutch ยป Nutch-trunk #24 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/24/])
fix for NUTCH-1403 contributed by aalbahem (ameer.albahem: 
[https://github.com/apache/nutch/commit/598bbc40a3d3438233813b607cb031a6bb0a2f84])
* (add) src/plugin/scoring-metadata/pom.xml
* (add) src/plugin/scoring-metadata/plugin.xml
* (add) 
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/MetadataScoringFilterTest.java
* (add) src/plugin/scoring-metadata/build.xml
* (add) 
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/package.html
* (add) 
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/MetadataScoringFilter.java
* (add) src/plugin/scoring-metadata/ivy.xml
* (edit) build.xml
* (edit) src/plugin/build.xml
Improve fix for NUTCH-1403 (ameer.albahem: 
[https://github.com/apache/nutch/commit/cdb6b52b02958385497804ef7cd6a6b646616208])
* (edit) default.properties
* (delete) src/plugin/scoring-metadata/pom.xml
* (edit) conf/nutch-default.xml
* (delete) 
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/MetadataScoringFilterTest.java
* (edit) 
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/package.html
* (add) 
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/TestMetadataScoringFilter.java
* (edit) 
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/MetadataScoringFilter.java
Improve NUTCH-1403, add ASLv2 header (ameer.albahem: 
[https://github.com/apache/nutch/commit/93aa2ab41097511f3afe8d34c9c13cafd735cec9])
* (edit) 
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/TestMetadataScoringFilter.java
* (edit) 
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/package.html


> Add default ScoringFilter for manipulating metadata 
> ----------------------------------------------------
>
>                 Key: NUTCH-1403
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1403
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Julien Nioche
>            Priority: Major
>             Fix For: 1.19
>
>
> This is currently done by the urlmeta plugin, which has too vague a name and 
> a redundant indexing filter now that we have the index-metadata plugin. This 
> scoring filter would help defining which metadata to pass from : 
> - the crawl metadata to the content metadata
> - the content metadata to the parse metadata
> - the parse metadata to the crawldatum for the outlinks
> I'd make this scoring filter available by default i.e. not in a separate 
> plugin as its functionalities are commonly used.   



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to