[ 
https://issues.apache.org/jira/browse/NUTCH-2048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14639462#comment-14639462
 ] 

Michael Joyce commented on NUTCH-2048:
--------------------------------------

Alright, hopefully this one is a bit more on track =D

As for plugin dependencies docs, here's a huge +1 from me. I don't know that I'm 
sufficiently versed in the build to write them myself, but it would be great 
to get them up on the wiki.

As for Tika upgrades, there's actually a "how to" in the parse-tika folder. I 
went through that and ended up with the current patch, which seems to have 
addressed the duplicate dependency issues. Given the instructions, I'm not 
really certain how we ended up with the duplicates in the first place, though. 
Maybe the doc is a recent addition.

{code}
1. Upgrade Tika dependency in trunk/ivy/ivy.xml

2. Upgrade Tika dependency in src/plugin/parse-tika/ivy.xml

3. Upgrade Tika's own dependencies in src/plugin/parse-tika/plugin.xml
   To get the list of dependencies and their versions execute:
   $ ant -f ./build-ivy.xml
   $ ls lib | sed 's/^/      <library name="/g' | sed 's/$/"\/>/g'
{code}
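
A rough way to double-check step 3 for leftover duplicates might look like the sketch below. It is not part of the how-to: the function name find_duplicate_libs is made up for illustration, and it assumes every entry has the form <library name="ARTIFACT-VERSION.jar"/> with a version that starts with a digit.

```shell
#!/bin/sh
# Hypothetical helper (not part of Nutch): print each artifact that is
# listed with more than one version in a plugin.xml-style file.
# Assumption: jar names look like ARTIFACT-VERSION.jar and the version
# begins with a digit, e.g. tika-parsers-1.8.jar.
find_duplicate_libs() {
  # $1: path to the file containing <library name="..."/> entries
  sed -n 's/.*<library name="\([^"]*\)\.jar".*/\1/p' "$1" |
    sort -u |                 # collapse exact repeats of the same jar
    sed 's/-[0-9].*$//' |     # drop the version, keep the artifact name
    sort |
    uniq -d                   # keep artifacts that still occur twice or more
}
```

Running find_duplicate_libs src/plugin/parse-tika/plugin.xml should print nothing once the list is clean. An artifact name in the output means at least two versions of that jar are still listed.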

> parse-tika: fix dependencies in plugin.xml
> ------------------------------------------
>
>                 Key: NUTCH-2048
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2048
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.10
>            Reporter: Sebastian Nagel
>            Priority: Trivial
>             Fix For: 1.11
>
>         Attachments: NUTCH-2048_Joyce_20150723.patch, 
> NUTCH-2048_Joyce_20150723_2.patch
>
>
> Duplicate library dependencies listed in parse-tika's plugin.xml should be 
> cleaned up. There are duplicates where only the version differs, e.g.:
> {noformat}
> tika-parsers-1.7.jar
> tika-parsers-1.8.jar
> {noformat}
> Not critical, because libs which are not present should just be ignored.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
