[ https://issues.apache.org/jira/browse/NUTCH-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14328194#comment-14328194 ]
Tyler Palsulich commented on NUTCH-1925: ---------------------------------------- Thanks [~wastl-nagel]. Looking into it more, org.apache.nutch.parse.tika.TikaConfig was deleted on the 1.x branch in NUTCH-1234 (see [this commit|https://github.com/apache/nutch/commit/7f44cdc998117eacc04609008fdac4ce1e2bb387#diff-a883bfa38ab4c09e2ee777564297367e]) in favor of org.apache.tika.config.TikaConfig. But, the same change was never done on the 2.x branch. I can supply a patch that does it, but it will require some API changes. That should fix the discrepancy we're seeing between 1.x and 2.x in this issue. Thoughts? > Upgrade Tika to version 1.7 > --------------------------- > > Key: NUTCH-1925 > URL: https://issues.apache.org/jira/browse/NUTCH-1925 > Project: Nutch > Issue Type: Improvement > Components: build > Reporter: Tyler Palsulich > Assignee: Markus Jelsma > Priority: Blocker > Fix For: 1.10, 2.3.1 > > Attachments: NUTCH-1925-2x.patch, NUTCH-1925.palsulich.p2.patch, > NUTCH-1925.palsulich.patch, NUTCH-1925.palsulich.v2.patch > > > Hi Folks. Nutch currently uses version 1.6 of Tika. There were no significant > API changes between 1.6 and 1.7. So, this should be a one line update. -- This message was sent by Atlassian JIRA (v6.3.4#6332)