[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2019-01-04 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16734213#comment-16734213
 ] 

Hudson commented on NUTCH-2667:
---

SUCCESS: Integrated in Jenkins build Nutch-nutchgora #1628 (See 
[https://builds.apache.org/job/Nutch-nutchgora/1628/])
NUTCH-2667 Update Tika and Commons Collections 4 - explicitly add (snagel: 
[https://github.com/apache/nutch/commit/90dede99e8bed4dbbad9907fe9330212bf216fd4])
* (edit) src/plugin/parse-tika/ivy.xml
* (edit) src/plugin/parse-tika/plugin.xml
* (edit) ivy/ivy.xml


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2019-01-04 Thread Sebastian Nagel (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16734177#comment-16734177
 ] 

Sebastian Nagel commented on NUTCH-2667:


The test failures in the nightly Nutch builds
{noformat}
Testcase: testIt took 1.526 sec
Caused an ERROR
org.apache.commons.compress.archivers.ArchiveStreamFactory.detect(Ljava/io/InputStream;)Ljava/lang/String;
java.lang.NoSuchMethodError: 
org.apache.commons.compress.archivers.ArchiveStreamFactory.detect(Ljava/io/InputStream;)Ljava/lang/String;
at 
org.apache.tika.parser.pkg.ZipContainerDetector.detectArchiveFormat(ZipContainerDetector.java:160)
at 
org.apache.tika.parser.pkg.ZipContainerDetector.detect(ZipContainerDetector.java:104)
at 
org.apache.tika.detect.CompositeDetector.detect(CompositeDetector.java:84)
at org.apache.tika.Tika.detect(Tika.java:156)
at org.apache.tika.Tika.detect(Tika.java:288)
at org.apache.nutch.util.MimeUtil.getMimeType(MimeUtil.java:272)
at 
org.apache.nutch.parse.tika.TestImageMetadata.testIt(TestImageMetadata.java:67)
{noformat}
are caused by a library conflict: Nutch core includes as a transitive 
dependency commons-compress 1.14.1 while Tika requires 1.18. This applies also 
to tika-core (used for MIME type detection), not only to parse-tika which 
provides commons-compress 1.18.

> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2019-01-04 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16733989#comment-16733989
 ] 

Hudson commented on NUTCH-2667:
---

FAILURE: Integrated in Jenkins build Nutch-nutchgora #1624 (See 
[https://builds.apache.org/job/Nutch-nutchgora/1624/])
NUTCH-2667 Update Tika and Commons Collections 4 - upgrade to Tika (snagel: 
[https://github.com/apache/nutch/commit/70443655ee5c2aaab6ecaf3f5175384af5cdb9e1])
* (edit) src/plugin/parse-tika/plugin.xml
* (edit) ivy/ivysettings.xml
* (edit) 
src/plugin/protocol-http/src/test/org/apache/nutch/protocol/http/TestProtocolHttp.java
* (edit) src/plugin/build-plugin.xml
* (edit) ivy/ivy.xml
* (edit) src/plugin/parse-tika/howto_upgrade_tika.txt
* (edit) src/plugin/parse-tika/build-ivy.xml
* (edit) src/plugin/parse-tika/ivy.xml


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2019-01-04 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16733945#comment-16733945
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

sebastian-nagel commented on pull request #423: NUTCH-2667 Update Tika and 
Commons Collections 4
URL: https://github.com/apache/nutch/pull/423
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-12-27 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729699#comment-16729699
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

sebastian-nagel commented on issue #403: NUTCH-2667 Update Tika and Commons 
Collections 4
URL: https://github.com/apache/nutch/pull/403#issuecomment-450182163
 
 
   Obsoleted by [Tika 1.20 which has been released a week 
ago](https://dist.apache.org/repos/dist/release/tika/CHANGES-1.20.txt). Thanks 
@lewismc! I'm closing this PR in favor of #423 which upgrades to Tika 1.20 and 
also includes the upgrade to commons-collections4 version 4.2.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-12-27 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729700#comment-16729700
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

sebastian-nagel commented on pull request #403: NUTCH-2667 Update Tika and 
Commons Collections 4
URL: https://github.com/apache/nutch/pull/403
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-12-27 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729694#comment-16729694
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

sebastian-nagel commented on pull request #423: NUTCH-2667 Update Tika and 
Commons Collections 4
URL: https://github.com/apache/nutch/pull/423
 
 
   - upgrade to Tika dependencies to version 1.20 (add exclusions of transitive 
dependencies alreadyprovided as core dependencies)
   - upgrade Commons Collections 4 to version 4.2
   - upgrade Commons Codec to version 3.11
   - adapt unit tests to dependency upgrades
   - add "report" target to plugin builds for debugging
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-24 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16662565#comment-16662565
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

axrdb edited a comment on issue #403: NUTCH-2667 Update Tika and Commons 
Collections 4
URL: https://github.com/apache/nutch/pull/403#issuecomment-432746464
 
 
   here is the final part of output after running command inside 
src/plugin/parse-tika - "ant -Dpackaging.type=jar -v -f ./build-ivy.xml"
   
   [ivy:retrieve] :: downloading artifacts ::
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.tika#tika-parsers;1.19.1!tika-parsers.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.glassfish.jaxb#jaxb-core;2.3.0.1!jaxb-core.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.glassfish.jaxb#jaxb-runtime;2.3.0.1!jaxb-runtime.jar
   [ivy:retrieve]   [NOT REQUIRED] 
javax.activation#activation;1.1.1!activation.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.gagravarr#vorbis-java-tika;0.8!vorbis-java-tika.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.healthmarketscience.jackcess#jackcess;2.1.12!jackcess.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.healthmarketscience.jackcess#jackcess-encrypt;2.1.4!jackcess-encrypt.jar
   [ivy:retrieve]   [NOT REQUIRED] org.tallison#jmatio;1.5!jmatio.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.james#apache-mime4j-core;0.8.2!apache-mime4j-core.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.james#apache-mime4j-dom;0.8.2!apache-mime4j-dom.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.commons#commons-compress;1.18!commons-compress.jar
   [ivy:retrieve]   [NOT REQUIRED] org.tukaani#xz;1.8!xz.jar
   [ivy:retrieve]   [NOT REQUIRED] com.epam#parso;2.0.9!parso.jar
   [ivy:retrieve]   [NOT REQUIRED] org.brotli#dec;0.1.2!dec.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.pdfbox#pdfbox;2.0.12!pdfbox.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.pdfbox#pdfbox-tools;2.0.12!pdfbox-tools.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.pdfbox#jempbox;1.8.16!jempbox.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.bouncycastle#bcmail-jdk15on;1.60!bcmail-jdk15on.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.bouncycastle#bcprov-jdk15on;1.60!bcprov-jdk15on.jar
   [ivy:retrieve]   [NOT REQUIRED] org.apache.poi#poi;4.0.0!poi.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.poi#poi-scratchpad;4.0.0!poi-scratchpad.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.poi#poi-ooxml;4.0.0!poi-ooxml.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.ccil.cowan.tagsoup#tagsoup;1.2.1!tagsoup.jar
   [ivy:retrieve]   [NOT REQUIRED] org.ow2.asm#asm;6.2!asm.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.googlecode.mp4parser#isoparser;1.1.22!isoparser.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.drewnoakes#metadata-extractor;2.11.0!metadata-extractor.jar
   [ivy:retrieve]   [NOT REQUIRED] 
de.l3s.boilerpipe#boilerpipe;1.1.0!boilerpipe.jar
   [ivy:retrieve]   [NOT REQUIRED] com.rometools#rome;1.5.1!rome.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.gagravarr#vorbis-java-core;0.8!vorbis-java-core.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.googlecode.juniversalchardet#juniversalchardet;1.0.3!juniversalchardet.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.codelibs#jhighlight;1.0.3!jhighlight.jar
   [ivy:retrieve]   [NOT REQUIRED] com.pff#java-libpst;0.8.1!java-libpst.jar
   [ivy:retrieve]   [NOT REQUIRED] com.github.junrar#junrar;2.0.0!junrar.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.cxf#cxf-rt-rs-client;3.2.6!cxf-rt-rs-client.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.commons#commons-exec;1.3!commons-exec.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.opennlp#opennlp-tools;1.9.0!opennlp-tools.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] commons-io#commons-io;2.6!commons-io.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.googlecode.json-simple#json-simple;1.1.1!json-simple.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
com.github.openjson#openjson;1.0.10!openjson.jar
   [ivy:retrieve]   [NOT REQUIRED] com.google.code.gson#gson;2.8.5!gson.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.slf4j#jul-to-slf4j;1.7.25!jul-to-slf4j.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.slf4j#jcl-over-slf4j;1.7.25!jcl-over-slf4j.jar
   [ivy:retrieve]   [NOT REQUIRED] edu.ucar#netcdf4;4.5.5!netcdf4.jar
   [ivy:retrieve]   [NOT REQUIRED] edu.ucar#grib;4.5.5!grib.jar
   [ivy:retrieve]   [NOT REQUIRED] net.java.dev.jna#jna;4.3.0!jna.jar
   [ivy:retrieve]   [NOT REQUIRED] org.jsoup#jsoup;1.11.3!jsoup.jar
   [ivy:retrieve]   [NOT REQUIRED] edu.ucar#cdm;4.5.5!cdm.jar
   [ivy:retrieve]   [NOT REQUIRED] 
edu.ucar#httpservices;4.5.5!httpservices.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.httpcomponents#httpmime;4.5.6!httpmime.jar
   [ivy:retrieve]   [NOT 

[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-24 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16662562#comment-16662562
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

axrdb commented on issue #403: NUTCH-2667 Update Tika and Commons Collections 4
URL: https://github.com/apache/nutch/pull/403#issuecomment-432746464
 
 
   here is the final part of output after running command - "ant 
-Dpackaging.type=jar -v -f ./build-ivy.xml"
   
   [ivy:retrieve] :: downloading artifacts ::
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.tika#tika-parsers;1.19.1!tika-parsers.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.glassfish.jaxb#jaxb-core;2.3.0.1!jaxb-core.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.glassfish.jaxb#jaxb-runtime;2.3.0.1!jaxb-runtime.jar
   [ivy:retrieve]   [NOT REQUIRED] 
javax.activation#activation;1.1.1!activation.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.gagravarr#vorbis-java-tika;0.8!vorbis-java-tika.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.healthmarketscience.jackcess#jackcess;2.1.12!jackcess.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.healthmarketscience.jackcess#jackcess-encrypt;2.1.4!jackcess-encrypt.jar
   [ivy:retrieve]   [NOT REQUIRED] org.tallison#jmatio;1.5!jmatio.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.james#apache-mime4j-core;0.8.2!apache-mime4j-core.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.james#apache-mime4j-dom;0.8.2!apache-mime4j-dom.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.commons#commons-compress;1.18!commons-compress.jar
   [ivy:retrieve]   [NOT REQUIRED] org.tukaani#xz;1.8!xz.jar
   [ivy:retrieve]   [NOT REQUIRED] com.epam#parso;2.0.9!parso.jar
   [ivy:retrieve]   [NOT REQUIRED] org.brotli#dec;0.1.2!dec.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.pdfbox#pdfbox;2.0.12!pdfbox.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.pdfbox#pdfbox-tools;2.0.12!pdfbox-tools.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.pdfbox#jempbox;1.8.16!jempbox.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.bouncycastle#bcmail-jdk15on;1.60!bcmail-jdk15on.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.bouncycastle#bcprov-jdk15on;1.60!bcprov-jdk15on.jar
   [ivy:retrieve]   [NOT REQUIRED] org.apache.poi#poi;4.0.0!poi.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.poi#poi-scratchpad;4.0.0!poi-scratchpad.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.poi#poi-ooxml;4.0.0!poi-ooxml.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.ccil.cowan.tagsoup#tagsoup;1.2.1!tagsoup.jar
   [ivy:retrieve]   [NOT REQUIRED] org.ow2.asm#asm;6.2!asm.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.googlecode.mp4parser#isoparser;1.1.22!isoparser.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.drewnoakes#metadata-extractor;2.11.0!metadata-extractor.jar
   [ivy:retrieve]   [NOT REQUIRED] 
de.l3s.boilerpipe#boilerpipe;1.1.0!boilerpipe.jar
   [ivy:retrieve]   [NOT REQUIRED] com.rometools#rome;1.5.1!rome.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.gagravarr#vorbis-java-core;0.8!vorbis-java-core.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.googlecode.juniversalchardet#juniversalchardet;1.0.3!juniversalchardet.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.codelibs#jhighlight;1.0.3!jhighlight.jar
   [ivy:retrieve]   [NOT REQUIRED] com.pff#java-libpst;0.8.1!java-libpst.jar
   [ivy:retrieve]   [NOT REQUIRED] com.github.junrar#junrar;2.0.0!junrar.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.cxf#cxf-rt-rs-client;3.2.6!cxf-rt-rs-client.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.commons#commons-exec;1.3!commons-exec.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.opennlp#opennlp-tools;1.9.0!opennlp-tools.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] commons-io#commons-io;2.6!commons-io.jar
   [ivy:retrieve]   [NOT REQUIRED] 
com.googlecode.json-simple#json-simple;1.1.1!json-simple.jar(bundle)
   [ivy:retrieve]   [NOT REQUIRED] 
com.github.openjson#openjson;1.0.10!openjson.jar
   [ivy:retrieve]   [NOT REQUIRED] com.google.code.gson#gson;2.8.5!gson.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.slf4j#jul-to-slf4j;1.7.25!jul-to-slf4j.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.slf4j#jcl-over-slf4j;1.7.25!jcl-over-slf4j.jar
   [ivy:retrieve]   [NOT REQUIRED] edu.ucar#netcdf4;4.5.5!netcdf4.jar
   [ivy:retrieve]   [NOT REQUIRED] edu.ucar#grib;4.5.5!grib.jar
   [ivy:retrieve]   [NOT REQUIRED] net.java.dev.jna#jna;4.3.0!jna.jar
   [ivy:retrieve]   [NOT REQUIRED] org.jsoup#jsoup;1.11.3!jsoup.jar
   [ivy:retrieve]   [NOT REQUIRED] edu.ucar#cdm;4.5.5!cdm.jar
   [ivy:retrieve]   [NOT REQUIRED] 
edu.ucar#httpservices;4.5.5!httpservices.jar
   [ivy:retrieve]   [NOT REQUIRED] 
org.apache.httpcomponents#httpmime;4.5.6!httpmime.jar
   [ivy:retrieve]   [NOT REQUIRED] 

[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-24 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16662530#comment-16662530
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

axrdb commented on a change in pull request #403: NUTCH-2667 Update Tika and 
Commons Collections 4
URL: https://github.com/apache/nutch/pull/403#discussion_r227874807
 
 

 ##
 File path: src/plugin/parse-tika/plugin.xml
 ##
 @@ -25,87 +25,108 @@
   
  
   
-
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
 
 Review comment:
   Even after including both (1.6 and 1.10) for commons-codec. The dependency 
is still an issue and its unable to find module: javax.measure#unit-api


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-24 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16662237#comment-16662237
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

sebastian-nagel commented on a change in pull request #403: NUTCH-2667 Update 
Tika and Commons Collections 4
URL: https://github.com/apache/nutch/pull/403#discussion_r227679697
 
 

 ##
 File path: .gitignore
 ##
 @@ -8,4 +8,10 @@ logs/
 *.iml
 .idea
 .log
-
+.classpath
+.naivebayes-model.crc
+.project
+ivy/ivy-2.3.0.jar
+ivy/ivy-2.4.0.jar
+naivebayes-model
+src/plugin/parse-tika/lib/
 
 Review comment:
   The directory `src/plugin/parse-tika/lib/` should be removed instead. It may 
cause that there are duplicate library dependencies (conflicting versions), see 
comments below.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-24 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16662235#comment-16662235
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

sebastian-nagel commented on a change in pull request #403: NUTCH-2667 Update 
Tika and Commons Collections 4
URL: https://github.com/apache/nutch/pull/403#discussion_r227682607
 
 

 ##
 File path: src/plugin/parse-tika/plugin.xml
 ##
 @@ -25,87 +25,108 @@
   
  
   
-
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
-   
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
+  
 
 Review comment:
   @lewismc, was this caused by an existing and non-empty folder 
src/plugin/parse-tika/lib/? But we should add `rm lib/*` as a necessary step in 
howto_upgrade_tika.txt anyway.


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated. This issue needs to 
> address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (NUTCH-2667) Update Tika and Commons Collections 4

2018-10-23 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/NUTCH-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16661740#comment-16661740
 ] 

ASF GitHub Bot commented on NUTCH-2667:
---

lewismc opened a new pull request #403: NUTCH-2667 Update Tika and Commons 
Collections 4
URL: https://github.com/apache/nutch/pull/403
 
 
   This issue addresses https://issues.apache.org/jira/browse/NUTCH-2667


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Update Tika and Commons Collections 4
> -
>
> Key: NUTCH-2667
> URL: https://issues.apache.org/jira/browse/NUTCH-2667
> Project: Nutch
>  Issue Type: Improvement
>  Components: build
>Affects Versions: 2.4
>Reporter: Lewis John McGibbney
>Assignee: Lewis John McGibbney
>Priority: Blocker
> Fix For: 2.4
>
>
> Tika and Commons Collections 4 need to be updated due to known CVE's.
> This issue needs to address them.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)