[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-14 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567090#comment-17567090
 ] 

Hudson commented on TIKA-3795:
--

FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #688 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/688/])
TIKA-3795: update aws (tilman: 
[https://github.com/apache/tika/commit/724c5a3e9aa6dbfaf747a965c7de0632a435eb6b])
* (edit) tika-parent/pom.xml
TIKA-3795: update spring (tilman: 
[https://github.com/apache/tika/commit/f0d5318e65c36af2db39d1243fb9a0174874cc85])
* (edit) tika-parent/pom.xml


> General upgrades for 2.4.2
> --
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
> Issue Type: Improvement
> Components: build
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.4.2
>
> Attachments: image-2022-07-12-06-46-00-215.png
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[GitHub] [tika] dependabot[bot] opened a new pull request, #611: Bump exec-maven-plugin from 3.0.0 to 3.1.0

2022-07-14 Thread GitBox


dependabot[bot] opened a new pull request, #611:
URL: https://github.com/apache/tika/pull/611

   Bumps [exec-maven-plugin](https://github.com/mojohaus/exec-maven-plugin) from 3.0.0 to 3.1.0.

   Release notes
   Sourced from exec-maven-plugin's releases (https://github.com/mojohaus/exec-maven-plugin/releases).

   3.1.0

   New features and improvements

   * Require Maven 3.2.5 (#319) @slawekjaranowski
   * Support stream inheritance for the forked process, fixes #71 (#196) @gnodet

   Bug Fixes

   * Fix NullPointerException when using plugin dependencies in version 1.6.0 (#77) @jonasrutishauser
   * preload common pool - issue #198 (#279) @rmannibucau
   * fix handling of LongModulePathArgument and LongClassPathArgument (#172) @jan-feindt
   * Do not drop environment variables that contain '=' in their value, or have no value. (#97) @AlanUnderwood
   * Empty argument tag should add empty string instead of null (#150) @Anders-E
   * Fixes #160, ensure the java classloader is a child first one and supports to excludes some gathered classpath element to solve manually conflicts (#161) @rmannibucau

   Dependency updates

   * Bump commons-io from 1.1 to 2.7 in /src/it/projects/project6/project5lib (#320) @dependabot
   * Bump plexus-utils from 1.1 to 3.0.16 in /src/it/projects/java_module-mainclass (#313) @dependabot
   * Bump plexus-utils from 1.1 to 3.0.16 in /src/it/projects/longModulepath with spaces (#312) @dependabot
   * Bump plexus-utils from 1.1 to 3.0.16 in /src/it/projects/java_module-module+mainclass (#311) @dependabot
   * Bump plexus-utils from 1.1 to 3.0.16 in /src/it/projects/jigsaw (#310) @dependabot
   * Bump plexus-utils from 1.1 to 3.0.16 in /src/it/mrm/java_module (#309) @dependabot
   * Bump mojo-parent from 68 to 69 (#314) @dependabot
   * Bump plexus-utils from 1.1 to 3.0.16 in /src/it/projects/java_module-export (#308) @dependabot
   * Bump mrm-maven-plugin from 1.3.0 to 1.4.1 (#307) @dependabot
   * Bump mojo-parent from 67 to 68 (#305) @dependabot
   * Bump groovy from 3.0.10 to 3.0.11 (#302) @dependabot
   * Bump maven-invoker-plugin from 3.2.2 to 3.3.0 (#301) @dependabot
   * Bump mockito-core from 4.5.1 to 4.6.1 (#303) @dependabot
   * Bump plexus-utils from 3.4.1 to 3.4.2 (#299) @dependabot
   * Bump release-drafter/release-drafter from 5.19.0 

[GitHub] [tika] dependabot[bot] opened a new pull request, #610: Bump google-cloud-storage from 2.9.3 to 2.10.0

2022-07-14 Thread GitBox


dependabot[bot] opened a new pull request, #610:
URL: https://github.com/apache/tika/pull/610

   Bumps [google-cloud-storage](https://github.com/googleapis/java-storage) from 2.9.3 to 2.10.0.

   Release notes
   Sourced from google-cloud-storage's releases (https://github.com/googleapis/java-storage/releases).

   v2.10.0
   2.10.0 (2022-07-13) (https://github.com/googleapis/java-storage/compare/v2.9.3...v2.10.0)

   Features

   * Custom Placement Config Dual Region Support (#1470) (36440fc)

   Bug Fixes

   * enable longpaths support for windows test (#1485) (#1506) (e36e809)

   Documentation

   * sample: upgrade maven-surefire and remove junit-platform-native (#1476) (9b222e4)

   Dependencies

   * update dependency org.graalvm.buildtools:native-maven-plugin to v0.9.13 (#1501) (36e3898)

   Changelog
   Sourced from google-cloud-storage's changelog (https://github.com/googleapis/java-storage/blob/main/CHANGELOG.md).

   2.10.0 (2022-07-13) (https://github.com/googleapis/java-storage/compare/v2.9.3...v2.10.0)

   Features

   * Custom Placement Config Dual Region Support (#1470) (36440fc)

   Bug Fixes

   * enable longpaths support for windows test (#1485) (#1506) (e36e809)

   Documentation

   * sample: upgrade maven-surefire and remove junit-platform-native (#1476) (9b222e4)

   Dependencies

   * update dependency org.graalvm.buildtools:native-maven-plugin to v0.9.13 (#1501) (36e3898)

   Commits

   * 796cc6c chore(main): release 2.10.0 (#1504)
   * e36e809 fix: enable longpaths support for windows test (#1485) (#1506)
   * 4712c7c test(deps): update dependency com.google.cloud:google-cloud-conformance-tests...
   * cfb109f chore(deps): update dependency com.google.cloud:libraries-bom to v26 (#1495)
   * 36440fc feat: Custom Placement Config Dual Region Support (#1470)
   * 36e3898 deps: update dependency org.graalvm.buildtools:native-maven-plugin to v0.9.13...
   * 2a5e23e chore: add native image checks as required (#1475)
   * 9b222e4 docs(sample): upgrade maven-surefire and remove junit-platform-native (#1476)
   

[jira] [Closed] (TIKA-2492) Remove pdfdebugger from tika

2022-07-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-2492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr closed TIKA-2492.
-
Resolution: Fixed

> Remove pdfdebugger from tika
> 
>
> Key: TIKA-2492
> URL: https://issues.apache.org/jira/browse/TIKA-2492
> Project: Tika
> Issue Type: Improvement
> Components: packaging
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.0.0-BETA, 1.17
>
>
> PDFDebugger isn't needed in tika but it is a dependency in pdfbox-tools 
> (because that one contains the command line interface, which calls the PDFBox 
> command line tools).
> Thus I suggest that the tika parser pom be changed like this:
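> {code}
> <dependency>
>   <groupId>org.apache.pdfbox</groupId>
>   <artifactId>pdfbox-tools</artifactId>
>   <version>${pdfbox.version}</version>
>   <exclusions>
>     <exclusion>
>       <groupId>commons-logging</groupId>
>       <artifactId>commons-logging</artifactId>
>     </exclusion>
> +    <exclusion>
> +      <groupId>org.apache.pdfbox</groupId>
> +      <artifactId>pdfbox-debugger</artifactId>
> +    </exclusion>
>   </exclusions>
> </dependency>
> {code}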
> This saves you 200KB in tika-app. That's not much, but every weight loss 
> counts :-)
> It should also be possible to get it removed from tika-bundle, but I don't 
> know how to remove it properly. Just removing it from "Embed-Dependency" 
> isn't enough.
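
One way to confirm that an exclusion like the one quoted above took effect is to check at runtime whether the debugger class can still be loaded. The following is only a minimal sketch and not part of Tika or PDFBox: ClasspathCheck is a hypothetical helper, and the class name org.apache.pdfbox.debugger.PDFDebugger is an assumption about what the pdfbox-debugger artifact ships.

```java
// Hypothetical helper for illustration; not part of Tika or PDFBox.
final class ClasspathCheck {

    /** Returns true if the named class can be located by this class's loader. */
    static boolean isPresent(String className) {
        try {
            // initialize=false: only look the class up, do not run its static initializers
            Class.forName(className, false, ClasspathCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumed entry-point class of the pdfbox-debugger artifact.
        String debugger = "org.apache.pdfbox.debugger.PDFDebugger";
        System.out.println(debugger + " present: " + isPresent(debugger));
    }
}
```

Run with the tika-app or parser classpath; if the exclusion worked as intended, the check would be expected to report that the class is absent.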





[jira] [Resolved] (TIKA-3818) Remove pdfdebugger from tika (2)

2022-07-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved TIKA-3818.
---
Resolution: Fixed

> Remove pdfdebugger from tika (2)
> 
>
> Key: TIKA-3818
> URL: https://issues.apache.org/jira/browse/TIKA-3818
> Project: Tika
> Issue Type: Task
> Components: build
> Affects Versions: 2.4.1
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
> Priority: Trivial
> Fix For: 2.4.2
>
>
> We already did this in 2017 in TIKA-2492, but it reappeared (see the graph I 
> posted in TIKA-3817), or wasn't removed fully.





[jira] [Commented] (TIKA-3818) Remove pdfdebugger from tika (2)

2022-07-14 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567080#comment-17567080
 ] 

Hudson commented on TIKA-3818:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #687 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/687/])
TIKA-3818: remove pdfbox-debugger (tilman: 
[https://github.com/apache/tika/commit/0bc3e18ab3c4849fdb31dd05aa0c494c31d17d7b])
* (edit) tika-bundles/tika-bundle-standard/pom.xml
* (edit) 
tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-pdf-module/pom.xml


> Remove pdfdebugger from tika (2)
> 
>
> Key: TIKA-3818
> URL: https://issues.apache.org/jira/browse/TIKA-3818
> Project: Tika
> Issue Type: Task
> Components: build
> Affects Versions: 2.4.1
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
> Priority: Trivial
> Fix For: 2.4.2
>
>
> We already did this in 2017 in TIKA-2492, but it reappeared (see the graph I 
> posted in TIKA-3817), or wasn't removed fully.





[jira] [Commented] (TIKA-3817) Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 - bug changes version to 2.4.1 (Tika's version)

2022-07-14 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567076#comment-17567076
 ] 

Tilman Hausherr commented on TIKA-3817:
---

I notice you're using tika-app, which is meant as a stand-alone application. Why 
not use only the dependencies you really need?
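
When two jars on the classpath both contain a class such as com.fasterxml.jackson.core.JsonGenerator, it can help to see which jar the class is actually loaded from. The following is a minimal sketch using only the JDK; WhichJar is a hypothetical helper name and is not part of Tika, Jackson, or the Azure SDK.

```java
import java.net.URL;
import java.security.CodeSource;

// Hypothetical diagnostic helper; uses only JDK APIs.
final class WhichJar {

    /** Returns the location (jar or directory) a class was loaded from, if known. */
    static String locationOf(Class<?> type) {
        CodeSource source = type.getProtectionDomain().getCodeSource();
        // Classes loaded by the bootstrap loader (e.g. java.lang.String) report no code source.
        URL location = source == null ? null : source.getLocation();
        return location == null ? "(bootstrap or unknown)" : location.toString();
    }

    public static void main(String[] args) {
        // Default to the class named in the issue; pass another class name as an argument.
        String name = args.length > 0 ? args[0] : "com.fasterxml.jackson.core.JsonGenerator";
        try {
            System.out.println(name + " loaded from " + locationOf(Class.forName(name)));
        } catch (ClassNotFoundException e) {
            System.out.println(name + " is not on the classpath");
        }
    }
}
```

If the printed location is the tika-app jar rather than a jackson-core jar, that would suggest the stand-alone jar is the one supplying the Jackson classes.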

> Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 
> - bug changes version to 2.4.1 (Tika's version)
> 
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
> Issue Type: Bug
> Components: app
> Affects Versions: 2.4.1
> Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
> azure-core 1.30.0
> Reporter: Andre Nel
> Priority: Major
> Labels: Azure, jackson, tika-app, version
> Attachments: screenshot-1.png
>
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
>     at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
>     at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
>     at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
>     at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code calling azure-core graph:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = graphClient.me().buildRequest().get();
>  
> which calls 

[jira] [Updated] (TIKA-3818) Remove pdfdebugger from tika (2)

2022-07-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated TIKA-3818:
--
Description: We already did this in 2017 in TIKA-2492, but it reappeared 
(see the graph I posted in TIKA-3817), or wasn't removed fully.  (was: We 
already did this in 2017 in TIKA-2492, but it reappeared, or wasn't removed 
fully.)

> Remove pdfdebugger from tika (2)
> 
>
> Key: TIKA-3818
> URL: https://issues.apache.org/jira/browse/TIKA-3818
> Project: Tika
> Issue Type: Task
> Components: build
> Affects Versions: 2.4.1
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
> Priority: Trivial
> Fix For: 2.4.2
>
>
> We already did this in 2017 in TIKA-2492, but it reappeared (see the graph I 
> posted in TIKA-3817), or wasn't removed fully.





[jira] [Updated] (TIKA-3818) Remove pdfdebugger from tika (2)

2022-07-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated TIKA-3818:
--
Affects Version/s: 2.4.1
   (was: 1.28.4)

> Remove pdfdebugger from tika (2)
> 
>
> Key: TIKA-3818
> URL: https://issues.apache.org/jira/browse/TIKA-3818
> Project: Tika
> Issue Type: Task
> Components: build
> Affects Versions: 2.4.1
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
> Priority: Trivial
> Fix For: 1.28.5
>
>
> We already did this in 2017 in TIKA-2492, but it reappeared, or wasn't 
> removed fully.





[jira] [Created] (TIKA-3818) Remove pdfdebugger from tika (2)

2022-07-14 Thread Tilman Hausherr (Jira)
Tilman Hausherr created TIKA-3818:
-

 Summary: Remove pdfdebugger from tika (2)
 Key: TIKA-3818
 URL: https://issues.apache.org/jira/browse/TIKA-3818
 Project: Tika
 Issue Type: Task
 Components: build
 Affects Versions: 1.28.4
 Reporter: Tilman Hausherr
 Assignee: Tilman Hausherr
 Fix For: 1.28.5


We already did this in 2017 in TIKA-2492, but it reappeared, or wasn't removed 
fully.





[jira] [Updated] (TIKA-3818) Remove pdfdebugger from tika (2)

2022-07-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated TIKA-3818:
--
Fix Version/s: 2.4.2
   (was: 1.28.5)

> Remove pdfdebugger from tika (2)
> 
>
> Key: TIKA-3818
> URL: https://issues.apache.org/jira/browse/TIKA-3818
> Project: Tika
> Issue Type: Task
> Components: build
> Affects Versions: 2.4.1
> Reporter: Tilman Hausherr
> Assignee: Tilman Hausherr
> Priority: Trivial
> Fix For: 2.4.2
>
>
> We already did this in 2017 in TIKA-2492, but it reappeared, or wasn't 
> removed fully.





[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 - bug changes version to 2.4.1 (Tika's version)

2022-07-14 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated TIKA-3817:
--
Attachment: screenshot-1.png

> Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 
> - bug changes version to 2.4.1 (Tika's version)
> 
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
> Issue Type: Bug
> Components: app
> Affects Versions: 2.4.1
> Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
> azure-core 1.30.0
> Reporter: Andre Nel
> Priority: Major
> Labels: Azure, jackson, tika-app, version
> Attachments: screenshot-1.png
>
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
>     at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
>     at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
>     at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
>     at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code calling azure-core graph:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = graphClient.me().buildRequest().get();
>  
> which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
> JacksonVersion.class line 43 onwards:
>     private 

[jira] [Commented] (TIKA-3817) Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 - bug changes version to 2.4.1 (Tika's version)

2022-07-14 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17567061#comment-17567061
 ] 

Tilman Hausherr commented on TIKA-3817:
---

I don't understand this either. We use azure-core ourselves with the same 
version. I added it to a tika demo project of mine, which now looks like this:
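{code}
<dependencies>
    <dependency>
        <groupId>org.apache.tika</groupId>
        <artifactId>tika-core</artifactId>
        <version>2.4.1</version>
    </dependency>
    <dependency>
        <groupId>org.apache.tika</groupId>
        <artifactId>tika-parser-pdf-module</artifactId>
        <version>2.4.1</version>
    </dependency>
    <dependency>
        <groupId>com.azure</groupId>
        <artifactId>azure-core</artifactId>
        <version>1.30.0</version>
    </dependency>
</dependencies>
{code}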
The dependency graph looks like this:
 !screenshot-1.png! 
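
For readers puzzled by the log in the issue description: the message that 2.4.1 is "older than earliest supported version 2.10.0" is consistent with comparing versions component by component as numbers, not as strings. The following is a minimal sketch of such a comparison; VersionCompare is a hypothetical helper and is not the code azure-core actually uses.

```java
// Hypothetical illustration of numeric, component-wise version comparison.
final class VersionCompare {

    /** Compares dotted numeric versions; missing components count as 0. */
    static int compare(String a, String b) {
        String[] x = a.split("\\.");
        String[] y = b.split("\\.");
        for (int i = 0; i < Math.max(x.length, y.length); i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        System.out.println(compare("2.4.1", "2.10.0") < 0);   // true: 4 < 10
        System.out.println(compare("2.13.3", "2.10.0") > 0);  // true: 13 > 10
        // A plain string comparison gets the first case wrong, because '4' sorts after '1'.
        System.out.println("2.4.1".compareTo("2.10.0") > 0);  // true
    }
}
```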


[jira] [Comment Edited] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Sai Konuri (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566996#comment-17566996
 ] 

Sai Konuri edited comment on TIKA-3814 at 7/14/22 10:12 PM:


Thanks Nick and Tim! 

 

As suggested, I agree that we are left with two options: 
 # Write a custom handler that skips the line break characters if we are inside 
the text tags (etc.)
 # We also tried replacing the "\r\n" with "" before parsing, and that is giving 
us desirable results so far. However, we need to do some more testing on this.

 


was (Author: JIRAUSER292407):
Thanks Nick and Tim! 

 

As suggested, I agree that we are left with two options: 
 # Write a custom handler that skips the line break characters if we are inside 
the text tags (etc.)
 # We also tried replacing the "\r\n" with "" and that is giving us desirable 
results so far. We need to do some more testing for this however

 

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Minor
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>
> When there is a newline character ('\n') within the text of a 
> ,,, etc, the text that is extracted is not excluding those 
> newlines. 
> A sample html file is attached.
>  
> {*}Expected{*}:
> !image-2022-07-06-19-08-30-437.png!
>  
> {*}Actual{*}: 
> !image-2022-07-06-19-09-54-534.png!
>  
>  
> This is the code I am using to extract the text of the HTML file: 
> {code:java}
> AutoDetectParser parser = new AutoDetectParser();
> BodyContentHandler handler = new BodyContentHandler();
> Metadata metadata = new Metadata();
> try (InputStream stream = 
> this.getClass().getClassLoader().getResourceAsStream("bug.html")) {
>     parser.parse(stream, handler, metadata);
>     System.out.println(handler);
> } {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Sai Konuri (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566996#comment-17566996
 ] 

Sai Konuri commented on TIKA-3814:
--

Thanks Nick and Tim! 

 

As suggested, I agree that we are left with two options: 
 # Write a custom handler that skips the line break characters if we are inside 
the text tags (etc.)
 # We also tried replacing the "\r\n" with "", and that is giving us desirable 
results so far. However, we need to do some more testing on this.

 

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Minor
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Commented] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Nick Burch (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566991#comment-17566991
 ] 

Nick Burch commented on TIKA-3814:
--

I have a feeling that the Text content handler might rely on these coming 
through in the character stream to nicely-ish format the text output?

I do agree that a custom content handler that tracks whether it's inside the "no 
breaks wanted" tags, and skips newlines in the character stream if so, is 
likely to be the best solution here.
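
A minimal sketch of such a handler, using only the JDK's SAX classes rather than Tika's own handler classes. The class name, the set of tracked tags, and the choice to turn a line break into a single space are all assumptions for illustration. In a Tika setup the same logic would presumably sit in a decorator around BodyContentHandler and match on the local name, since Tika emits namespaced XHTML.

```java
import java.io.StringReader;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

public class NewlineCollapsingHandler extends DefaultHandler {
    // Hypothetical list of "no breaks wanted" elements; adjust as needed.
    private static final Set<String> TRACKED =
            new HashSet<>(Arrays.asList("p", "span", "b", "i"));

    private final StringBuilder text = new StringBuilder();
    private int depth = 0; // greater than 0 while inside a tracked element

    private static boolean tracked(String qName) {
        return TRACKED.contains(qName.toLowerCase(Locale.ROOT));
    }

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        if (tracked(qName)) {
            depth++;
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if (tracked(qName) && depth > 0) {
            depth--;
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        for (int i = start; i < start + length; i++) {
            char c = ch[i];
            if (depth > 0 && (c == '\r' || c == '\n')) {
                // Inside a tracked element: emit one space instead of the
                // line break, and avoid producing a run of spaces.
                if (text.length() > 0 && text.charAt(text.length() - 1) != ' ') {
                    text.append(' ');
                }
            } else {
                text.append(c);
            }
        }
    }

    @Override
    public String toString() {
        return text.toString();
    }

    /** Parses well-formed XHTML and returns the collected text. */
    public static String extract(String xhtml) {
        try {
            NewlineCollapsingHandler handler = new NewlineCollapsingHandler();
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xhtml)), handler);
            return handler.toString();
        } catch (Exception e) {
            throw new IllegalStateException("could not parse input", e);
        }
    }
}
```

Line breaks outside the tracked elements are passed through untouched, so any formatting the text output relies on elsewhere is left alone.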

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Minor
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Comment Edited] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566979#comment-17566979
 ] 

Tim Allison edited comment on TIKA-3814 at 7/14/22 8:29 PM:


I'm sorry for our team's delay.  I haven't looked at the relevant specs 
recently.  Do you happen to know if we're breaking a portion of the spec on HTML 
parsing here?

If we convert {{\r\n}} to spaces, is that the solution or do we need to delete 
them?  I'm deeply worried that the latter will cause serious problems for word 
spacing on HTML that wasn't put together well.
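
To make the difference concrete, here is a small sketch with a hypothetical helper (not part of Tika) that applies either replacement to the raw markup before parsing:

```java
public class LineBreakReplacement {
    /** Replaces every CRLF, lone CR, or lone LF with the given replacement. */
    static String replaceLineBreaks(String html, String replacement) {
        // "\r\n" is listed first so a Windows line ending counts as one break.
        return html.replaceAll("\\r\\n|\\r|\\n", replacement);
    }

    public static void main(String[] args) {
        String html = "<p>one\r\ntwo</p>";
        // Deleting the break joins the two words.
        System.out.println(replaceLineBreaks(html, ""));
        // Replacing it with a space keeps the word boundary.
        System.out.println(replaceLineBreaks(html, " "));
    }
}
```

With deletion the two words run together, which is the word-spacing problem described above; with a space they stay separate.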

Do we need to do this only between {{  }}, or do we need to 
do this for everything within {{.*  }}, or do we need to 
do this for everything within {{.* 

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Critical
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Updated] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Tim Allison (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison updated TIKA-3814:
--
Priority: Minor  (was: Critical)

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Minor
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Commented] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566979#comment-17566979
 ] 

Tim Allison commented on TIKA-3814:
---

I'm sorry for our team's delay.  I haven't looked at the relevant specs 
recently.  Do you happen to know if we're breaking a portion of the spec on HTML 
parsing here?

If we convert {{\r\n}} to spaces, is that the solution or do we need to delete 
them?  I'm deeply worried that the latter will cause serious problems for word 
spacing on HTML that wasn't put together well.

Do we need to do this only between {{  }}, or do we need to 
do this for everything within {{.* 

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Critical
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-14 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566973#comment-17566973
 ] 

Hudson commented on TIKA-3795:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #686 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/686/])
TIKA-3795 -- update apache parent pom (tallison: 
[https://github.com/apache/tika/commit/12325670a8f029d4be85a4fc3a8aecd2a35f0ccc])
* (edit) tika-parent/pom.xml


> General upgrades for 2.4.2
> --
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
>  Issue Type: Improvement
>  Components: build
>Reporter: Tilman Hausherr
>Priority: Minor
> Fix For: 2.4.2
>
> Attachments: image-2022-07-12-06-46-00-215.png
>
>






[jira] [Commented] (TIKA-3812) Parser Order: image get parsed by GDALParser instead of TesseractOCRParser

2022-07-14 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566974#comment-17566974
 ] 

Hudson commented on TIKA-3812:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #686 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/686/])
TIKA-3812 -- test should work whether or not ffmpeg and exiftool are installed 
or not. (tallison: 
[https://github.com/apache/tika/commit/9dc592f95200885af75671f0e0381770d8b8298f])
* (edit) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/resources/2.4.1-tesseract.txt
* (edit) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/resources/2.4.1-no-tesseract.txt
* (edit) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/java/org/apache/tika/parser/scientific/integration/TestParsers.java


> Parser Order: image get parsed by GDALParser instead of TesseractOCRParser
> --
>
> Key: TIKA-3812
> URL: https://issues.apache.org/jira/browse/TIKA-3812
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.4.1
>Reporter: Eugen Caruntu
>Priority: Minor
> Attachments: parser-diffs.tgz
>
>
> The selected parser seems to be different in 2.4.1. For example sending an 
> image (jpg/png) that was previously (2.4.0) processed by TesseractOCRParser, 
> now gets parsed by GDALParser.
> Seems that when multiple parsers support same file types, the selected parser 
> depends on the order in which they get loaded.
> For example the GDALParser, ImageParser and TesseractOCRParser all support 
> image/jpeg, image/png, image/gif ...
> A recent change is reversing the parser order (TIKA-3750).
> Re-configuring the GDALParser by excluding the image mime types might work, 
> but there could be other duplicated parsers.
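
The load-order dependence described above can be shown with a toy model. This is not Tika's actual selection code: the class, the method, and the "last registration wins" rule are assumptions chosen only to show how reversing the order changes the result when several parsers claim the same type.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class ParserOrderDemo {
    /**
     * Builds a mime type to parser name map. Each entry is
     * {parserName, mimeType...}; a later entry overwrites an earlier one.
     */
    static Map<String, String> index(List<String[]> parsers) {
        Map<String, String> byType = new HashMap<>();
        for (String[] parser : parsers) {
            for (int i = 1; i < parser.length; i++) {
                byType.put(parser[i], parser[0]);
            }
        }
        return byType;
    }

    public static void main(String[] args) {
        List<String[]> order = new ArrayList<>(Arrays.asList(
                new String[] {"GDALParser", "image/jpeg", "image/png"},
                new String[] {"TesseractOCRParser", "image/jpeg", "image/png"}));
        System.out.println(index(order).get("image/png"));

        // Same parsers, reversed registration order, different winner.
        Collections.reverse(order);
        System.out.println(index(order).get("image/png"));
    }
}
```

Under this model the only stable fix is to make the claimed types disjoint, which is what excluding the image mime types from one parser would do.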





[jira] [Updated] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Sai Konuri (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sai Konuri updated TIKA-3814:
-
Priority: Critical  (was: Trivial)

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Critical
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Commented] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Sai Konuri (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566971#comment-17566971
 ] 

Sai Konuri commented on TIKA-3814:
--

This is impacting our customers for our feature, so marking this as critical. 

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Trivial
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Updated] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Sai Konuri (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sai Konuri updated TIKA-3814:
-
Priority: Trivial  (was: Critical)

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Trivial
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Updated] (TIKA-3814) Extracted text from HTML file does not exclude newline chars from body

2022-07-14 Thread Sai Konuri (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sai Konuri updated TIKA-3814:
-
Priority: Critical  (was: Trivial)

> Extracted text from HTML file does not exclude newline chars from body
> --
>
> Key: TIKA-3814
> URL: https://issues.apache.org/jira/browse/TIKA-3814
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.3.0
>Reporter: Sai Konuri
>Priority: Critical
> Attachments: bug.html, image-2022-07-06-19-08-30-437.png, 
> image-2022-07-06-19-09-54-534.png
>
>


[jira] [Commented] (TIKA-3812) Parser Order: image get parsed by GDALParser instead of TesseractOCRParser

2022-07-14 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566954#comment-17566954
 ] 

Tilman Hausherr commented on TIKA-3812:
---

Thanks, it works now.

> Parser Order: image get parsed by GDALParser instead of TesseractOCRParser
> --
>
> Key: TIKA-3812
> URL: https://issues.apache.org/jira/browse/TIKA-3812
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.4.1
>Reporter: Eugen Caruntu
>Priority: Minor
> Attachments: parser-diffs.tgz
>
>


[jira] [Commented] (TIKA-3812) Parser Order: image get parsed by GDALParser instead of TesseractOCRParser

2022-07-14 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566951#comment-17566951
 ] 

Hudson commented on TIKA-3812:
--

UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #685 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/685/])
TIKA-3812 -- add unit tests to confirm parser order with >= 2.4.1 (tallison: 
[https://github.com/apache/tika/commit/19b0337d60d91778d6837f88e62d151586888a79])
* (add) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/resources/2.4.0-no-tesseract.txt
* (add) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/resources/2.4.0-tesseract.txt
* (edit) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/pom.xml
* (add) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/java/org/apache/tika/parser/scientific/integration/TestParsers.java
* (add) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/resources/2.4.1-no-tesseract.txt
* (add) 
tika-parsers/tika-parsers-extended/tika-parser-scientific-package/src/test/resources/2.4.1-tesseract.txt


> Parser Order: image get parsed by GDALParser instead of TesseractOCRParser
> --
>
> Key: TIKA-3812
> URL: https://issues.apache.org/jira/browse/TIKA-3812
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.4.1
>Reporter: Eugen Caruntu
>Priority: Minor
> Attachments: parser-diffs.tgz
>
>


[jira] [Commented] (TIKA-3812) Parser Order: image get parsed by GDALParser instead of TesseractOCRParser

2022-07-14 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566946#comment-17566946
 ] 

Tim Allison commented on TIKA-3812:
---

Sorry.  Just pushed fix.  Tracking to see if that doesn't fix it.

> Parser Order: image get parsed by GDALParser instead of TesseractOCRParser
> --
>
> Key: TIKA-3812
> URL: https://issues.apache.org/jira/browse/TIKA-3812
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.4.1
>Reporter: Eugen Caruntu
>Priority: Minor
> Attachments: parser-diffs.tgz
>
>


[jira] [Commented] (TIKA-3812) Parser Order: image get parsed by GDALParser instead of TesseractOCRParser

2022-07-14 Thread Tilman Hausherr (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566943#comment-17566943
 ] 

Tilman Hausherr commented on TIKA-3812:
---

Build fails on my machine (W10):
{noformat}
[ERROR] Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 0.979 s 
<<< FAILURE! - in org.apache.tika.parser.scientific.integration.TestParsers
[ERROR] 
org.apache.tika.parser.scientific.integration.TestParsers.testDiffsFrom241  
Time elapsed: 0.951 s  <<< FAILURE!
org.opentest4j.AssertionFailedError: expected:  but was: 
        at org.junit.jupiter.api.AssertionUtils.fail(AssertionUtils.java:55)
        at 
org.junit.jupiter.api.AssertionUtils.failNotEqual(AssertionUtils.java:62)
        at 
org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:182)
        at 
org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:177)
        at org.junit.jupiter.api.Assertions.assertEquals(Assertions.java:1141)
        at 
org.apache.tika.parser.scientific.integration.TestParsers.testDiffsFrom241(TestParsers.java:66)
 {noformat}

> Parser Order: image get parsed by GDALParser instead of TesseractOCRParser
> --
>
> Key: TIKA-3812
> URL: https://issues.apache.org/jira/browse/TIKA-3812
> Project: Tika
>  Issue Type: Bug
>  Components: parser
>Affects Versions: 2.4.1
>Reporter: Eugen Caruntu
>Priority: Minor
> Attachments: parser-diffs.tgz
>
>


[jira] [Commented] (TIKA-3817) Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 - bug changes version to 2.4.1 (Tika's version)

2022-07-14 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17566884#comment-17566884
 ] 

Tim Allison commented on TIKA-3817:
---

What are we doing wrong at the tika level?  As you point out, Tika 2.4.1 uses 
jackson-core version 2.13.3.
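
One way to narrow this down might be to check which jar the Jackson class is actually loaded from at runtime. The sketch below uses only JDK calls; the class and method names are hypothetical, and it assumes the class in question is the com.fasterxml.jackson.core.JsonGenerator named in the report.

```java
import java.security.CodeSource;

public class ClassLocation {
    /** Returns where the named class was loaded from, as a printable string. */
    static String locate(String className) {
        try {
            // Load without initializing, so static initializers cannot fail here.
            Class<?> type = Class.forName(
                    className, false, ClassLocation.class.getClassLoader());
            CodeSource source = type.getProtectionDomain().getCodeSource();
            return source == null
                    ? "bootstrap or unknown source"
                    : String.valueOf(source.getLocation());
        } catch (ClassNotFoundException e) {
            return "not on classpath";
        }
    }

    public static void main(String[] args) {
        System.out.println(locate("com.fasterxml.jackson.core.JsonGenerator"));
    }
}
```

If the printed location is the tika-app jar rather than a jackson-core jar, that would suggest the class comes from a bundled copy and that the classpath order decides which one wins.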

> Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 
> - bug changes version to 2.4.1 (Tika's version)
> 
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
>  Issue Type: Bug
>  Components: app
>Affects Versions: 2.4.1
> Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
> azure-core 1.30.0
>Reporter: Andre Nel
>Priority: Major
>  Labels: Azure, jackson, tika-app, version
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at 
> com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at 
> com.azure.core.util.serializer.JacksonAdapter.(JacksonAdapter.java:81)
>     at 
> com.azure.core.util.serializer.JacksonAdapter.(JacksonAdapter.java:59)
>     at 
> com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.(JacksonAdapter.java:114)
>     at 
> com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at 
> com.azure.identity.implementation.IdentityClient.(IdentityClient.java:104)
>     at 
> com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at 
> com.azure.identity.ClientSecretCredential.(ClientSecretCredential.java:68)
>     at 
> com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at 
> main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code calling azure-core graph:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = graphClient.me().buildRequest().get();
>  
> which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson 

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 - bug changes version to 2.4.1 (Tika's version)

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Description: 
Azure Graph conflict on jackson.

Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching the 
jackson version, of which jackson-core 2.13.3 is desired and set in the pom. 
This results in version conflict - where tika changes the jackson version from 
the desired 2.13.3 to tika's version 2.4.1, which doesn't even exist for 
jackson and would be far too old a version of jackson.
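The "too old" verdict comes from a numeric, component-by-component version comparison, under which 2.4.1 sorts before 2.10.0 even though a plain string comparison would say the opposite. The following is a minimal sketch of such a comparison; the class `VersionCompare` and its `compare` method are hypothetical names for illustration and are not azure-core's own `SemanticVersion` implementation. It assumes purely numeric dotted versions (no `-SNAPSHOT` or similar qualifiers).

```java
public class VersionCompare {

    /**
     * Compares two dotted numeric versions component by component.
     * Missing trailing components are treated as 0, so "2.10" equals "2.10.0".
     * Returns a negative number, zero, or a positive number, like Comparator.compare.
     */
    public static int compare(String a, String b) {
        String[] left = a.split("\\.");
        String[] right = b.split("\\.");
        int length = Math.max(left.length, right.length);
        for (int i = 0; i < length; i++) {
            int l = i < left.length ? Integer.parseInt(left[i]) : 0;
            int r = i < right.length ? Integer.parseInt(right[i]) : 0;
            if (l != r) {
                return Integer.compare(l, r);
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        // Numeric comparison: minor version 4 is lower than minor version 10.
        System.out.println(compare("2.4.1", "2.10.0") < 0);
        // The version requested in the pom would pass the same check.
        System.out.println(compare("2.13.3", "2.10.0") > 0);
        // Lexical comparison gets it wrong because '4' sorts after '1'.
        System.out.println("2.4.1".compareTo("2.10.0") > 0);
    }
}
```

All three lines print `true`, which is why a reported version of 2.4.1 is rejected while 2.13.3 would have been accepted.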

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
INFO  [main] 12:03:19,491 com.azure.core.implementation.jackson.JacksonVersion 
Package versions: jackson-annotations=2.4.1, jackson-core=2.4.1, 
jackson-databind=2.4.1, jackson-dataformat-xml=unknown, 
jackson-datatype-jsr310=2.13.3, azure-core=1.30.0, Troubleshooting version 
conflicts: [https://aka.ms/azsdk/java/dependency/troubleshoot]
ERROR [main] 12:03:19,494 
com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
Exception in thread "main" java.lang.LinkageError: Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
    at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
    at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
    at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
    at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
    at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
    at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
    at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
    at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph

 

My offending code calling azure-core graph:

            final ClientSecretCredential clientSecretCredential = new ClientSecretCredentialBuilder()
                .clientId(clientId)
                .clientSecret(clientSecret)
                .tenantId(tenantId)
                .build();

            final TokenCredentialAuthProvider tokenCredentialAuthProvider =
                new TokenCredentialAuthProvider(scopes, clientSecretCredential);

            final GraphServiceClient graphClient = GraphServiceClient
                .builder()
                .authenticationProvider(tokenCredentialAuthProvider)
                .buildClient();

            final User me = graphClient.me().buildRequest().get();

 

which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
JacksonVersion.class line 43 onwards:

    private JacksonVersion() {
        annotationsVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.annotation.JsonProperty");
        coreVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.core.JsonGenerator");
        databindVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.databind.ObjectMapper");
        xmlVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.dataformat.xml.XmlMapper");
        jsr310Version = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.datatype.jsr310.JavaTimeModule");

 

Which is messed up by: tika-app-2.4.1.jar -> 
com.azure.core.implementation.jackson.JacksonVersion -> JsonGenerator.class

which changes the jackson version from 2.13.3 to tika's 2.4.1... causing a 
world of hurt :(
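
One way to check which jar actually supplies each Jackson class at runtime is to ask the JVM where the class was loaded from and what version its package reports. The following is a diagnostic sketch only, using standard JDK calls; the class name `ClassOrigin` and the method `describe` are hypothetical, and this is not how azure-core itself determines the version.

```java
import java.security.CodeSource;

public class ClassOrigin {

    /** Describes which location a class was loaded from and the version its package reports. */
    public static String describe(String className) {
        final Class<?> clazz;
        try {
            clazz = Class.forName(className);
        } catch (ClassNotFoundException | LinkageError e) {
            return className + ": not loadable (" + e.getClass().getSimpleName() + ")";
        }
        // Classes loaded by the bootstrap loader (e.g. java.lang.String) have no code source.
        CodeSource source = clazz.getProtectionDomain().getCodeSource();
        String location = (source == null || source.getLocation() == null)
                ? "bootstrap/unknown"
                : source.getLocation().toString();
        // Implementation-Version is read from the manifest of the jar that supplied the class,
        // so a bundled copy inside another jar may report that jar's version; it can also be null.
        Package pkg = clazz.getPackage();
        String version = (pkg == null) ? null : pkg.getImplementationVersion();
        return className + " <- " + location + " (Implementation-Version: " + version + ")";
    }

    public static void main(String[] args) {
        // The same classes that JacksonVersion probes in the excerpt above.
        String[] names = {
            "com.fasterxml.jackson.annotation.JsonProperty",
            "com.fasterxml.jackson.core.JsonGenerator",
            "com.fasterxml.jackson.databind.ObjectMapper",
            "com.fasterxml.jackson.dataformat.xml.XmlMapper",
            "com.fasterxml.jackson.datatype.jsr310.JavaTimeModule"
        };
        for (String name : names) {
            System.out.println(describe(name));
        }
    }
}
```

Run with the same classpath as the failing application. If the reported location for JsonGenerator is the tika-app jar rather than a jackson-core jar, that would be consistent with the 2.4.1 value in the log.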

 

 

 

Portions of My POM trying to resolve the conflict:

...

    
       
            
   

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 - bug changes version to 2.4.1 (Tika's version)

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Summary: Azure Graph conflict with Tika-app on (JsonGenerator) jackson 
version 2.13.3 - bug changes version to 2.4.1 (Tika's version)  (was: Azure 
Graph conflict with Tika-app on jackson (JsonGenerator) version - bug changes 
version from 2.13.3 to 2.4.1)

> Azure Graph conflict with Tika-app on (JsonGenerator) jackson version 2.13.3 
> - bug changes version to 2.4.1 (Tika's version)
> 
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
>  Issue Type: Bug
>  Components: app
>Affects Versions: 2.4.1
> Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
> azure-core 1.30.0
>Reporter: Andre Nel
>Priority: Major
>  Labels: Azure, jackson, tika-app, version
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
>     at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
>     at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
>     at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
>     at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code calling azure-core graph:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = 

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - bug changes version from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Summary: Azure Graph conflict with Tika-app on jackson (JsonGenerator) 
version - bug changes version from 2.13.3 to 2.4.1  (was: Azure Graph conflict 
with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 
2.4.1)

> Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - bug 
> changes version from 2.13.3 to 2.4.1
> 
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
>  Issue Type: Bug
>  Components: app
>Affects Versions: 2.4.1
> Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
> azure-core 1.30.0
>Reporter: Andre Nel
>Priority: Major
>  Labels: Azure, jackson, tika-app, version
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
>     at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
>     at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
>     at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
>     at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code calling azure-core graph:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = graphClient.me().buildRequest().get();
>  
> which calls 

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Description: 
Azure Graph conflict on jackson.

Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching the 
jackson version, of which jackson-core 2.13.3 is desired and set in the pom. 
This results in version conflict - where tika changes the jackson version from 
the desired 2.13.3 to tika's version 2.4.1, which doesn't even exist for 
jackson and would be far too old a version of jackson.

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
INFO  [main] 12:03:19,491 com.azure.core.implementation.jackson.JacksonVersion 
Package versions: jackson-annotations=2.4.1, jackson-core=2.4.1, 
jackson-databind=2.4.1, jackson-dataformat-xml=unknown, 
jackson-datatype-jsr310=2.13.3, azure-core=1.30.0, Troubleshooting version 
conflicts: [https://aka.ms/azsdk/java/dependency/troubleshoot]
ERROR [main] 12:03:19,494 
com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
Exception in thread "main" java.lang.LinkageError: Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
    at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
    at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
    at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
    at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
    at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
    at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
    at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
    at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph

 

My offending code calling azure-core graph:

            final ClientSecretCredential clientSecretCredential = new ClientSecretCredentialBuilder()
                .clientId(clientId)
                .clientSecret(clientSecret)
                .tenantId(tenantId)
                .build();

            final TokenCredentialAuthProvider tokenCredentialAuthProvider =
                new TokenCredentialAuthProvider(scopes, clientSecretCredential);

            final GraphServiceClient graphClient = GraphServiceClient
                .builder()
                .authenticationProvider(tokenCredentialAuthProvider)
                .buildClient();

            final User me = graphClient.me().buildRequest().get();

 

which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
JacksonVersion.class line 43 onwards:

    private JacksonVersion() {
        annotationsVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.annotation.JsonProperty");
        coreVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.core.JsonGenerator");
        databindVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.databind.ObjectMapper");
        xmlVersion = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.dataformat.xml.XmlMapper");
        jsr310Version = SemanticVersion.getPackageVersionForClass("com.fasterxml.jackson.datatype.jsr310.JavaTimeModule");

 

 

 

 

Portions of My POM trying to resolve the conflict:

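...

            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-annotations</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-core</artifactId>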
                

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Environment: Java 1.8 Maven Project tika-app 2.4.1 jackson 2.13.3 
azure-core 1.30.0  (was: Java 1.8 Maven Pom)

> Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - 
> changing from 2.13.3 to 2.4.1
> -
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
>  Issue Type: Bug
>  Components: app
>Affects Versions: 2.4.1
> Environment: Java 1.8 Maven Project tika-app 2.4.1 jackson 2.13.3 
> azure-core 1.30.0
>Reporter: Andre Nel
>Priority: Major
>  Labels: Azure, jackson, tika-app, version
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
>     at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
>     at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
>     at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
>     at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = graphClient.me().buildRequest().get();
>  
> which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
> JacksonVersion.class line 43 onwards:
>     private JacksonVersion() {
>         annotationsVersion = 
> 

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
azure-core 1.30.0  (was: Java 1.8 Maven Project tika-app 2.4.1 jackson 2.13.3 
azure-core 1.30.0)

> Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - 
> changing from 2.13.3 to 2.4.1
> -
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
>  Issue Type: Bug
>  Components: app
>Affects Versions: 2.4.1
> Environment: Java 1.8, Maven Project, tika-app 2.4.1, jackson 2.13.3, 
> azure-core 1.30.0
>Reporter: Andre Nel
>Priority: Major
>  Labels: Azure, jackson, tika-app, version
>
> Azure Graph conflict on jackson.
> Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
> name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching 
> the jackson version, of which jackson-core 2.13.3 is desired and set in the 
> pom. This results in version conflict - where tika changes the jackson 
> version from the desired 2.13.3 to tika's version 2.4.1, which doesn't even 
> exist for jackson and would be far too old a version of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> ERROR [main] 12:03:19,494 
> com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
> Exception in thread "main" java.lang.LinkageError: Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
> jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
> azure-core=1.30.0, Troubleshooting version conflicts: 
> [https://aka.ms/azsdk/java/dependency/troubleshoot]
>     at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
>     at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
>     at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
>     at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
>     at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
>     at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
>     at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
>     at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
>     at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
>  
> My offending code:
>             final ClientSecretCredential clientSecretCredential = new 
> ClientSecretCredentialBuilder()
>             .clientId(clientId)
>             .clientSecret(clientSecret)
>             .tenantId(tenantId)
>             .build();
>             final TokenCredentialAuthProvider tokenCredentialAuthProvider = 
> new TokenCredentialAuthProvider(scopes, clientSecretCredential);
>             final GraphServiceClient graphClient =
>             GraphServiceClient
>             .builder()
>             .authenticationProvider(tokenCredentialAuthProvider)
>             .buildClient();
>             final User me = graphClient.me().buildRequest().get();
>  
> which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
> JacksonVersion.class line 43 onwards:
>     

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Description: 
Azure Graph conflict on jackson.

Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching the 
jackson version, of which jackson-core 2.13.3 is desired and set in the pom. 
This results in version conflict - where tika changes the jackson version from 
the desired 2.13.3 to tika's version 2.4.1, which doesn't even exist for 
jackson and would be far too old a version of jackson.

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
INFO  [main] 12:03:19,491 com.azure.core.implementation.jackson.JacksonVersion 
Package versions: jackson-annotations=2.4.1, jackson-core=2.4.1, 
jackson-databind=2.4.1, jackson-dataformat-xml=unknown, 
jackson-datatype-jsr310=2.13.3, azure-core=1.30.0, Troubleshooting version 
conflicts: [https://aka.ms/azsdk/java/dependency/troubleshoot]
ERROR [main] 12:03:19,494 
com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
Exception in thread "main" java.lang.LinkageError: Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
    at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
    at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
    at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
    at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
    at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
    at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
    at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
    at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph
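
The log reports 2.4.1 as older than 2.10.0, which only holds when the version components are compared as numbers rather than as strings. The following is a minimal sketch of such a comparison. It is not azure-core's actual implementation; the class name VersionCompare is hypothetical, and it assumes purely numeric, dot-separated versions with no qualifiers such as -SNAPSHOT.

```java
// Hypothetical illustration, not the azure-core SemanticVersion class.
public class VersionCompare {

    /**
     * Compares two dotted numeric versions component by component.
     * Missing trailing components are treated as 0.
     * Returns a negative value, zero, or a positive value.
     */
    public static int compare(String a, String b) {
        String[] x = a.split("\\.");
        String[] y = b.split("\\.");
        int n = Math.max(x.length, y.length);
        for (int i = 0; i < n; i++) {
            int xi = i < x.length ? Integer.parseInt(x[i]) : 0;
            int yi = i < y.length ? Integer.parseInt(y[i]) : 0;
            if (xi != yi) {
                return Integer.compare(xi, yi);
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        // Minor version 4 is numerically smaller than 10, so 2.4.1 is older.
        System.out.println(compare("2.4.1", "2.10.0") < 0);
        // The desired version is newer than the minimum named in the log.
        System.out.println(compare("2.13.3", "2.10.0") > 0);
    }
}
```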

 

My offending code:

    final ClientSecretCredential clientSecretCredential =
            new ClientSecretCredentialBuilder()
                    .clientId(clientId)
                    .clientSecret(clientSecret)
                    .tenantId(tenantId)
                    .build();

    final TokenCredentialAuthProvider tokenCredentialAuthProvider =
            new TokenCredentialAuthProvider(scopes, clientSecretCredential);

    final GraphServiceClient graphClient =
            GraphServiceClient
                    .builder()
                    .authenticationProvider(tokenCredentialAuthProvider)
                    .buildClient();

    final User me = graphClient.me().buildRequest().get();

 

which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
JacksonVersion.class line 43 onwards:

    private JacksonVersion() {
        annotationsVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.annotation.JsonProperty");
        coreVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.core.JsonGenerator");
        databindVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.databind.ObjectMapper");
        xmlVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.dataformat.xml.XmlMapper");
        jsr310Version = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.datatype.jsr310.JavaTimeModule");
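
Since the version is looked up per class, it can help to print where each of these classes is actually loaded from and what version its jar declares. The sketch below uses only standard JDK calls; the class ClassOrigin is hypothetical. It assumes, without confirmation from the azure-core source, that the reported version comes from the metadata of whichever jar supplied the class. If that jar is one that bundles jackson classes under its own manifest, the declared version would be that jar's version rather than jackson's.

```java
import java.security.CodeSource;

// Hypothetical diagnostic helper, not part of Tika or azure-core.
public class ClassOrigin {

    /** Describes the declared package version and the jar a class was loaded from. */
    public static String describe(String className) {
        try {
            // Load without running static initializers.
            Class<?> type = Class.forName(
                    className, false, ClassOrigin.class.getClassLoader());
            Package pkg = type.getPackage();
            // Read from the Implementation-Version manifest attribute; may be null.
            String version = pkg == null ? null : pkg.getImplementationVersion();
            CodeSource source = type.getProtectionDomain().getCodeSource();
            String location = (source == null || source.getLocation() == null)
                    ? "(bootstrap or unknown)"
                    : source.getLocation().toString();
            return className + " version=" + version + " from " + location;
        } catch (ClassNotFoundException e) {
            return className + " not found on the classpath";
        }
    }

    public static void main(String[] args) {
        String[] names = {
            "com.fasterxml.jackson.annotation.JsonProperty",
            "com.fasterxml.jackson.core.JsonGenerator",
            "com.fasterxml.jackson.databind.ObjectMapper"
        };
        for (String name : names) {
            System.out.println(describe(name));
        }
    }
}
```

Running this with the application's classpath shows which jar wins for each jackson class, which is the information needed to decide which dependency to exclude or replace.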

 

 

 

 

Portions of My POM trying to resolve the conflict:

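...

    <dependencyManagement>
        <dependencies>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-annotations</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-core</artifactId>
                <version>2.13.3</version>
            </dependency>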
         

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

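Environment: Java 1.8 Maven Pom  (was: Java 1.8 Maven Pom

    <dependencyManagement>
        <dependencies>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-annotations</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-core</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-databind</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.dataformat</groupId>
                <artifactId>jackson-dataformat-xml</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.datatype</groupId>
                <artifactId>jackson-datatype-jsr310</artifactId>
                <version>2.13.3</version>
            </dependency>
        </dependencies>
    </dependencyManagement>

        <dependency>
            <groupId>com.azure</groupId>
            <artifactId>azure-core</artifactId>
            <version>1.30.0</version>
            <exclusions>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-annotations</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-core</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-databind</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.dataformat</groupId>
                    <artifactId>jackson-dataformat-xml</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.datatype</groupId>
                    <artifactId>jackson-datatype-jsr310</artifactId>
                </exclusion>
            </exclusions>
        </dependency>

        <dependency>
            <groupId>com.fasterxml.jackson.core</groupId>
            <artifactId>jackson-annotations</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.core</groupId>
            <artifactId>jackson-core</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.core</groupId>
            <artifactId>jackson-databind</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.dataformat</groupId>
            <artifactId>jackson-dataformat-xml</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.datatype</groupId>
            <artifactId>jackson-datatype-jsr310</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>

        <dependency>
            <groupId>org.apache.tika</groupId>
            <artifactId>tika-app</artifactId>
            <version>2.4.1</version>
            <exclusions>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-annotations</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-core</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-databind</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.dataformat</groupId>
                    <artifactId>jackson-dataformat-xml</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.datatype</groupId>
                    <artifactId>jackson-datatype-jsr310</artifactId>
                </exclusion>
            </exclusions>
        </dependency>)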

> Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - 
> changing from 2.13.3 to 2.4.1
> -
>
> Key: TIKA-3817
> URL: https://issues.apache.org/jira/browse/TIKA-3817
> Project: Tika
>  Issue Type: Bug
>  Components: app
>Affects Versions: 2.4.1
> Environment: Java 1.8 Maven Pom
>Reporter: Andre Nel
>Priority: Major
>  Labels: Azure, jackson, tika-app, version
>
> Azure Graph conflict on jackson.
> Both tika-app 2.4.1 and azure-core 1.30.0 ship jars containing a class with 
> the same name, JsonGenerator (com.fasterxml.jackson.core.JsonGenerator), 
> which azure-core inspects to determine the jackson version. jackson-core 
> 2.13.3 is the desired version and is set in the pom. The result is a version 
> conflict: the detected jackson version changes from the desired 2.13.3 to 
> 2.4.1, which is tika's own version number and would be far too old a version 
> of jackson.
>  
> ERROR [main] 12:03:19,483 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-annotations' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-core' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> ERROR [main] 12:03:19,489 
> com.azure.core.implementation.jackson.JacksonVersion Version '2.4.1' of 
> package 'jackson-databind' is not supported (older than earliest supported 
> version - `2.10.0`), please upgrade.
> INFO  [main] 12:03:19,491 
> com.azure.core.implementation.jackson.JacksonVersion Package versions: 
> jackson-annotations=2.4.1, jackson-core=2.4.1, 

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Description: 
Azure Graph conflict on jackson.

Both tika-app 2.4.1 and azure-core 1.30.0 ship jars containing a class with the 
same name, JsonGenerator (com.fasterxml.jackson.core.JsonGenerator), which 
azure-core inspects to determine the jackson version. jackson-core 2.13.3 is 
the desired version and is set in the pom. The result is a version conflict: 
the detected jackson version changes from the desired 2.13.3 to 2.4.1, which is 
tika's own version number and would be far too old a version of jackson.

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
INFO  [main] 12:03:19,491 com.azure.core.implementation.jackson.JacksonVersion 
Package versions: jackson-annotations=2.4.1, jackson-core=2.4.1, 
jackson-databind=2.4.1, jackson-dataformat-xml=unknown, 
jackson-datatype-jsr310=2.13.3, azure-core=1.30.0, Troubleshooting version 
conflicts: [https://aka.ms/azsdk/java/dependency/troubleshoot]
ERROR [main] 12:03:19,494 
com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
Exception in thread "main" java.lang.LinkageError: Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
    at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
    at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
    at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
    at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
    at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
    at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
    at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
    at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph

 

My offending code:

    final ClientSecretCredential clientSecretCredential =
            new ClientSecretCredentialBuilder()
                    .clientId(clientId)
                    .clientSecret(clientSecret)
                    .tenantId(tenantId)
                    .build();

    final TokenCredentialAuthProvider tokenCredentialAuthProvider =
            new TokenCredentialAuthProvider(scopes, clientSecretCredential);

    final GraphServiceClient graphClient =
            GraphServiceClient
                    .builder()
                    .authenticationProvider(tokenCredentialAuthProvider)
                    .buildClient();

    final User me = graphClient.me().buildRequest().get();

 

which calls azure-core-1.30.0.jar -> com.azure.core.implementation.jackson -> 
JacksonVersion.class line 43 onwards:

    private JacksonVersion() {
        annotationsVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.annotation.JsonProperty");
        coreVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.core.JsonGenerator");
        databindVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.databind.ObjectMapper");
        xmlVersion = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.dataformat.xml.XmlMapper");
        jsr310Version = SemanticVersion.getPackageVersionForClass(
            "com.fasterxml.jackson.datatype.jsr310.JavaTimeModule");

  was:
Azure Graph conflict on jackson.

Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetching the 
jackson version, of which jackson-core 2.13.3 is desired and set in the pom. 
This results in version conflict - where tika changes the 

[jira] [Updated] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)


 [ 
https://issues.apache.org/jira/browse/TIKA-3817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andre Nel updated TIKA-3817:

Description: 
Azure Graph conflict on jackson.

Both tika-app 2.4.1 and azure-core 1.30.0 ship jars containing a class with the 
same name, JsonGenerator (com.fasterxml.jackson.core.JsonGenerator), which 
azure-core inspects to determine the jackson version. jackson-core 2.13.3 is 
the desired version and is set in the pom. The result is a version conflict: 
the detected jackson version changes from the desired 2.13.3 to 2.4.1, which is 
tika's own version number and would be far too old a version of jackson.

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
INFO  [main] 12:03:19,491 com.azure.core.implementation.jackson.JacksonVersion 
Package versions: jackson-annotations=2.4.1, jackson-core=2.4.1, 
jackson-databind=2.4.1, jackson-dataformat-xml=unknown, 
jackson-datatype-jsr310=2.13.3, azure-core=1.30.0, Troubleshooting version 
conflicts: [https://aka.ms/azsdk/java/dependency/troubleshoot]
ERROR [main] 12:03:19,494 
com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
Exception in thread "main" java.lang.LinkageError: Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, jackson-datatype-jsr310=2.13.3, 
azure-core=1.30.0, Troubleshooting version conflicts: 
[https://aka.ms/azsdk/java/dependency/troubleshoot]
    at com.azure.core.implementation.jackson.ObjectMapperShim.createHeaderMapper(ObjectMapperShim.java:138)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:81)
    at com.azure.core.util.serializer.JacksonAdapter.<init>(JacksonAdapter.java:59)
    at com.azure.core.util.serializer.JacksonAdapter$SerializerAdapterHolder.<clinit>(JacksonAdapter.java:114)
    at com.azure.core.util.serializer.JacksonAdapter.createDefaultSerializerAdapter(JacksonAdapter.java:123)
    at com.azure.identity.implementation.IdentityClient.<init>(IdentityClient.java:104)
    at com.azure.identity.implementation.IdentityClientBuilder.build(IdentityClientBuilder.java:158)
    at com.azure.identity.ClientSecretCredential.<init>(ClientSecretCredential.java:68)
    at com.azure.identity.ClientSecretCredentialBuilder.build(ClientSecretCredentialBuilder.java:93)
    at main.java.emailtranslator.EmailTranslator.connectOffice365MicrosoftGraph

 

Offending code:

    final ClientSecretCredential clientSecretCredential =
            new ClientSecretCredentialBuilder()
                    .clientId(clientId)
                    .clientSecret(clientSecret)
                    .tenantId(tenantId)
                    .build();

    final TokenCredentialAuthProvider tokenCredentialAuthProvider =
            new TokenCredentialAuthProvider(scopes, clientSecretCredential);

    final GraphServiceClient graphClient =
            GraphServiceClient
                    .builder()
                    .authenticationProvider(tokenCredentialAuthProvider)
                    .buildClient();

    final User me = graphClient.me().buildRequest().get();

 

  was:
Azure Graph conflict on jackson.

Both Tika-app 2.4.1 and Azure-core 1.30.0 use jars with classes of the same 
name JsonGenerator (com.fasterxml.jackson.core.JsonGenerator) when fetting the 
jackson version of which jackson-core 2.13.3 is desired and set in the pom. 
This results in version conflict - where tika changes the jackson version from 
the desired 2.13.3 to tika's version 2.4.1, which doesn't even exist for 
jackson and would be far too old a version of jackson.

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest 

[jira] [Created] (TIKA-3817) Azure Graph conflict with Tika-app on jackson (JsonGenerator) version - changing from 2.13.3 to 2.4.1

2022-07-14 Thread Andre Nel (Jira)
Andre Nel created TIKA-3817:
---

 Summary: Azure Graph conflict with Tika-app on jackson 
(JsonGenerator) version - changing from 2.13.3 to 2.4.1
 Key: TIKA-3817
 URL: https://issues.apache.org/jira/browse/TIKA-3817
 Project: Tika
  Issue Type: Bug
  Components: app
Affects Versions: 2.4.1
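 Environment: Java 1.8 Maven Pom

    <dependencyManagement>
        <dependencies>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-annotations</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-core</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.core</groupId>
                <artifactId>jackson-databind</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.dataformat</groupId>
                <artifactId>jackson-dataformat-xml</artifactId>
                <version>2.13.3</version>
            </dependency>
            <dependency>
                <groupId>com.fasterxml.jackson.datatype</groupId>
                <artifactId>jackson-datatype-jsr310</artifactId>
                <version>2.13.3</version>
            </dependency>
        </dependencies>
    </dependencyManagement>

        <dependency>
            <groupId>com.azure</groupId>
            <artifactId>azure-core</artifactId>
            <version>1.30.0</version>
            <exclusions>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-annotations</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-core</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-databind</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.dataformat</groupId>
                    <artifactId>jackson-dataformat-xml</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.datatype</groupId>
                    <artifactId>jackson-datatype-jsr310</artifactId>
                </exclusion>
            </exclusions>
        </dependency>

        <dependency>
            <groupId>com.fasterxml.jackson.core</groupId>
            <artifactId>jackson-annotations</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.core</groupId>
            <artifactId>jackson-core</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.core</groupId>
            <artifactId>jackson-databind</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.dataformat</groupId>
            <artifactId>jackson-dataformat-xml</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>
        <dependency>
            <groupId>com.fasterxml.jackson.datatype</groupId>
            <artifactId>jackson-datatype-jsr310</artifactId>
            <version>2.13.3</version>
            <scope>provided</scope>
        </dependency>

        <dependency>
            <groupId>org.apache.tika</groupId>
            <artifactId>tika-app</artifactId>
            <version>2.4.1</version>
            <exclusions>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-annotations</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-core</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.core</groupId>
                    <artifactId>jackson-databind</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.dataformat</groupId>
                    <artifactId>jackson-dataformat-xml</artifactId>
                </exclusion>
                <exclusion>
                    <groupId>com.fasterxml.jackson.datatype</groupId>
                    <artifactId>jackson-datatype-jsr310</artifactId>
                </exclusion>
            </exclusions>
        </dependency>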
Reporter: Andre Nel


Azure Graph conflict on jackson.

Both tika-app 2.4.1 and azure-core 1.30.0 ship jars containing a class with the 
same name, JsonGenerator (com.fasterxml.jackson.core.JsonGenerator), which 
azure-core inspects to determine the jackson version. jackson-core 2.13.3 is 
the desired version and is set in the pom. The result is a version conflict: 
the detected jackson version changes from the desired 2.13.3 to 2.4.1, which is 
tika's own version number and would be far too old a version of jackson.

 

ERROR [main] 12:03:19,483 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-annotations' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-core' is not supported (older than earliest 
supported version - `2.10.0`), please upgrade.
ERROR [main] 12:03:19,489 com.azure.core.implementation.jackson.JacksonVersion 
Version '2.4.1' of package 'jackson-databind' is not supported (older than 
earliest supported version - `2.10.0`), please upgrade.
INFO  [main] 12:03:19,491 com.azure.core.implementation.jackson.JacksonVersion 
Package versions: jackson-annotations=2.4.1, jackson-core=2.4.1, 
jackson-databind=2.4.1, jackson-dataformat-xml=unknown, 
jackson-datatype-jsr310=2.13.3, azure-core=1.30.0, Troubleshooting version 
conflicts: https://aka.ms/azsdk/java/dependency/troubleshoot
ERROR [main] 12:03:19,494 
com.azure.core.implementation.jackson.ObjectMapperShim Package versions: 
jackson-annotations=2.4.1, jackson-core=2.4.1, jackson-databind=2.4.1, 
jackson-dataformat-xml=unknown, 

[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-14 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1751#comment-1751
 ] 

Hudson commented on TIKA-3795:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #684 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/684/])
TIKA-3795: update aws (tilman: 
[https://github.com/apache/tika/commit/a42a6198344cae94a6bd806fb023511849a964d0])
* (edit) tika-parent/pom.xml


> General upgrades for 2.4.2
> --
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
>  Issue Type: Improvement
>  Components: build
>Reporter: Tilman Hausherr
>Priority: Minor
> Fix For: 2.4.2
>
> Attachments: image-2022-07-12-06-46-00-215.png
>
>



