[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2
[ https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564517#comment-17564517 ]

Hudson commented on TIKA-3795:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #674 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/674/])
TIKA-3795: add exclusion for jetty-io (tilman: [https://github.com/apache/tika/commit/0f5570691133c75ac4472c3340354a6c4080b104])
* (edit) tika-parent/pom.xml

> General upgrades for 2.4.2
> --------------------------
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
> Issue Type: Improvement
> Components: build
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.4.2
>

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2
[ https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564506#comment-17564506 ]

Hudson commented on TIKA-3795:
------------------------------

FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #673 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/673/])
TIKA-3795: update aws, google cloud (tilman: [https://github.com/apache/tika/commit/ff7771301ecdb97524dcd0d1ad55c5ab563b6aa7])
* (edit) tika-parent/pom.xml

> General upgrades for 2.4.2
> --------------------------
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
> Issue Type: Improvement
> Components: build
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.4.2
>
Re: remove jdk14 build from ci
On 24.06.2022 at 11:23, Tim Allison wrote:
> +1 on jdk14
>
> We still want to make sure that branch_1x still works with at least jdk8 and jdk11?

Yes, we want all of that to work; my thought is to remove the intermediate versions while keeping the extremes.

Tilman

> On Thu, Jun 23, 2022 at 11:29 PM Tilman Hausherr wrote:
>> I propose to remove the jdk14 build job. Reason: I think it's enough to keep the lowest (8) and the highest (17, I just changed it to 18) build; intermediate versions won't bring any surprises. Keeping jdk14 is a waste of energy. Same argument for tika-branch1x-jdk11.
>>
>> Tilman
[jira] [Commented] (TIKA-1770) AutoDetectParser wrongly detects plain text as images/audio
[ https://issues.apache.org/jira/browse/TIKA-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564468#comment-17564468 ]

Emil Zegers commented on TIKA-1770:
-----------------------------------

Wrong recognition of text files still happens with Tika 2.4.1. Took me a while to understand what was going on and then I found the bug report. Curious to understand what would be needed to fix this. Happy user of Tika but I don't know the code base yet.

> AutoDetectParser wrongly detects plain text as images/audio
> -----------------------------------------------------------
>
> Key: TIKA-1770
> URL: https://issues.apache.org/jira/browse/TIKA-1770
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.10
> Environment: OS independent (tested on both Windows, MAC OS)
> Reporter: Ziqi
> Priority: Minor
> Attachments: the-acl-rd-tec_chunk_10228.txt, the-acl-rd-tec_chunk_15.txt, the-acl-rd-tec_chunk_9113.txt
>
>
> AutoDetectParser fails to recognize certain plain-text files as plain text.
> In the attachment are three testing files, as you can see they are all plain text.
> The following code is used for testing:
>
> AutoDetectParser parser = new AutoDetectParser();
> for (File f : new File("path").listFiles()) {
>     InputStream in = new BufferedInputStream(new FileInputStream(f.toString()));
>     BodyContentHandler handler = new BodyContentHandler(-1);
>     Metadata metadata = new Metadata();
>     try {
>         parser.parse(in, handler, metadata);
>         String content = handler.toString();
>         System.out.println(metadata); //line A
>     }catch (Exception e){
>         e.printStackTrace();
>     }
> }
>
> for the three testing files, line A prints the following:
> X-Parsed-By=org.apache.tika.parser.EmptyParser
> Content-Type=image/x-portable-bitmap
> X-Parsed-By=org.apache.tika.parser.DefaultParser
> X-Parsed-By=org.apache.tika.parser.mp3.Mp3Parser xmpDM:audioCompressor=MP3
> Content-Type=audio/mpeg
> X-Parsed-By=org.apache.tika.parser.EmptyParser
> Content-Type=image/x-portable-bitmap
> And as a result, variable "content" is always empty.
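The report does not say why these files are misdetected, and the attachments are not reproduced here. One plausible mechanism, offered only as an assumption, is prefix ("magic") matching: if a detector decides the type from the first few bytes alone, ordinary prose that happens to begin with a short signature is classified as a binary format. The toy detector below is a sketch of that failure mode in plain Java. The class name `MagicPrefixSketch`, the `detect` method, and the two-entry table are all hypothetical and are not Tika's actual rules or API.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

class MagicPrefixSketch {
    // Hypothetical magic table for illustration only; NOT Tika's real rule set.
    private static final Map<String, String> MAGIC = new LinkedHashMap<>();
    static {
        MAGIC.put("P1", "image/x-portable-bitmap");
        MAGIC.put("ID3", "audio/mpeg");
    }

    // Returns the type of the first matching prefix, or text/plain if none matches.
    static String detect(byte[] head) {
        String start = new String(head, StandardCharsets.ISO_8859_1);
        for (Map.Entry<String, String> entry : MAGIC.entrySet()) {
            if (start.startsWith(entry.getKey())) {
                return entry.getValue();
            }
        }
        return "text/plain";
    }

    public static void main(String[] args) {
        // Ordinary prose that happens to start with a short signature.
        byte[] unlucky = "P1 is the first paragraph".getBytes(StandardCharsets.US_ASCII);
        byte[] ordinary = "Plain prose".getBytes(StandardCharsets.US_ASCII);
        System.out.println(detect(unlucky));
        System.out.println(detect(ordinary));
    }
}
```

Under this assumption, a two-character signature is weak evidence on its own, so a more robust detector would need to weigh it against other signals such as the file name or a check that the remaining bytes look like text. Whether that applies to the three attached files would have to be confirmed against the files themselves.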
[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2
[ https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564176#comment-17564176 ]

Hudson commented on TIKA-3795:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #672 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/672/])
TIKA-3795: add comment (tilman: [https://github.com/apache/tika/commit/ee8323ceb5f9b4173b0633be0e3aa2ed513de604])
* (edit) tika-parent/pom.xml

> General upgrades for 2.4.2
> --------------------------
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
> Issue Type: Improvement
> Components: build
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.4.2
>
[GitHub] [tika] THausherr merged pull request #603: Bump azure-storage-blob from 12.17.1 to 12.18.0
THausherr merged PR #603:
URL: https://github.com/apache/tika/pull/603

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@tika.apache.org

For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2
[ https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564138#comment-17564138 ]

Hudson commented on TIKA-3795:
------------------------------

FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #671 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/671/])
TIKA-3795: update comment (tilman: [https://github.com/apache/tika/commit/3307e2998613422b5489faaa36d38a86f1b74f56])
* (edit) tika-parent/pom.xml

> General upgrades for 2.4.2
> --------------------------
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
> Issue Type: Improvement
> Components: build
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.4.2
>