[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-08 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564517#comment-17564517
 ] 

Hudson commented on TIKA-3795:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #674 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/674/])
TIKA-3795: add exclusion for jetty-io (tilman: 
[https://github.com/apache/tika/commit/0f5570691133c75ac4472c3340354a6c4080b104])
* (edit) tika-parent/pom.xml


> General upgrades for 2.4.2
> --
>
> Key: TIKA-3795
> URL: https://issues.apache.org/jira/browse/TIKA-3795
> Project: Tika
> Issue Type: Improvement
> Components: build
> Reporter: Tilman Hausherr
> Priority: Minor
> Fix For: 2.4.2
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-08 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564506#comment-17564506
 ] 

Hudson commented on TIKA-3795:
--

FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #673 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/673/])
TIKA-3795: update aws, google cloud (tilman: 
[https://github.com/apache/tika/commit/ff7771301ecdb97524dcd0d1ad55c5ab563b6aa7])
* (edit) tika-parent/pom.xml








Re: remove jdk14 build from ci

2022-07-08 Thread Tilman Hausherr

On 24.06.2022 at 11:23, Tim Allison wrote:

+1 on jdk14

We still want to make sure that branch_1x still works with at least jdk8
and jdk11?


Yes, we want all of that to work; my thought is to remove the intermediate 
versions while keeping the extremes.


Tilman





On Thu, Jun 23, 2022 at 11:29 PM Tilman Hausherr wrote:


I propose to remove the jdk14 build job. Reason: I think it's enough to
keep the lowest (8) and the highest (17, which I just changed to 18) builds;
intermediate versions won't bring any surprises. Keeping jdk14 is a
waste of energy.

The same argument applies to tika-branch1x-jdk11.

Tilman






[jira] [Commented] (TIKA-1770) AutoDetectParser wrongly detects plain text as images/audio

2022-07-08 Thread Emil Zegers (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-1770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564468#comment-17564468
 ] 

Emil Zegers commented on TIKA-1770:
---

Text files are still misdetected with Tika 2.4.1. It took me a while to 
understand what was going on before I found this bug report. I'm curious 
what would be needed to fix this. I'm a happy user of Tika, but I don't 
know the code base yet.

> AutoDetectParser wrongly detects plain text as images/audio
> ---
>
> Key: TIKA-1770
> URL: https://issues.apache.org/jira/browse/TIKA-1770
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.10
> Environment: OS independent (tested on both Windows and macOS)
> Reporter: Ziqi
> Priority: Minor
> Attachments: the-acl-rd-tec_chunk_10228.txt, 
> the-acl-rd-tec_chunk_15.txt, the-acl-rd-tec_chunk_9113.txt
>
>
> AutoDetectParser fails to recognize certain plain-text files as plain text.
> Attached are three test files; as you can see, they are all plain text.
> The following code is used for testing:
> 
> AutoDetectParser parser = new AutoDetectParser();
> for (File f : new File("path").listFiles()) {
>     InputStream in = new BufferedInputStream(new FileInputStream(f.toString()));
>     BodyContentHandler handler = new BodyContentHandler(-1);
>     Metadata metadata = new Metadata();
>     try {
>         parser.parse(in, handler, metadata);
>         String content = handler.toString();
>         System.out.println(metadata); // line A
>     } catch (Exception e) {
>         e.printStackTrace();
>     }
> }
> 
> For the three test files, line A prints the following:
> X-Parsed-By=org.apache.tika.parser.EmptyParser Content-Type=image/x-portable-bitmap
> X-Parsed-By=org.apache.tika.parser.DefaultParser X-Parsed-By=org.apache.tika.parser.mp3.Mp3Parser xmpDM:audioCompressor=MP3 Content-Type=audio/mpeg
> X-Parsed-By=org.apache.tika.parser.EmptyParser Content-Type=image/x-portable-bitmap
> As a result, the variable "content" is always empty.
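
As a stopgap while detection misfires on such files, a caller who already expects text input could pre-check the bytes before relying on auto-detection. The sketch below uses only the JDK. `PlainTextCheck` and `looksLikePlainText` are hypothetical names, not Tika API, and the heuristic (strict UTF-8 with no control characters besides tab, LF, and CR) is an assumption about what counts as plain text here; it does not reflect how Tika's detector works.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Tika: a crude "is this plain text?" pre-check.
final class PlainTextCheck {

    private PlainTextCheck() {
    }

    // Returns true if the bytes decode as strict UTF-8 and contain no
    // control characters other than tab, line feed, and carriage return.
    static boolean looksLikePlainText(byte[] data) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        final CharSequence text;
        try {
            text = decoder.decode(ByteBuffer.wrap(data));
        } catch (CharacterCodingException e) {
            return false; // not valid UTF-8, so treat it as binary
        }
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isISOControl(c) && c != '\t' && c != '\n' && c != '\r') {
                return false; // NUL and similar bytes suggest binary content
            }
        }
        return true;
    }
}
```

When the check passes, one option is to bypass auto-detection and hand the stream to a text-specific parser such as Tika's `TXTParser` instead of `AutoDetectParser`. Whether that suits a given pipeline has not been tested here.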





[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-08 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564176#comment-17564176
 ] 

Hudson commented on TIKA-3795:
--

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #672 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/672/])
TIKA-3795: add comment (tilman: 
[https://github.com/apache/tika/commit/ee8323ceb5f9b4173b0633be0e3aa2ed513de604])
* (edit) tika-parent/pom.xml








[GitHub] [tika] THausherr merged pull request #603: Bump azure-storage-blob from 12.17.1 to 12.18.0

2022-07-08 Thread GitBox


THausherr merged PR #603:
URL: https://github.com/apache/tika/pull/603


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@tika.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Commented] (TIKA-3795) General upgrades for 2.4.2

2022-07-08 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17564138#comment-17564138
 ] 

Hudson commented on TIKA-3795:
--

FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #671 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/671/])
TIKA-3795: update comment (tilman: 
[https://github.com/apache/tika/commit/3307e2998613422b5489faaa36d38a86f1b74f56])
* (edit) tika-parent/pom.xml





