[jira] [Commented] (TIKA-3395) Make Inner Classes Static If Possible to Prevent Memory Leaks

2021-05-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3395?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342903#comment-17342903 ] ASF GitHub Bot commented on TIKA-3395: -- kamaci opened a new pull request #438: URL: h

[GitHub] [tika] kamaci opened a new pull request #438: fix for TIKA-3395 contributed by kamaci

2021-05-11 Thread GitBox
kamaci opened a new pull request #438: URL: https://github.com/apache/tika/pull/438 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please

[jira] [Updated] (TIKA-3395) Make Inner Classes Static If Possible to Prevent Memory Leaks

2021-05-11 Thread Furkan Kamaci (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Furkan Kamaci updated TIKA-3395: Description: A static inner class does not keep an implicit reference to its enclosing instance. Th

[jira] [Created] (TIKA-3395) Make Inner Classes Static If Possible to Prevent Memory Leaks

2021-05-11 Thread Furkan Kamaci (Jira)
Furkan Kamaci created TIKA-3395: --- Summary: Make Inner Classes Static If Possible to Prevent Memory Leaks Key: TIKA-3395 URL: https://issues.apache.org/jira/browse/TIKA-3395 Project: Tika Issue

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342898#comment-17342898 ] Nick Burch commented on TIKA-3392: -- Not sure how easy / possible / user friendly this wou

[jira] [Created] (TIKA-3394) Integrate async into tika-app in 2.x

2021-05-11 Thread Tim Allison (Jira)
Tim Allison created TIKA-3394: - Summary: Integrate async into tika-app in 2.x Key: TIKA-3394 URL: https://issues.apache.org/jira/browse/TIKA-3394 Project: Tika Issue Type: Task Componen

[jira] [Commented] (TIKA-3391) Refactor fetchiterators to pipesinterators in 2.x, clean up pipesiteratormanager

2021-05-11 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342864#comment-17342864 ] Hudson commented on TIKA-3391: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #2

[jira] [Commented] (TIKA-3393) Refactor metadata filters to use new ConfigBase in 2.x

2021-05-11 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342801#comment-17342801 ] Hudson commented on TIKA-3393: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #2

[jira] [Resolved] (TIKA-3393) Refactor metadata filters to use new ConfigBase in 2.x

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3393. --- Assignee: Tim Allison Resolution: Fixed > Refactor metadata filters to use new ConfigBase in 2.x

Re: 2.0.0-BETA?

2021-05-11 Thread Oleg Tikhonov
Hi Tim, Thanks for the effort! +1. BR, Oleg On Tue, May 11, 2021, 16:51 Tim Allison wrote: > All, > What would you say to a beta release towards the end of this > week/beginning of next? > > Cheers, > > Tim >

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-11 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342725#comment-17342725 ] Peter Kronenberg commented on TIKA-3361: Yes, I agree it would be great to have ad

[jira] [Created] (TIKA-3393) Refactor metadata filters to use new ConfigBase in 2.x

2021-05-11 Thread Tim Allison (Jira)
Tim Allison created TIKA-3393: - Summary: Refactor metadata filters to use new ConfigBase in 2.x Key: TIKA-3393 URL: https://issues.apache.org/jira/browse/TIKA-3393 Project: Tika Issue Type: Task

[jira] [Resolved] (TIKA-3351) Make list of parsers in metadata unique

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3351. --- Fix Version/s: 1.27 Resolution: Fixed > Make list of parsers in metadata unique > -

[jira] [Commented] (TIKA-3391) Refactor fetchiterators to pipesinterators in 2.x, clean up pipesiteratormanager

2021-05-11 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342679#comment-17342679 ] Hudson commented on TIKA-3391: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #2

[jira] [Comment Edited] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342661#comment-17342661 ] Tim Allison edited comment on TIKA-3392 at 5/11/21, 3:43 PM: -

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342664#comment-17342664 ] Andrei Dobrescu commented on TIKA-3392: --- > Every xml parser we use _should_ grab a s

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342661#comment-17342661 ] Tim Allison commented on TIKA-3392: --- >Seems like there are 5 occurrences of secure proces

[jira] [Comment Edited] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342642#comment-17342642 ] Andrei Dobrescu edited comment on TIKA-3392 at 5/11/21, 3:16 PM: ---

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342642#comment-17342642 ] Andrei Dobrescu commented on TIKA-3392: --- "Frankly, I think the intent of that portio

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342639#comment-17342639 ] Andrei Dobrescu commented on TIKA-3392: --- Seems like there are 5 occurrences of secure

[jira] [Updated] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Dobrescu updated TIKA-3392: -- Attachment: image-2021-05-11-18-12-15-300.png > Apache Tika V1.26 doen't work on Android anymore

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342637#comment-17342637 ] Tim Allison commented on TIKA-3392: --- Uh, secure processing is needed and is important fo

[jira] [Updated] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Dobrescu updated TIKA-3392: -- Attachment: image-2021-05-11-18-10-40-949.png > Apache Tika V1.26 doen't work on Android anymore

[jira] [Comment Edited] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342626#comment-17342626 ] Andrei Dobrescu edited comment on TIKA-3392 at 5/11/21, 3:08 PM: ---

[jira] [Comment Edited] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342626#comment-17342626 ] Andrei Dobrescu edited comment on TIKA-3392 at 5/11/21, 3:07 PM: ---

[jira] [Comment Edited] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342626#comment-17342626 ] Andrei Dobrescu edited comment on TIKA-3392 at 5/11/21, 3:05 PM: ---

[jira] [Comment Edited] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342626#comment-17342626 ] Andrei Dobrescu edited comment on TIKA-3392 at 5/11/21, 3:04 PM: ---

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342626#comment-17342626 ] Andrei Dobrescu commented on TIKA-3392: --- I did a bit of research before posting this

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342625#comment-17342625 ] Tim Allison commented on TIKA-3392: --- Is that the only xml parser available on Android?

[jira] [Updated] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Dobrescu updated TIKA-3392: -- Attachment: image-2021-05-11-17-53-58-291.png > Apache Tika V1.26 doen't work on Android anymore

[jira] [Commented] (TIKA-3390) Migrate Language Level to Java 8

2021-05-11 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342616#comment-17342616 ] Hudson commented on TIKA-3390: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #2

[jira] [Updated] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Dobrescu updated TIKA-3392: -- Description: I use Apache Tika on Android in order to detect mime type of various files: Apache

[jira] [Updated] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrei Dobrescu updated TIKA-3392: -- Description: I use Apache Tika on Android in order to detect mime type of various files: Apache

[jira] [Created] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)
Andrei Dobrescu created TIKA-3392: - Summary: Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies. Key: TIKA-3392 URL: https://issues.apache.org/jira/browse/TIKA-3392 Proj

[jira] [Resolved] (TIKA-3391) Refactor fetchiterators to pipesinterators in 2.x, clean up pipesiteratormanager

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3391. --- Fix Version/s: 2.0.0 Assignee: Tim Allison Resolution: Fixed > Refactor fetchiterators

[jira] [Resolved] (TIKA-3370) Refactor the AsyncProcessor in 2.x

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3370. --- Resolution: Fixed > Refactor the AsyncProcessor in 2.x > -- > >

Re: high level parser module names in 2.x

2021-05-11 Thread Eric Pugh
Sounds good to me. On Tue, May 11, 2021 at 9:33 AM Tim Allison wrote: > If there aren't objections, I'll make this change today or tomorrow. > > Cheers, > >Tim > > On Tue, Apr 20, 2021 at 10:57 AM Tim Allison wrote: > > > > How about: > > > > standard > > extended > > ml (for machin

2.0.0-BETA?

2021-05-11 Thread Tim Allison
All, What would you say to a beta release towards the end of this week/beginning of next? Cheers, Tim

[jira] [Created] (TIKA-3391) Refactor fetchiterators to pipesinterators in 2.x, clean up pipesiteratormanager

2021-05-11 Thread Tim Allison (Jira)
Tim Allison created TIKA-3391: - Summary: Refactor fetchiterators to pipesinterators in 2.x, clean up pipesiteratormanager Key: TIKA-3391 URL: https://issues.apache.org/jira/browse/TIKA-3391 Project: Tika

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342572#comment-17342572 ] Tim Allison commented on TIKA-3361: --- What I'm wrestling with is that I'm hoping to add m

Re: high level parser module names in 2.x

2021-05-11 Thread Tim Allison
If there aren't objections, I'll make this change today or tomorrow. Cheers, Tim On Tue, Apr 20, 2021 at 10:57 AM Tim Allison wrote: > > How about: > > standard > extended > ml (for machine learning) > > On Wed, Mar 10, 2021 at 10:37 AM Nick Burch wrote: > > > > On Tue, 9 Mar 2021,

[jira] [Commented] (TIKA-3390) Migrate Language Level to Java 8

2021-05-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342567#comment-17342567 ] ASF GitHub Bot commented on TIKA-3390: -- tballison merged pull request #437: URL: http

[GitHub] [tika] tballison merged pull request #437: fix for TIKA-3390 contributed by kamaci

2021-05-11 Thread GitBox
tballison merged pull request #437: URL: https://github.com/apache/tika/pull/437

[jira] [Commented] (TIKA-3388) Ole10Native attachments with non-ASCII filenames extracted with garbled names

2021-05-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342551#comment-17342551 ] Tim Allison commented on TIKA-3388: --- Thank you for sharing this with us and generating a