Re: [VOTE] Release Apache Tika 2.0.0-BETA Candidate #1

2021-05-20 Thread Oleg Tikhonov
Hi Tim, My +1. Ubuntu 20, basic stuff. Java 11. Best regards, Oleg > On 19 May 2021, at 18:29, Tim Allison wrote: > > All, > > A candidate for the Tika 2.0.0-BETA release is available at: > https://dist.apache.org/repos/dist/dev/tika/ > > The release candidate is a zip archive of the sources

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-20 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348945#comment-17348945 ] Peter Kronenberg commented on TIKA-3361: The code already explicitly checks for th

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348848#comment-17348848 ] Tim Allison commented on TIKA-3361: --- Frankly, as long as we never get a divide by zero e

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-20 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348844#comment-17348844 ] Peter Kronenberg commented on TIKA-3361: No problem, I can take care of it.  What

[jira] [Comment Edited] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348793#comment-17348793 ] Tim Allison edited comment on TIKA-3361 at 5/20/21, 8:50 PM: -

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348793#comment-17348793 ] Tim Allison commented on TIKA-3361: --- How about the PR as is, but change the terms to "fa

[jira] [Commented] (TIKA-3361) Improve intelligence of OCRStrategy=AUTO

2021-05-20 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348689#comment-17348689 ] Peter Kronenberg commented on TIKA-3361: [~tallison] Still thinking? :) > Improv

[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348688#comment-17348688 ] Hudson commented on TIKA-3270: -- UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #

[jira] [Commented] (TIKA-3408) Apache Tika 1.26 Metadata for MP4 and MP3.

2021-05-20 Thread Danny McKinney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348679#comment-17348679 ] Danny McKinney commented on TIKA-3408: -- ExifTool Version Number : 12.25 File Name : B

[jira] [Reopened] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-3270: --- Have to rework the logic a bit. The rendering strategy default is "render with no text and then run OCR"

[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348657#comment-17348657 ] Tim Allison commented on TIKA-3270: --- This is a breaking change. The default is now to r

[jira] [Resolved] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3270. --- Fix Version/s: 2.0.0 Assignee: Tim Allison Resolution: Fixed This is now in 2.0.0. I

[jira] [Commented] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348620#comment-17348620 ] Tim Allison commented on TIKA-3270: --- [~tilman] it really is that easy! :D > Render non-

[jira] [Updated] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-3270: -- Attachment: tiger-no-text.png > Render non-text in PDFs for OCR > --- > >

[jira] [Updated] (TIKA-3270) Render non-text in PDFs for OCR

2021-05-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-3270: -- Attachment: test-no-text.png test.png > Render non-text in PDFs for OCR > --

[jira] [Commented] (TIKA-3410) Clean up logging in PipesServer

2021-05-20 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348597#comment-17348597 ] Hudson commented on TIKA-3410: -- UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #

Re: Recording/Streaming Apache Tika Virtual Meetings to YouTube

2021-05-20 Thread Rich Bowen
On 2021/05/20 09:07:16, Bertrand Delacretaz wrote: .. > > I am not aware of any such service (or tool) that is provided by ASF > > to the projects to host the meetings... > > I think this is changing, our conferences team might soon be able to > provide conferencing services for projects to h

Re: Recording/Streaming Apache Tika Virtual Meetings to YouTube

2021-05-20 Thread Swapnil M Mane
Great, thank you Bertrand and Sally! Lewis, wishing best to the Tika community and you for the event! Best Regards, Swapnil M Mane, www.apache.org On Thu, May 20, 2021 at 3:11 PM Sally Khudairi wrote: > > +1; thank you, Bertrand. > > If memory serves me correctly, we recently hosted a Cassandra

[jira] [Commented] (TIKA-3408) Apache Tika 1.26 Metadata for MP4 and MP3.

2021-05-20 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17348193#comment-17348193 ] Nick Burch commented on TIKA-3408: -- I'm not sure what you mean by an epoch date here, and

Re: Recording/Streaming Apache Tika Virtual Meetings to YouTube

2021-05-20 Thread Sally Khudairi
+1; thank you, Bertrand. If memory serves me correctly, we recently hosted a Cassandra event using our Hopin account. The turnout was more than 4x than anticipated. Here's hoping you'll have a great Tika community event! Once you have the recordings, we'll be happy to help post to the ASF's You

Re: Recording/Streaming Apache Tika Virtual Meetings to YouTube

2021-05-20 Thread Bertrand Delacretaz
Hi, On Thu, May 20, 2021 at 10:51 AM Swapnil M Mane wrote: > On Wed, May 19, 2021 at 11:27 PM lewis john mcgibbney > wrote: > > ...The meeting was hosted on a paid version of WebEx. It would be great if > > we could move away from this for the next meeting. > > > > I am not aware of any such se

Re: Recording/Streaming Apache Tika Virtual Meetings to YouTube

2021-05-20 Thread Swapnil M Mane
Hi Lewis, Please find my comments inline. On Wed, May 19, 2021 at 11:27 PM lewis john mcgibbney wrote: > > Hi Swapnil, > Excellent., Thank you. Replies inline below > > On Wed, May 19, 2021 at 9:53 AM Swapnil M Mane > wrote: >> >> >> If it is a community meetup where the participant has active