[jira] [Commented] (TIKA-3311) Add github workflows to Tika

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296430#comment-17296430 ] Lewis John McGibbney commented on TIKA-3311: bq. Is it because PRs are not run through

[jira] [Comment Edited] (TIKA-3311) Add github workflows to Tika

2021-03-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296315#comment-17296315 ] Tim Allison edited comment on TIKA-3311 at 3/5/21, 8:16 PM: I'm not against

[jira] [Commented] (TIKA-3311) Add github workflows to Tika

2021-03-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296315#comment-17296315 ] Tim Allison commented on TIKA-3311: I'm not against this...the more CI, the better. How is this

[jira] [Commented] (TIKA-94) Speech-to-text transcription

2021-03-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296301#comment-17296301 ] ASF GitHub Bot commented on TIKA-94: lewismc edited a comment on pull request #406: URL:

[jira] [Commented] (TIKA-94) Speech-to-text transcription

2021-03-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296300#comment-17296300 ] ASF GitHub Bot commented on TIKA-94: lewismc commented on pull request #406: URL:

[GitHub] [tika] lewismc edited a comment on pull request #406: [TIKA-94] Speech-to-text transcription

2021-03-05 Thread GitBox
lewismc edited a comment on pull request #406: URL: https://github.com/apache/tika/pull/406#issuecomment-791632511 I feel that this patch is ready for testing by the community so I removed the `WIP` This is an automated

[GitHub] [tika] lewismc commented on pull request #406: [TIKA-94] Speech-to-text transcription

2021-03-05 Thread GitBox
lewismc commented on pull request #406: URL: https://github.com/apache/tika/pull/406#issuecomment-791632511 I feel that this patch is ready for testing by the community.

[jira] [Commented] (TIKA-3311) Add github workflows to Tika

2021-03-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296298#comment-17296298 ] ASF GitHub Bot commented on TIKA-3311: lewismc commented on pull request #407: URL:

[GitHub] [tika] lewismc commented on pull request #407: [TIKA-3311] Add github workflows to Tika

2021-03-05 Thread GitBox
lewismc commented on pull request #407: URL: https://github.com/apache/tika/pull/407#issuecomment-791631615 Any comments folks?

[GitHub] [tika] peterkronenberg commented on pull request #411: TIKA-3313 Improve performance and usability of RereadableInputStream

2021-03-05 Thread GitBox
peterkronenberg commented on pull request #411: URL: https://github.com/apache/tika/pull/411#issuecomment-791617757 Just realized that I accidentally deleted droste.zip in the first commit. Because this file gets flagged by my anti-virus program, it would be very difficult for me to

[jira] [Commented] (TIKA-3313) Improve performance and usability of RereadableInputStream

2021-03-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296285#comment-17296285 ] ASF GitHub Bot commented on TIKA-3313: peterkronenberg commented on pull request #411: URL:

Re: 1.26?

2021-03-05 Thread Tim Allison
All, James Ahlborn modified jackcess-crypt for us and made a new release. I've made the upgrade in our branch_1x, and I ran a comparison of the msaccess files we have in our corpus: https://corpora.tika.apache.org/base/reports/tika_1_25_v_1_26_msaccess_reports.tgz No surprises if we upgrade to

Pull Request #411 for RereadableInputStream

2021-03-05 Thread Peter Kronenberg
Created a pull request yesterday for some changes in RereadableInputStream. Can someone review?

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296246#comment-17296246 ] Peter Kronenberg commented on TIKA-3310: Ok, I've gone ahead and separated them. It searches for

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296122#comment-17296122 ] Peter Kronenberg commented on TIKA-3310: oh yeah, you're right > MP4 video detected as

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296103#comment-17296103 ] Nick Burch commented on TIKA-3310: I think we need to do the loop twice though, once checking major, and

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296085#comment-17296085 ] Peter Kronenberg commented on TIKA-3310: done > MP4 video detected as application/mp4 >

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296080#comment-17296080 ] Peter Kronenberg commented on TIKA-3310: Ah, now I understand what you're saying. Ok, I will

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296076#comment-17296076 ] Nick Burch commented on TIKA-3310: My worry, though I don't know if it could happen, is e.g. major=3g2c

[jira] [Commented] (TIKA-3310) MP4 video detected as application/mp4

2021-03-05 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296065#comment-17296065 ] Peter Kronenberg commented on TIKA-3310: [~nick] What are your current thoughts? Do you still

FW: OSS-Fuzz integration

2021-03-05 Thread Nick Burch
Hi All, For those who don't follow dev@commons, there's yet another fuzzing tool on the block! Details below. Looks pretty neat, and is now being used on a few Apache Commons projects, including Commons Compress, which we use. What do people think about more fuzzing? Worth doing? Or just too