[jira] [Commented] (TIKA-3343) Remove Tika custom lang detection for 2.x

2021-03-30 Thread Kenneth William Krugler (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311844#comment-17311844 ] Kenneth William Krugler commented on TIKA-3343: --- [~tallison] - I didn't find the specific

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311842#comment-17311842 ] ASF GitHub Bot commented on TIKA-3340: -- kkrugler commented on pull request #421: URL:

[GitHub] [tika] kkrugler commented on pull request #421: [TIKA-3340] LanguageProfile for Myanmar

2021-03-30 Thread GitBox
kkrugler commented on pull request #421: URL: https://github.com/apache/tika/pull/421#issuecomment-810625585 Hi @arky - thanks for the PR! Would it be possible to add `my` to the list of languages being tested in `LanguageIdentifierTest`? You'd have to add a

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Arky (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311815#comment-17311815 ] Arky commented on TIKA-3340: {color:#1d1c1d}Here is list of Asian languages + 1 East African language.

[jira] [Comment Edited] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Arky (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311815#comment-17311815 ] Arky edited comment on TIKA-3340 at 3/30/21, 9:09 PM: -- {color:#1d1c1d}Here is list of

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311805#comment-17311805 ] Tim Allison commented on TIKA-3340: --- We have Thai, Vietnamese, Malay, Tagalog. In addition to Burmese,

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311795#comment-17311795 ] Tim Allison commented on TIKA-3340: --- Ah, interesting. Thank you for the background. Are there other

[jira] [Commented] (TIKA-3343) Remove Tika custom lang detection for 2.x

2021-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311794#comment-17311794 ] Tim Allison commented on TIKA-3343: --- Are you finding better performance (accuracy/resource consumption)

[jira] [Commented] (TIKA-1993) Image Recognition with Tika

2021-03-30 Thread Arky (Jira)
[ https://issues.apache.org/jira/browse/TIKA-1993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311791#comment-17311791 ] Arky commented on TIKA-1993: We have a downstream use case for this. We'll like to help ICIJ Datashare users

[jira] [Commented] (TIKA-3343) Remove Tika custom lang detection for 2.x

2021-03-30 Thread Peter Kronenberg (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311785#comment-17311785 ] Peter Kronenberg commented on TIKA-3343: I'm using this functionality. I don't care if it's

[jira] [Comment Edited] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Arky (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311776#comment-17311776 ] Arky edited comment on TIKA-3340 at 3/30/21, 8:10 PM: -- {color:#1d1c1d}[~tallison]

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Arky (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311776#comment-17311776 ] Arky commented on TIKA-3340: {color:#1d1c1d}[~tallison] This ngram was created using corpus of modern Burmese

[jira] [Created] (TIKA-3343) Remove Tika custom lang detection for 2.x

2021-03-30 Thread Tim Allison (Jira)
Tim Allison created TIKA-3343: - Summary: Remove Tika custom lang detection for 2.x Key: TIKA-3343 URL: https://issues.apache.org/jira/browse/TIKA-3343 Project: Tika Issue Type: Task

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311738#comment-17311738 ] Tim Allison commented on TIKA-3340: --- [~arky], thank you for this pull request! Out of curiosity, which

[jira] [Commented] (TIKA-3340) LanguageProfile for Myanmar

2021-03-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311731#comment-17311731 ] ASF GitHub Bot commented on TIKA-3340: -- arky opened a new pull request #421: URL:

[GitHub] [tika] arky opened a new pull request #421: [TIKA-3340] LanguageProfile for Myanmar

2021-03-30 Thread GitBox
arky opened a new pull request #421: URL: https://github.com/apache/tika/pull/421 Adds Myanmar LanguageProfile for Apache Tika https://issues.apache.org/jira/browse/TIKA-3340 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

[jira] [Resolved] (TIKA-3342) Update security page for recently fixed cve

2021-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3342. --- Resolution: Fixed > Update security page for recently fixed cve >

[jira] [Created] (TIKA-3342) Update security page for recently fixed cve

2021-03-30 Thread Tim Allison (Jira)
Tim Allison created TIKA-3342: - Summary: Update security page for recently fixed cve Key: TIKA-3342 URL: https://issues.apache.org/jira/browse/TIKA-3342 Project: Tika Issue Type: Task

CVE-2021-28657: Infinite loop in Apache Tika's MP3 parser

2021-03-30 Thread Tim Allison
Description: A carefully crafted or corrupt file may trigger an infinite loop in Tika's MP3Parser up to and including Tika 1.25. Apache Tika users should upgrade to 1.26 or later. Mitigation: Users should upgrade to 1.26 or later. Credit: Apache Tika would like to thank Khaled Nassar for