[jira] [Commented] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17334428#comment-17334428 ] ASF GitHub Bot commented on TIKA-3374: -- Ryan421 opened a new pull request #433: URL:

[GitHub] [tika] Ryan421 opened a new pull request #433: [TIKA-3374] Apply charset detection for archive entry name

2021-04-27 Thread GitBox
Ryan421 opened a new pull request #433: URL: https://github.com/apache/tika/pull/433 Fixes #TIKA-3374 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about

[jira] [Updated] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-27 Thread Ryan Liu (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Liu updated TIKA-3374: --- Description: PackageParser retrieves archive entry name through commons-compress archiver's

[jira] [Created] (TIKA-3374) Non-Unicode archive entry name is garbled

2021-04-27 Thread Ryan Liu (Jira)
Ryan Liu created TIKA-3374: -- Summary: Non-Unicode archive entry name is garbled Key: TIKA-3374 URL: https://issues.apache.org/jira/browse/TIKA-3374 Project: Tika Issue Type: Bug

[GitHub] [tika-helm] philipsoutham opened a new pull request #2: Locking down the Tika environment

2021-04-27 Thread GitBox
philipsoutham opened a new pull request #2: URL: https://github.com/apache/tika-helm/pull/2 Dropping all kernel capabilities and not running as root user. This starts and seems to work loading the default page, but I would like to have a full test suite to make sure it doesn't break under

[jira] [Commented] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Caleb Cushing (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1704#comment-1704 ] Caleb Cushing commented on TIKA-3373: - Before... Any suggestions on how to do that with gradle and a

[jira] [Commented] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333295#comment-17333295 ] Nick Burch commented on TIKA-3373: -- You can't override a built-in type. For now, just grab the updated

[jira] [Commented] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Caleb Cushing (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333270#comment-17333270 ] Caleb Cushing commented on TIKA-3373: - Is the right fix for me for now to copy the same code to a

[jira] [Commented] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333249#comment-17333249 ] Hudson commented on TIKA-3373: -- UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #209 (See

[jira] [Commented] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333233#comment-17333233 ] Hudson commented on TIKA-3373: -- SUCCESS: Integrated in Jenkins build Tika » tika-branch1x-jdk8 #120 (See

[jira] [Commented] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333178#comment-17333178 ] Nick Burch commented on TIKA-3373: -- Thanks for that SO post, very helpful to see what people are commonly

[jira] [Updated] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Caleb Cushing (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Caleb Cushing updated TIKA-3373: Description: seems that the tika definition for yaml only includes `yaml` as a valid file

[jira] [Commented] (TIKA-3372) Fix writelimit in recursiveparserhandler

2021-04-27 Thread Julien Massiera (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333168#comment-17333168 ] Julien Massiera commented on TIKA-3372: --- Concerning the behavior you describe [~tallison], what does

[jira] [Created] (TIKA-3373) add "yml" as extension

2021-04-27 Thread Caleb Cushing (Jira)
Caleb Cushing created TIKA-3373: --- Summary: add "yml" as extension Key: TIKA-3373 URL: https://issues.apache.org/jira/browse/TIKA-3373 Project: Tika Issue Type: New Feature

[jira] [Commented] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server..

2021-04-27 Thread Simon Lucy (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333139#comment-17333139 ] Simon Lucy commented on TIKA-3348: --- The metadata is extracted and it is consistent with returning

[jira] [Comment Edited] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server..

2021-04-27 Thread Simon Lucy (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333127#comment-17333127 ] Simon Lucy edited comment on TIKA-3348 at 4/27/21, 11:03 AM: -- It's a pretty

[jira] [Commented] (TIKA-3348) Improve the workflow for extracting and returning images from PDFs and other containers using Tika Server..

2021-04-27 Thread Simon Lucy (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333127#comment-17333127 ] Simon Lucy commented on TIKA-3348: --- It's a pretty straightforward use case, extracting images from

[jira] [Comment Edited] (TIKA-3372) Fix writelimit in recursiveparserhandler

2021-04-27 Thread Julien Massiera (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333027#comment-17333027 ] Julien Massiera edited comment on TIKA-3372 at 4/27/21, 8:09 AM: -

[jira] [Commented] (TIKA-3372) Fix writelimit in recursiveparserhandler

2021-04-27 Thread Julien Massiera (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17333027#comment-17333027 ] Julien Massiera commented on TIKA-3372: --- [~tallison] here is my use case :  I send a simple txt