[jira] [Commented] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415782#comment-17415782 ] Nick Burch commented on TIKA-3555: -- Doesn't that make us look more dodgy, and more likely

[jira] [Commented] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415770#comment-17415770 ] Tim Allison commented on TIKA-3555: --- On third thought, maybe we should obfuscate those f

[jira] [Commented] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415753#comment-17415753 ] Tim Allison commented on TIKA-3555: --- May want to share this file with ESET: https://git

[jira] [Comment Edited] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415753#comment-17415753 ] Tim Allison edited comment on TIKA-3555 at 9/15/21, 8:52 PM: -

[jira] [Commented] (TIKA-3550) Some DXF files are detected as text/plain

2021-09-15 Thread Robin Schimpf (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415668#comment-17415668 ] Robin Schimpf commented on TIKA-3550: - Thank you for fixing the problem so quick! > S

[jira] [Comment Edited] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415641#comment-17415641 ] Tim Allison edited comment on TIKA-3555 at 9/15/21, 5:04 PM: -

[jira] [Commented] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415641#comment-17415641 ] Tim Allison commented on TIKA-3555: --- I'm happy to document that file and we actually hav

[jira] [Commented] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415640#comment-17415640 ] Tim Allison commented on TIKA-3554: --- I'm not able to reproduce this in Tika 2.x. Have y

[jira] [Commented] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415638#comment-17415638 ] Tim Allison commented on TIKA-3554: --- Are you only using tika-core? Or are you bringing

[jira] [Comment Edited] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415626#comment-17415626 ] Tim Allison edited comment on TIKA-3556 at 9/15/21, 4:35 PM: -

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415626#comment-17415626 ] Tim Allison commented on TIKA-3556: --- In addition to that (which is easily fixable), thin

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415566#comment-17415566 ] Tim Allison commented on TIKA-3556: --- Y, I agree on the above. One unfortunate bit is th

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Simon Gaeremynck (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415549#comment-17415549 ] Simon Gaeremynck commented on TIKA-3556: {quote}To confirm, your detectors are in

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415541#comment-17415541 ] Tim Allison commented on TIKA-3556: --- Other point I notice is that this only affects file

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415533#comment-17415533 ] Tim Allison commented on TIKA-3556: --- And then I see a TODO: {{//TODO: OPCBased needs to

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415531#comment-17415531 ] Tim Allison commented on TIKA-3556: --- Wait, no, I was testing a bad odt file. I agree

[jira] [Commented] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415521#comment-17415521 ] Tim Allison commented on TIKA-3556: --- Able to reproduce this. I'm surprised our unit test

[jira] [Commented] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415477#comment-17415477 ] Krisztián Gyula Tóth commented on TIKA-3554: "Where the rough type is known, A

[jira] [Commented] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415473#comment-17415473 ] Krisztián Gyula Tóth commented on TIKA-3555: [~nick] Thanks for the quick repl

[jira] [Commented] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415460#comment-17415460 ] Nick Burch commented on TIKA-3554: -- If you want Apache Tika to do detection only on the f

[jira] [Commented] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17415439#comment-17415439 ] Nick Burch commented on TIKA-3555: -- See TIKA-259 This file will make an underpowered com

[jira] [Created] (TIKA-3556) DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath

2021-09-15 Thread Simon Gaeremynck (Jira)
Simon Gaeremynck created TIKA-3556: -- Summary: DefaultZipContainerDetector returns application/zip for .odt files when OPCPackageDetector is on the classpath Key: TIKA-3556 URL: https://issues.apache.org/jira/brow

[jira] [Updated] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3555: --- Attachment: eset_tika_alert.png > Eset antivirus found threat in the GitHub repo after

[jira] [Updated] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3555: --- Description: I've just cloned this GitHub repo  [https://github.com/apache/tika]  when

[jira] [Updated] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3555: --- Attachment: (was: image (10).png) > Eset antivirus found threat in the GitHub repo

[jira] [Created] (TIKA-3555) Eset antivirus found threat in the GitHub repo after Git clone

2021-09-15 Thread Jira
Krisztián Gyula Tóth created TIKA-3555: -- Summary: Eset antivirus found threat in the GitHub repo after Git clone Key: TIKA-3555 URL: https://issues.apache.org/jira/browse/TIKA-3555 Project: Tika

[jira] [Updated] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3554: --- Description: *Given* a simple plain text file with the file extension `.zip` and with

[jira] [Updated] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3554: --- Description: *Given* a simple plain text file with the file extension `.zip` and with

[jira] [Updated] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3554: --- Labels: mime-type (was: ) > Detect plain text file as application/zip based on file e

[jira] [Updated] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3554: --- Description: *Given* a simple plain text file with the file extension `.zip` and with

[jira] [Updated] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3554: --- Description: *Given* a simple plain text file with the file extension `.zip` and with

[jira] [Updated] (TIKA-3554) Detect plain text file as application/zip based on file ext wrong

2021-09-15 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztián Gyula Tóth updated TIKA-3554: --- Summary: Detect plain text file as application/zip based on file ext wrong (was: Dete

[jira] [Created] (TIKA-3554) Detect plain text file as application/zip based on file ext false

2021-09-15 Thread Jira
Krisztián Gyula Tóth created TIKA-3554: -- Summary: Detect plain text file as application/zip based on file ext false Key: TIKA-3554 URL: https://issues.apache.org/jira/browse/TIKA-3554 Project: Ti