[ https://issues.apache.org/jira/browse/TIKA-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17047850#comment-17047850 ]
Hudson commented on TIKA-3057:
------------------------------
SUCCESS: Integrated in Jenkins build Tika-trunk #1780 (See [https://builds.apache.org/job/Tika-trunk/1780/])
TIKA-3057 -- improve detection of some zip based files (tallison: [https://github.com/apache/tika/commit/c89fc0c95937b71e9c1a1b5905f34e0dc1cb650f])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/iwork/iwana/IWork13PackageParser.java
* (edit) tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml
* (add) tika-parsers/src/main/java/org/apache/tika/parser/iwork/iwana/IWork18PackageParser.java
* (add) tika-parsers/src/test/resources/test-documents/testKeynote2018.key
* (add) tika-parsers/src/test/resources/test-documents/testStarOffice-6.0-draw.sxi
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/pkg/PackageParser.java
* (edit) CHANGES.txt
* (add) tika-parsers/src/test/resources/test-documents/testStarOffice-6.0-writer.sxw
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/pkg/StreamingZipContainerDetector.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/pkg/ZipContainerDetector.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/pkg/ZipContainerDetectorBase.java
* (add) tika-parsers/src/test/resources/test-documents/testOpenOffice-autotext.bau
* (add) tika-parsers/src/test/resources/test-documents/testStarOffice-6.0-draw.sxd
* (add) tika-parsers/src/test/resources/test-documents/testOpenOffice-extension.oxt
* (add) tika-parsers/src/test/resources/test-documents/testStarOffice-6.0-calc.sxc
* (edit) tika-parsers/src/test/java/org/apache/tika/detect/TestContainerAwareDetector.java
* (add) tika-parsers/src/test/resources/test-documents/testStarOffice-6.0-writer-template.stw
> Improve detection of zip-based formats
> --------------------------------------
>
> Key: TIKA-3057
> URL: https://issues.apache.org/jira/browse/TIKA-3057
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Assignee: Tim Allison
> Priority: Major
> Fix For: 1.24
>
>
> In crawling the OpenOffice and LibreOffice bug trackers, I found a number of
> StarOffice/LibreOffice zip-based formats that we aren't currently detecting.
> I also found that Apple changed its format for .pages, .numbers, and .keynote
> in 2018! YAY!!!!!