This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git.
from af7adbe TIKA-2888 Add wmv2 codec detection for WMV files (#272)
new 38e393e TIKA-3104 -- add bplist subtype detector
new 201adcc Merge remote-tracking branch 'origin/branch_1x' into branch_1x
new 9e53cec TIKA-3106 Magic header detection for emails starting with an ARC- signature header, and for the 2+ header cases also allow Received as a matching header
new 2526321 trivial bug in FuzzingCLI in calculating total max time for child process
new 252fdac TIKA-3112 -- fix npe triggered by null supported types in TikaGUI
new 2b10d9c TIKA-3111 -- upgrade to PDFBox 2.0.20 -- need to understand testUnmappedUnicodeStats()
new aad237c TIKA-3110 -- revert throwing IOException if a parser tries to skip past the end of an inputstream...make TikaInputStream live up to its Java contract at the risk of infinite loops in dependencies that weren't written with security in mind.
new c4702b8 TIKA-3115 -- add detection for parquet files
new 9d64a90 Update PDFParser.properties (#322)
new 6d066cb add heif mimetype support (#278)
new 2320d0b tika-mimetypes: Add mimetypes for .mpd, .m3u8 and .m4s (#320)
new b7246b4 Disable external DTD + Stylesheets with the TransformerFactory (#276)
new f4c8e8f fix merge conflicts
The 13 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails. The revisions
listed as "add" were already present in the repository and have only
been added to this reference.
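
The TIKA-3110 revision listed above mentions a risk of infinite loops in code that skips past the end of a stream. As a minimal sketch of why that can happen, and not the actual Tika change: `java.io.InputStream.skip` may return 0 both at end of stream and in other situations, so a caller that loops until the requested count is reached can spin forever. The class and method names below (`SkipFully`, `skipFully`) are hypothetical and exist only for illustration; the sketch probes with `read()` to tell end of stream apart from a short skip.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper for illustration only; this is not part of Tika.
final class SkipFully {

    private SkipFully() {}

    /**
     * Tries to skip toSkip bytes and returns how many were actually skipped.
     * A naive loop such as "while (remaining > 0) remaining -= in.skip(remaining);"
     * never terminates once skip() starts returning 0 at end of stream.
     */
    static long skipFully(InputStream in, long toSkip) throws IOException {
        long remaining = toSkip;
        while (remaining > 0) {
            long n = in.skip(remaining);
            if (n <= 0) {
                // skip() may return 0 without being at end of stream,
                // so probe with a single-byte read to find out.
                if (in.read() == -1) {
                    break; // genuine end of stream: stop instead of spinning
                }
                n = 1; // the probing read() consumed one byte
            }
            remaining -= n;
        }
        return toSkip - remaining;
    }

    public static void main(String[] args) throws IOException {
        InputStream in = new ByteArrayInputStream(new byte[10]);
        System.out.println(skipFully(in, 4));   // skips 4 of the 10 bytes
        System.out.println(skipFully(in, 100)); // only 6 bytes remain
    }
}
```

A caller that needs exactly `toSkip` bytes can compare the return value with the request and decide for itself whether a short skip is an error.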
Summary of changes:
.../apache/tika/config/TikaConfigSerializer.java | 4 +-
.../java/org/apache/tika/io/TikaInputStream.java | 3 -
.../java/org/apache/tika/mime/MimeTypesReader.java | 8 +-
.../java/org/apache/tika/utils/XMLReaderUtils.java | 14 +++
.../org/apache/tika/mime/tika-mimetypes.xml | 71 ++++++++++++-
.../org/apache/tika/fuzzing/cli/FuzzingCLI.java | 2 +-
tika-parsers/pom.xml | 2 +-
.../apache/tika/parser/apple/BPListDetector.java | 114 +++++++++++++++++++++
.../apple/{PListParser.java => BPListParser.java} | 33 ++++--
.../image/{WebPParser.java => HeifParser.java} | 36 ++++---
.../tika/parser/image/ImageMetadataExtractor.java | 14 +++
.../apache/tika/parser/pdf/AbstractPDF2XHTML.java | 4 +
.../services/org.apache.tika.detect.Detector | 1 +
.../services/org.apache.tika.parser.Parser | 3 +-
.../apache/tika/parser/pdf/PDFParser.properties | 2 +-
.../tika/detect/TestContainerAwareDetector.java | 7 ++
...{PListParserTest.java => BPListParserTest.java} | 18 +++-
.../{WebPParserTest.java => HeifParserTest.java} | 43 ++++----
.../org/apache/tika/parser/pdf/PDFParserTest.java | 15 ++-
.../test/resources/test-documents/IMG_1034.heic | Bin 0 -> 1499892 bytes
.../resources/test-documents/testMemgraph.memgraph | Bin 0 -> 646412 bytes
21 files changed, 326 insertions(+), 68 deletions(-)
create mode 100644 tika-parsers/src/main/java/org/apache/tika/parser/apple/BPListDetector.java
rename tika-parsers/src/main/java/org/apache/tika/parser/apple/{PListParser.java => BPListParser.java} (84%)
copy tika-parsers/src/main/java/org/apache/tika/parser/image/{WebPParser.java => HeifParser.java} (73%)
rename tika-parsers/src/test/java/org/apache/tika/parser/apple/{PListParserTest.java => BPListParserTest.java} (68%)
copy tika-parsers/src/test/java/org/apache/tika/parser/image/{WebPParserTest.java => HeifParserTest.java} (53%)
create mode 100644 tika-parsers/src/test/resources/test-documents/IMG_1034.heic
create mode 100644 tika-parsers/src/test/resources/test-documents/testMemgraph.memgraph