[
https://issues.apache.org/jira/browse/TIKA-4365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Subbu updated TIKA-4365:
------------------------
Description:
AAB file goes through DefaultZipContainerDetector and gets detected as
_application/java-archive_ since it has MANIFEST.MF via JarDetector.
They have their own content type as mentioned in tika-mimetypes.xml -
application/x-authorware-bin
[https://github.com/apache/tika/blob/6a12d7cf1a2ee37dce899933a05520604330c947/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml#L3485]
The AAB file structure has a AndroidManifest.xml similar to apk archive but in
[base/manifest/|http://base/manifest/] directory.
Android Dev documentation reference :
[https://developer.android.com/guide/app-bundle/app-bundle-format]
was:
AAB file goes through DefaultZipContainerDetector and gets detected as
_application/java-archive_ since it has MANIFEST.MF via JarDetector.
They have their own content type as mentioned in tika-mimetypes.xml -
application/x-authorware-bin
[https://github.com/apache/tika/blob/6a12d7cf1a2ee37dce899933a05520604330c947/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml#L3485]
The AAB file structure has a AndroidManifest.xml similar to apk archive but in
[base/manifest/|http://base/manifest/] directory.
Android Dev documentation reference :
[https://developer.android.com/guide/app-bundle/app-bundle-format]
Also adding BundleConfig.pb for better accuracy may help.
> Support Android Bundle aab detection
> ------------------------------------
>
> Key: TIKA-4365
> URL: https://issues.apache.org/jira/browse/TIKA-4365
> Project: Tika
> Issue Type: Bug
> Components: tika-core
> Reporter: Subbu
> Priority: Critical
>
> AAB file goes through DefaultZipContainerDetector and gets detected as
> _application/java-archive_ since it has MANIFEST.MF via JarDetector.
> They have their own content type as mentioned in tika-mimetypes.xml -
> application/x-authorware-bin
> [https://github.com/apache/tika/blob/6a12d7cf1a2ee37dce899933a05520604330c947/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml#L3485]
> The AAB file structure has a AndroidManifest.xml similar to apk archive but
> in [base/manifest/|http://base/manifest/] directory.
> Android Dev documentation reference :
> [https://developer.android.com/guide/app-bundle/app-bundle-format]
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)