[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-12 Thread Andrei Dobrescu (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343732#comment-17343732
 ] 

Andrei Dobrescu commented on TIKA-3392:
---

thanks for the fix!

I will wait for the next release, will test then.

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Fix For: 1.27
>
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTyp

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-12 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343551#comment-17343551
 ] 

Tim Allison commented on TIKA-3392:
---

For TikaConfig, we're using the XMLReaderUtils, which also warns on failed 
security but does not crash; same for ExternalParsersConfigReader.  I can't 
remember now why I created a whole separate system for mimetypes... :( Will 
look through some issues.

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Fix For: 1.27
>
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.cr

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-12 Thread Nick Burch (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343485#comment-17343485
 ] 

Nick Burch commented on TIKA-3392:
--

[~tallison] What about the other Tika "own" XML files like Tika Config or 
External Parsers definitions, should we not do the same thing for them?

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Fix For: 1.27
>
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-12 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343465#comment-17343465
 ] 

Hudson commented on TIKA-3392:
--

UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #231 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/231/])
TIKA-3392 -- allow insecure parsing in MimeTypesReader; log a warning 
(tallison: 
[https://github.com/apache/tika/commit/03b6baf1efdb43b299cbe025184cf24cca9a8847])
* (edit) tika-core/src/main/java/org/apache/tika/mime/MimeTypesReader.java


> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Fix For: 1.27
>
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tik

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-12 Thread Hudson (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343444#comment-17343444
 ] 

Hudson commented on TIKA-3392:
--

SUCCESS: Integrated in Jenkins build Tika » tika-branch1x-jdk8 #129 (See 
[https://ci-builds.apache.org/job/Tika/job/tika-branch1x-jdk8/129/])
TIKA-3392 -- allow insecure parsing in MimeTypesReader (tallison: 
[https://github.com/apache/tika/commit/06c111f82a14a34492b3302b4d8310645b6f8366])
* (edit) tika-core/src/main/java/org/apache/tika/mime/MimeTypesReader.java


> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Fix For: 1.27
>
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.Mi

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-12 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343412#comment-17343412
 ] 

Tim Allison commented on TIKA-3392:
---

Thank you [~nick], I split the difference -- logged a warning in 
MimeTypesReader and left the other as it was.

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Fix For: 1.27
>
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.co

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Nick Burch (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342898#comment-17342898
 ] 

Nick Burch commented on TIKA-3392:
--

Not sure how easy / possible / user friendly this would be, but... my 
suggestion would be:
 * SAX Parser for Tika related XML file (config, mime types, external parser 
definitions etc) - hopefully fairly safe - log insecure parser but continue
 * SAX Parser for external XML files called via parsers - potentially dangerous 
- fail by default, but allow users to set an unsafe static flag if they really 
want to risk it

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342664#comment-17342664
 ] 

Andrei Dobrescu commented on TIKA-3392:
---

> Every xml parser we use _should_ grab a secure parser.
> You don't, but we do

Yes, of course, I agree.

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMi

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342661#comment-17342661
 ] 

Tim Allison commented on TIKA-3392:
---

>Seems like there are 5 occurences of secure processing...
Every xml parser we use _should_ grab a secure parser.  As the stacktrace 
shows, though, we have two different factories for creating a sax parser, one 
is in XMLReaderUtils and one is in MimesTypesReader

> No, I don't have a mime type xml configuration file, if that is what you 
> mean. I just use AutoDetectParser as it is.
You don't, but we do: 
https://github.com/apache/tika/blob/main/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml

I'm happy enough adding catching and logging to MimeTypesReader.  Does this 
cause any fellow devs heartburn?
Any fellow devs have heartburn over swallowing

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.j

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342642#comment-17342642
 ] 

Andrei Dobrescu commented on TIKA-3392:
---

"Frankly, I think the intent of that portion of the code from the beginning was 
to log a warning if the secure processing components weren't available, but to 
go forth with an insecure parser."

Yes, that would solve the issue.

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.ap

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342639#comment-17342639
 ] 

Andrei Dobrescu commented on TIKA-3392:
---

Seems like there are 5 occurences of secure processing...

!image-2021-05-11-18-10-40-949.png!

And 186 of import org.xml.sax
!image-2021-05-11-18-12-15-300.png!

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png, image-2021-05-11-18-12-15-300.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342637#comment-17342637
 ] 

Tim Allison commented on TIKA-3392:
---

Uh, secure processing is needed and is important for files in the wild.  

However, if we're reading our own mimes xml, we can probably relax the use of 
secure processing.  Frankly, I think the intent of that portion of the code 
from the beginning was to log a warning if the secure processing components 
weren't available, but to go forth with an insecure parser.

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png, 
> image-2021-05-11-18-10-40-949.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> 

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Andrei Dobrescu (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342626#comment-17342626
 ] 

Andrei Dobrescu commented on TIKA-3392:
---

I did a bit of research before posting this issue. Thing is:
- All Android apps do bundle all their dependent libraries. So if in app A1 you 
import library L1 with version V1 and in app A2 you import library L1 with 
version V2, it will be ok, because the APK file format is containerized.
- The exception are the classes from the Android SDK. The SDK is the only 
system-level library, common to all apps. It contains Java SE classes, Android 
specific classes, such as the UI toolkit. Problem is, when they developed 
Android, some Genius from Google thought it's a good idea to put in the SDK 
JSON.org, Apache HTTP client, org.xml.*, org.xmlpull.* libraries. [You can find 
the documen tation of the SDK 
here|https://developer.android.com/reference/packages]


As you can see, the SDK contains an implementation of org.xml.sax. I can import 
latest Apache Xerces but org.xml.* classes will always resolve to the ones from 
the SDK. The classes from the SDK doesn't support "secure-processing", and 
because of that Tika library will crash.

I can think of 3 solutions to this problem:
- Guys from Google could update or remove their org.xml.* classes from the SDK. 
This surely won't happen.
- I can stop using Tika, and start using another mime type detector, such as 
the linux file command: [https://stackoverflow.com/a/2227201/11536597|like 
this]. I could compile the [http://www.darwinsys.com/file/|source code] to 
target Android, then bundle the native library.
- Tika could stop using secure-processing XML feature. Why is it even needed? 
Is it important? Can the library work without it? It basically crashes at 
MimeTypesReader.java:429 / newSaxParser method / 
factory.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, true);

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesR

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml dependencies.

2021-05-11 Thread Tim Allison (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342625#comment-17342625
 ] 

Tim Allison commented on TIKA-3392:
---

Is that the only xml parser available on Android?  If you can show us how to 
prevent xxe and xml bombs with harmony, we can add that...maybe?

> Apache Tika V1.26 doen't work on Android anymore. Issue with org.xml 
> dependencies.
> --
>
> Key: TIKA-3392
> URL: https://issues.apache.org/jira/browse/TIKA-3392
> Project: Tika
>  Issue Type: Bug
>  Components: core
>Affects Versions: 1.26
> Environment: Android 11
>Reporter: Andrei Dobrescu
>Priority: Major
>  Labels: android
> Attachments: image-2021-05-11-17-53-58-291.png
>
>
> I use Apache Tika on Android in order to detect mime type of varios files:
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
> AutoDetectParser().detector .detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 will crash when trying to detect the mime type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
> at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:119)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
> at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
> at 
> org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
> at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
> at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
> at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
> at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.apache.tika.config.TikaConfig.(TikaConfig.java:257)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>  at 
> org.apache.tika.parser.AutoDetectParser.(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: 
> http://javax.xml.XMLConstants/feature/secure-processing
>  at 
> org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>  at 
> org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>  at 
> org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>  at 
> org.apache.tika.mime.MimeTypesReader.(MimeTypesReader.java:117)
>  at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>  at 
> org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>  at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>  at 
> org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>  at org.ap