Hi, I am currently trying to upgrade Tika from ancient 1.x to most recent 3.3 in Apache Oak (https://issues.apache.org/jira/browse/OAK-9752). I studied https://cwiki.apache.org/confluence/display/TIKA/Migrating+to+Tika+2.0.0 however the usage in OSGi is unclear to me:
First I tried to deploy both tika-core and tika-parsers-standard-package (both have OSGi metadata in their manifests) in my OSGi container (Apache Felix) and ran into https://issues.apache.org/jira/browse/TIKA-4698. Then I tried to deploy tika-standard-bundle and ran into https://issues.apache.org/jira/browse/TIKA-4699. I checked both tika-standard-bundle and tika-parsers-standard-package more thoroughly afterwards and they don’t really seem to be ready for OSGi. The former lacks to embed/include any of the parsers from its Maven dependencies (i.e. only contains a single source file, namely the Bundle Activator (https://github.com/apache/tika/blob/a7e6ac73862d0d8d719ace8b9d0cb6d39efb926b/tika-parsers/tika-parsers-standard/tika-parsers-standard-package/src/main/java/org/apache/tika/parser/internal/Activator.java). The latter embeds lots of stuff but again misses the actual parsers. Am I missing something here or is OSGi support just broken in newer Tika versions? Maven dependencies along have zero effect on the generated bundles (they are by default a compile time only thing) so just having a dependency without at least an according embed cannot really work... I am more than happy to help once I get a better understanding how things are supposed to work. Can someone clarify the idea of tika-parsers-standard-package and tika-standard-bundle in OSGi? Thanks in advance, Konrad
