[jira] [Commented] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers
[ https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479251#comment-15479251 ] Hudson commented on TIKA-2071: -- SUCCESS: Integrated in Jenkins build tika-2.x #139 (See [https://builds.apache.org/job/tika-2.x/139/]) TIKA-2071 - DefaultParser and CompositeParser does not filter (bob: rev 7a0280c77b3a84dd2e10d35cf71a7722abe4ba80) * (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java * (edit) tika-core/src/main/java/org/apache/tika/parser/CompositeParser.java * (edit) tika-core/src/main/java/org/apache/tika/parser/DefaultParser.java > Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers > from dynamic ServiceLoader Parsers > --- > > Key: TIKA-2071 > URL: https://issues.apache.org/jira/browse/TIKA-2071 > Project: Tika > Issue Type: Bug >Reporter: Bob Paulin >Assignee: Bob Paulin > Fix For: 2.0 > > > The DefaultParser and CompositeParser do not filter dynamic services using > the excludedParser List. The exclude list should be applied here as well. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistant with other modules
[ https://issues.apache.org/jira/browse/TIKA-2073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479253#comment-15479253 ] Hudson commented on TIKA-2073: -- SUCCESS: Integrated in Jenkins build tika-2.x #139 (See [https://builds.apache.org/job/tika-2.x/139/]) TIKA-2073 - Tika Language Detect Project should include Bundle Activator (bob: rev f8092d3bd9fb85d5c87983de562643632e7ddd7d) * (add) tika-langdetect/src/main/java/org/apache/tika/langdetect/internal/Activator.java * (edit) tika-langdetect/pom.xml > Tika 2.0 - Tika Language Detect Project should include Bundle Activator and > packaging consistant with other modules > --- > > Key: TIKA-2073 > URL: https://issues.apache.org/jira/browse/TIKA-2073 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently tika-langdetect does not register the LanguageDetectors as bundles > and includes dependencies that are not OSGi friendly so they can be embedded. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods
[ https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479252#comment-15479252 ] Hudson commented on TIKA-2075: -- SUCCESS: Integrated in Jenkins build tika-2.x #139 (See [https://builds.apache.org/job/tika-2.x/139/]) TIKA-2075 - Expose Additional TikaService methods (bob: rev f112c88fbd54f6043eec6a79cb11145d0f56d4a5) * (edit) tika-core/src/main/java/org/apache/tika/osgi/TikaService.java * (edit) tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceImpl.java > Tika 2.0 - Expose Additional TikaService methods > - > > Key: TIKA-2075 > URL: https://issues.apache.org/jira/browse/TIKA-2075 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > TikaService should also expose direct access to wrapped member variables such > as ServiceLoader, Parser, and Detector. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers
[ https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479186#comment-15479186 ] Hudson commented on TIKA-2071: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #43 (See [https://builds.apache.org/job/tika-2.x-windows/43/]) TIKA-2071 - DefaultParser and CompositeParser does not filter (bob: rev 7a0280c77b3a84dd2e10d35cf71a7722abe4ba80) * (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java * (edit) tika-core/src/main/java/org/apache/tika/parser/DefaultParser.java * (edit) tika-core/src/main/java/org/apache/tika/parser/CompositeParser.java > Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers > from dynamic ServiceLoader Parsers > --- > > Key: TIKA-2071 > URL: https://issues.apache.org/jira/browse/TIKA-2071 > Project: Tika > Issue Type: Bug >Reporter: Bob Paulin >Assignee: Bob Paulin > Fix For: 2.0 > > > The DefaultParser and CompositeParser do not filter dynamic services using > the excludedParser List. The exclude list should be applied here as well. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistant with other modules
[ https://issues.apache.org/jira/browse/TIKA-2073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479188#comment-15479188 ] Hudson commented on TIKA-2073: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #43 (See [https://builds.apache.org/job/tika-2.x-windows/43/]) TIKA-2073 - Tika Language Detect Project should include Bundle (bob: rev f8092d3bd9fb85d5c87983de562643632e7ddd7d) * (add) tika-langdetect/src/main/java/org/apache/tika/langdetect/internal/Activator.java * (edit) tika-langdetect/pom.xml > Tika 2.0 - Tika Language Detect Project should include Bundle Activator and > packaging consistant with other modules > --- > > Key: TIKA-2073 > URL: https://issues.apache.org/jira/browse/TIKA-2073 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently tika-langdetect does not register the LanguageDetectors as bundles > and includes dependencies that are not OSGi friendly so they can be embedded. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods
[ https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479187#comment-15479187 ] Hudson commented on TIKA-2075: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #43 (See [https://builds.apache.org/job/tika-2.x-windows/43/]) TIKA-2075 - Expose Additional TikaService methods (bob: rev f112c88fbd54f6043eec6a79cb11145d0f56d4a5) * (edit) tika-core/src/main/java/org/apache/tika/osgi/TikaService.java * (edit) tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceImpl.java > Tika 2.0 - Expose Additional TikaService methods > - > > Key: TIKA-2075 > URL: https://issues.apache.org/jira/browse/TIKA-2075 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > TikaService should also expose direct access to wrapped member variables such > as ServiceLoader, Parser, and Detector. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
tika-2.x-windows - Build # 43 - Still Failing
The Apache Jenkins build system has built tika-2.x-windows (build #43) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/43/ to view the results.
[jira] [Resolved] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
[ https://issues.apache.org/jira/browse/TIKA-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2074. -- Resolution: Fixed > Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading > > > Key: TIKA-2074 > URL: https://issues.apache.org/jira/browse/TIKA-2074 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently the ServiceLoader depends on the classes loaded by the static > ServiceLoader to instantiate classes within the TikaConfig. This should be > extended to include dynamic classes as well to function in a dynamic OSGi > environment. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader
[ https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2070. -- Resolution: Fixed > Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service > Loader > - > > Key: TIKA-2070 > URL: https://issues.apache.org/jira/browse/TIKA-2070 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently only Parser and Detector classes are added to the dynamic > ServiceLoader list. We should extend this to include the EncodingDetector > and LanguageDetector -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods
[ https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-2075: - Summary: Tika 2.0 - Expose Additional TikaService methods (was: Tika 2.0 - Expose Additonal TikaService methods) > Tika 2.0 - Expose Additional TikaService methods > - > > Key: TIKA-2075 > URL: https://issues.apache.org/jira/browse/TIKA-2075 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > TikaService should also expose direct access to wrapped member variables such > as ServiceLoader, Parser, and Detector. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService
[ https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin resolved TIKA-2072. -- Resolution: Fixed > Tika 2.0 - Create TikaServiceFactory for creating TikaService > - > > Key: TIKA-2072 > URL: https://issues.apache.org/jira/browse/TIKA-2072 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > In order to create TikaService objects with different configs we should have > a factory available. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TIKA-2075) Tika 2.0 - Expose Additonal TikaService methods
Bob Paulin created TIKA-2075: Summary: Tika 2.0 - Expose Additonal TikaService methods Key: TIKA-2075 URL: https://issues.apache.org/jira/browse/TIKA-2075 Project: Tika Issue Type: Improvement Affects Versions: 2.0 Reporter: Bob Paulin Assignee: Bob Paulin TikaService should also expose direct access to wrapped member variables such as ServiceLoader, Parser, and Detector. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
[ https://issues.apache.org/jira/browse/TIKA-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479148#comment-15479148 ] Hudson commented on TIKA-2074: -- SUCCESS: Integrated in Jenkins build tika-2.x #138 (See [https://builds.apache.org/job/tika-2.x/138/]) TIKA-2074 - ServiceLoader can use Class files loaded via dynamic loading (bob: rev b73cd8ce867402491d73f7289422eb08de62b565) * (edit) tika-core/src/main/java/org/apache/tika/config/ServiceLoader.java * (add) tika-core/src/test/java/org/apache/tika/config/ServiceLoaderTest.java > Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading > > > Key: TIKA-2074 > URL: https://issues.apache.org/jira/browse/TIKA-2074 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently the ServiceLoader depends on the classes loaded by the static > ServiceLoader to instantiate classes within the TikaConfig. This should be > extended to include dynamic classes as well to function in a dynamic OSGi > environment. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader
[ https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479147#comment-15479147 ] Hudson commented on TIKA-2070: -- SUCCESS: Integrated in Jenkins build tika-2.x #138 (See [https://builds.apache.org/job/tika-2.x/138/]) TIKA-2070 - Add Encoding Detector and Language Detectors to Dynamic (bob: rev d57a85274940a92079220fd1a87896e86278cc3f) * (edit) tika-parser-modules/tika-parser-database-module/src/main/java/org/apache/tika/module/database/internal/Activator.java * (edit) tika-parser-modules/tika-parser-web-module/src/main/java/org/apache/tika/module/web/internal/Activator.java * (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/module/office/internal/Activator.java * (edit) tika-parser-modules/tika-parser-scientific-module/src/main/java/org/apache/tika/module/scientific/internal/Activator.java * (edit) tika-parser-modules/tika-parser-cad-module/src/main/java/org/apache/tika/module/cad/internal/Activator.java * (edit) tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/module/multimedia/internal/Activator.java * (edit) tika-core/src/main/java/org/apache/tika/language/detect/LanguageDetector.java * (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java * (edit) tika-parser-modules/tika-parser-code-module/src/main/java/org/apache/tika/module/code/internal/Activator.java * (edit) tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/module/text/internal/Activator.java * (edit) tika-parser-bundles/tika-parser-vorbis-bundle/src/main/java/org/apache/tika/module/vorbis/internal/Activator.java * (edit) tika-parser-modules/tika-parser-advanced-module/src/main/java/org/apache/tika/module/advanced/internal/Activator.java * (edit) tika-parser-modules/tika-parser-crypto-module/src/main/java/org/apache/tika/module/crypto/internal/Activator.java * (edit) tika-parser-modules/tika-parser-ebook-module/src/main/java/org/apache/tika/module/ebook/internal/Activator.java * (edit) tika-parser-modules/tika-parser-package-module/src/main/java/org/apache/tika/module/pkg/internal/Activator.java * (edit) tika-core/src/main/java/org/apache/tika/language/detect/LanguageHandler.java * (edit) tika-core/src/main/java/org/apache/tika/osgi/TikaAbstractBundleActivator.java * (edit) tika-parser-modules/tika-parser-journal-module/src/main/java/org/apache/tika/module/journal/internal/Activator.java > Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service > Loader > - > > Key: TIKA-2070 > URL: https://issues.apache.org/jira/browse/TIKA-2070 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently only Parser and Detector classes are added to the dynamic > ServiceLoader list. We should extend this to include the EncodingDetector > and LanguageDetector -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService
[ https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479146#comment-15479146 ] Hudson commented on TIKA-2072: -- SUCCESS: Integrated in Jenkins build tika-2.x #138 (See [https://builds.apache.org/job/tika-2.x/138/]) TIKA-2072 - Create TikaServiceFactory for creating TikaService (bob: rev 587dcb7725a6556a08618a41c7c1b5ff893e3721) * (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java * (add) tika-core/src/main/java/org/apache/tika/osgi/TikaServiceFactory.java * (add) tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceFactoryImpl.java > Tika 2.0 - Create TikaServiceFactory for creating TikaService > - > > Key: TIKA-2072 > URL: https://issues.apache.org/jira/browse/TIKA-2072 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > In order to create TikaService objects with different configs we should have > a factory available. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService
[ https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479110#comment-15479110 ] Hudson commented on TIKA-2072: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #42 (See [https://builds.apache.org/job/tika-2.x-windows/42/]) TIKA-2072 - Create TikaServiceFactory for creating TikaService (bob: rev 587dcb7725a6556a08618a41c7c1b5ff893e3721) * (add) tika-core/src/main/java/org/apache/tika/osgi/TikaServiceFactory.java * (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java * (add) tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceFactoryImpl.java > Tika 2.0 - Create TikaServiceFactory for creating TikaService > - > > Key: TIKA-2072 > URL: https://issues.apache.org/jira/browse/TIKA-2072 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > In order to create TikaService objects with different configs we should have > a factory available. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
[ https://issues.apache.org/jira/browse/TIKA-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479113#comment-15479113 ] Hudson commented on TIKA-2074: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #42 (See [https://builds.apache.org/job/tika-2.x-windows/42/]) TIKA-2074 - ServiceLoader can use Class files loaded via dynamic (bob: rev b73cd8ce867402491d73f7289422eb08de62b565) * (add) tika-core/src/test/java/org/apache/tika/config/ServiceLoaderTest.java * (edit) tika-core/src/main/java/org/apache/tika/config/ServiceLoader.java > Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading > > > Key: TIKA-2074 > URL: https://issues.apache.org/jira/browse/TIKA-2074 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently the ServiceLoader depends on the classes loaded by the static > ServiceLoader to instantiate classes within the TikaConfig. This should be > extended to include dynamic classes as well to function in a dynamic OSGi > environment. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader
[ https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479112#comment-15479112 ] Hudson commented on TIKA-2070: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #42 (See [https://builds.apache.org/job/tika-2.x-windows/42/]) TIKA-2070 - Add Encoding Detector and Language Detectors to Dynamic (bob: rev d57a85274940a92079220fd1a87896e86278cc3f) * (edit) tika-parser-modules/tika-parser-advanced-module/src/main/java/org/apache/tika/module/advanced/internal/Activator.java * (edit) tika-parser-modules/tika-parser-code-module/src/main/java/org/apache/tika/module/code/internal/Activator.java * (edit) tika-parser-modules/tika-parser-crypto-module/src/main/java/org/apache/tika/module/crypto/internal/Activator.java * (edit) tika-parser-modules/tika-parser-cad-module/src/main/java/org/apache/tika/module/cad/internal/Activator.java * (edit) tika-parser-modules/tika-parser-database-module/src/main/java/org/apache/tika/module/database/internal/Activator.java * (edit) tika-parser-bundles/tika-parser-vorbis-bundle/src/main/java/org/apache/tika/module/vorbis/internal/Activator.java * (edit) tika-core/src/main/java/org/apache/tika/language/detect/LanguageHandler.java * (edit) tika-core/src/main/java/org/apache/tika/language/detect/LanguageDetector.java * (edit) tika-parser-modules/tika-parser-scientific-module/src/main/java/org/apache/tika/module/scientific/internal/Activator.java * (edit) tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/module/office/internal/Activator.java * (edit) tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/module/multimedia/internal/Activator.java * (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java * (edit) tika-parser-modules/tika-parser-journal-module/src/main/java/org/apache/tika/module/journal/internal/Activator.java * (edit) tika-core/src/main/java/org/apache/tika/osgi/TikaAbstractBundleActivator.java * (edit) tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/module/text/internal/Activator.java * (edit) tika-parser-modules/tika-parser-web-module/src/main/java/org/apache/tika/module/web/internal/Activator.java * (edit) tika-parser-modules/tika-parser-ebook-module/src/main/java/org/apache/tika/module/ebook/internal/Activator.java * (edit) tika-parser-modules/tika-parser-package-module/src/main/java/org/apache/tika/module/pkg/internal/Activator.java > Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service > Loader > - > > Key: TIKA-2070 > URL: https://issues.apache.org/jira/browse/TIKA-2070 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently only Parser and Detector classes are added to the dynamic > ServiceLoader list. We should extend this to include the EncodingDetector > and LanguageDetector -- This message was sent by Atlassian JIRA (v6.3.4#6332)
tika-2.x-windows - Build # 42 - Still Failing
The Apache Jenkins build system has built tika-2.x-windows (build #42) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/42/ to view the results.
[jira] [Created] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
Bob Paulin created TIKA-2074: Summary: Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading Key: TIKA-2074 URL: https://issues.apache.org/jira/browse/TIKA-2074 Project: Tika Issue Type: Improvement Affects Versions: 2.0 Reporter: Bob Paulin Assignee: Bob Paulin Currently the ServiceLoader depends on the classes loaded by the static ServiceLoader to instantiate classes within the TikaConfig. This should be extended to include dynamic classes as well to function in a dynamic OSGi environment. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistant with other modules
Bob Paulin created TIKA-2073: Summary: Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistant with other modules Key: TIKA-2073 URL: https://issues.apache.org/jira/browse/TIKA-2073 Project: Tika Issue Type: Improvement Affects Versions: 2.0 Reporter: Bob Paulin Assignee: Bob Paulin Currently tika-langdetect does not register the LanguageDetectors as bundles and includes dependencies that are not OSGi friendly so they can be embedded. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService
Bob Paulin created TIKA-2072: Summary: Tika 2.0 - Create TikaServiceFactory for creating TikaService Key: TIKA-2072 URL: https://issues.apache.org/jira/browse/TIKA-2072 Project: Tika Issue Type: Improvement Affects Versions: 2.0 Reporter: Bob Paulin Assignee: Bob Paulin In order to create TikaService objects with different configs we should have a factory available. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers
[ https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-2071: - Issue Type: Bug (was: Improvement) > Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers > from dynamic ServiceLoader Parsers > --- > > Key: TIKA-2071 > URL: https://issues.apache.org/jira/browse/TIKA-2071 > Project: Tika > Issue Type: Bug >Reporter: Bob Paulin >Assignee: Bob Paulin > Fix For: 2.0 > > > The DefaultParser and CompositeParser do not filter dynamic services using > the excludedParser List. The exclude list should be applied here as well. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader
[ https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bob Paulin updated TIKA-2070: - Affects Version/s: 2.0 > Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service > Loader > - > > Key: TIKA-2070 > URL: https://issues.apache.org/jira/browse/TIKA-2070 > Project: Tika > Issue Type: Improvement >Affects Versions: 2.0 >Reporter: Bob Paulin >Assignee: Bob Paulin > > Currently only Parser and Detector classes are added to the dynamic > ServiceLoader list. We should extend this to include the EncodingDetector > and LanguageDetector -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers
Bob Paulin created TIKA-2071: Summary: Tika 2.0 - DefaultParser and CompositeParser does not filter excludedParsers from dynamic ServiceLoader Parsers Key: TIKA-2071 URL: https://issues.apache.org/jira/browse/TIKA-2071 Project: Tika Issue Type: Improvement Reporter: Bob Paulin Assignee: Bob Paulin Fix For: 2.0 The DefaultParser and CompositeParser do not filter dynamic services using the excludedParser List. The exclude list should be applied here as well. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader
Bob Paulin created TIKA-2070: Summary: Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader Key: TIKA-2070 URL: https://issues.apache.org/jira/browse/TIKA-2070 Project: Tika Issue Type: Improvement Reporter: Bob Paulin Assignee: Bob Paulin Currently only Parser and Detector classes are added to the dynamic ServiceLoader list. We should extend this to include the EncodingDetector and LanguageDetector -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477188#comment-15477188 ] Tim Allison edited comment on TIKA-2069 at 9/9/16 4:19 PM: --- Thank you! This question is for [~jeffswindle] and fellow Tika devs (esp. [~rgauss] and [~gagravarr]), should we: 1) add macro text as metadata items (e.g. msoffice:macro) 2) inline them in the content via elements? 3) treat them as embedded documents (mime type would be?) I'd prefer option 1 or 3. Option 1 is probably simpler for end users; but option 3 would allow us to capture metadata about the macro. -[~jeffswindle], the title of this issue is for msoffice...is it ok to limit this to ooxml? Do you need this for the older doc and xls?- Already handled by POI at no extra cost. :) was (Author: talli...@mitre.org): Thank you! This question is for [~jeffswindle] and fellow Tika devs (esp. [~rgauss]), should we add macros as metadata items or inline them in the content via elements? I'd prefer a metadata item for each macro, but could go either way. [~jeffswindle], the title of this issue is for msoffice...is it ok to limit this to ooxml? Do you need this for the older doc and xls? > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477438#comment-15477438 ] Tim Allison commented on TIKA-2069: --- Once we upgrade to POI 3.15-beta3, this _should_ be fairly straightforward, thanks to the work of others on POI. We may want to copy/modify the "find the vba.bin file" at the Tika level for OOXML files to pass an npoifs into VBAMacroReader from an open OOXML/zip file. > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477415#comment-15477415 ] Tim Allison commented on TIKA-2069: --- Thanks to [~blagerw...@gmail.com], [~gagravarr] and [~onealj] among others, it looks like this is all nicely handled by POI now as of [bug-52949|https://bz.apache.org/bugzilla/show_bug.cgi?id=52949]. > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477255#comment-15477255 ] Jeff Swindle commented on TIKA-2069: OOXML would be great. Not just limited to Word and Excel. Need Powerpoint also. > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477188#comment-15477188 ] Tim Allison commented on TIKA-2069: --- Thank you! This question is for [~jeffswindle] and fellow Tika devs (esp. [~rgauss]), should we add macros as metadata items or inline them in the content via elements? I'd prefer a metadata item for each macro, but could go either way. [~jeffswindle], the title of this issue is for msoffice...is it ok to limit this to ooxml? Do you need this for the older doc and xls? > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477176#comment-15477176 ] Jeff Swindle commented on TIKA-2069: Desire is for TIKA to extract macro text from Microsoft Office files as it does metadata and content. Need is to search for specific signatures that may be present in macros and if present should be removed prior to distributing document. TIKA would facilitate the search. > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TIKA-2069) Extract Macro text from Microsoft Office documents
[ https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Swindle updated TIKA-2069: --- Attachment: excel-macro.PNG word-macro.PNG xlsmacro.xlsm.tika-app-output.txt xlsmacro.xlsm test-macro-doc.docm-tika-app-output.txt test-macro-doc.docm Word file containing macros. Output from tika-app-1.13. Screen shot of macro within Word file. Excel file containing macros. Output from tika-app-1.13. Screen shot of macros within Excel file. > Extract Macro text from Microsoft Office documents > -- > > Key: TIKA-2069 > URL: https://issues.apache.org/jira/browse/TIKA-2069 > Project: Tika > Issue Type: Improvement > Components: detector, parser >Affects Versions: 1.13 > Environment: RHEL 5.x, Apache Tomcat >Reporter: Jeff Swindle > Labels: features > Attachments: excel-macro.PNG, test-macro-doc.docm, > test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, > xlsmacro.xlsm.tika-app-output.txt > > > Tika supports macro-enabled Microsoft Office documents by extracting metadata > and contents, however, macros within the document are not in the metadata or > content output. > Desire is to have the macro text extracted also. > Info regarding macro extraction: http://www.decalage.info/vba_tools -- This message was sent by Atlassian JIRA (v6.3.4#6332)