[jira] [Commented] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers from dynamic ServiceLoader Parsers

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479251#comment-15479251
 ] 

Hudson commented on TIKA-2071:
--

SUCCESS: Integrated in Jenkins build tika-2.x #139 (See 
[https://builds.apache.org/job/tika-2.x/139/])
TIKA-2071 - DefaultParser and CompositeParser does not filter (bob: rev 
7a0280c77b3a84dd2e10d35cf71a7722abe4ba80)
* (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java
* (edit) tika-core/src/main/java/org/apache/tika/parser/CompositeParser.java
* (edit) tika-core/src/main/java/org/apache/tika/parser/DefaultParser.java


> Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers 
> from dynamic ServiceLoader Parsers
> ---
>
> Key: TIKA-2071
> URL: https://issues.apache.org/jira/browse/TIKA-2071
> Project: Tika
>  Issue Type: Bug
>Reporter: Bob Paulin
>Assignee: Bob Paulin
> Fix For: 2.0
>
>
> The DefaultParser and CompositeParser do not filter dynamic services using 
> the excludedParser List.  The exclude list should be applied here as well.
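The behaviour asked for in the description above can be sketched roughly as follows. This is a minimal illustration, not the committed change: SketchParser, PdfSketchParser, HtmlSketchParser and ExcludeFilterSketch are hypothetical names made up for the example, and the real Tika Parser, DefaultParser and CompositeParser classes are not used. It only assumes that the exclude list is a collection of parser classes and that it should apply to statically and dynamically loaded parsers alike.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;

// Hypothetical stand-in for a parser type; this is not Tika's Parser interface.
interface SketchParser {}

final class PdfSketchParser implements SketchParser {}

final class HtmlSketchParser implements SketchParser {}

final class ExcludeFilterSketch {

    // Merge statically and dynamically discovered parsers, then drop every
    // parser whose class matches (or extends) one of the excluded classes.
    static List<SketchParser> merge(List<SketchParser> staticParsers,
                                    List<SketchParser> dynamicParsers,
                                    Collection<Class<? extends SketchParser>> excluded) {
        List<SketchParser> all = new ArrayList<>(staticParsers);
        all.addAll(dynamicParsers);
        List<SketchParser> kept = new ArrayList<>();
        for (SketchParser parser : all) {
            if (!isExcluded(parser, excluded)) {
                kept.add(parser);
            }
        }
        return kept;
    }

    // True if the parser is an instance of any excluded class.
    static boolean isExcluded(SketchParser parser,
                              Collection<Class<? extends SketchParser>> excluded) {
        for (Class<? extends SketchParser> type : excluded) {
            if (type.isAssignableFrom(parser.getClass())) {
                return true;
            }
        }
        return false;
    }
}
```

The point of the sketch is that the filter runs once over the merged list, so a dynamically registered parser cannot bypass the exclude list.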



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistent with other modules

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479253#comment-15479253
 ] 

Hudson commented on TIKA-2073:
--

SUCCESS: Integrated in Jenkins build tika-2.x #139 (See 
[https://builds.apache.org/job/tika-2.x/139/])
TIKA-2073 - Tika Language Detect Project should include Bundle Activator (bob: 
rev f8092d3bd9fb85d5c87983de562643632e7ddd7d)
* (add) 
tika-langdetect/src/main/java/org/apache/tika/langdetect/internal/Activator.java
* (edit) tika-langdetect/pom.xml


> Tika 2.0 - Tika Language Detect Project should include Bundle Activator and 
> packaging consistent with other modules
> ---
>
> Key: TIKA-2073
> URL: https://issues.apache.org/jira/browse/TIKA-2073
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently tika-langdetect does not register the LanguageDetectors as bundles 
> and includes dependencies that are not OSGi friendly so they can be embedded.





[jira] [Commented] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479252#comment-15479252
 ] 

Hudson commented on TIKA-2075:
--

SUCCESS: Integrated in Jenkins build tika-2.x #139 (See 
[https://builds.apache.org/job/tika-2.x/139/])
TIKA-2075 - Expose Additional TikaService methods (bob: rev 
f112c88fbd54f6043eec6a79cb11145d0f56d4a5)
* (edit) tika-core/src/main/java/org/apache/tika/osgi/TikaService.java
* (edit) 
tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceImpl.java


> Tika 2.0  - Expose Additional TikaService methods
> -
>
> Key: TIKA-2075
> URL: https://issues.apache.org/jira/browse/TIKA-2075
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> TikaService should also expose direct access to wrapped member variables such 
> as ServiceLoader, Parser, and Detector.





[jira] [Commented] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers from dynamic ServiceLoader Parsers

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479186#comment-15479186
 ] 

Hudson commented on TIKA-2071:
--

FAILURE: Integrated in Jenkins build tika-2.x-windows #43 (See 
[https://builds.apache.org/job/tika-2.x-windows/43/])
TIKA-2071 - DefaultParser and CompositeParser does not filter (bob: rev 
7a0280c77b3a84dd2e10d35cf71a7722abe4ba80)
* (edit) tika-core/src/test/java/org/apache/tika/parser/CompositeParserTest.java
* (edit) tika-core/src/main/java/org/apache/tika/parser/DefaultParser.java
* (edit) tika-core/src/main/java/org/apache/tika/parser/CompositeParser.java


> Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers 
> from dynamic ServiceLoader Parsers
> ---
>
> Key: TIKA-2071
> URL: https://issues.apache.org/jira/browse/TIKA-2071
> Project: Tika
>  Issue Type: Bug
>Reporter: Bob Paulin
>Assignee: Bob Paulin
> Fix For: 2.0
>
>
> The DefaultParser and CompositeParser do not filter dynamic services using 
> the excludedParser List.  The exclude list should be applied here as well.





[jira] [Commented] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistent with other modules

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479188#comment-15479188
 ] 

Hudson commented on TIKA-2073:
--

FAILURE: Integrated in Jenkins build tika-2.x-windows #43 (See 
[https://builds.apache.org/job/tika-2.x-windows/43/])
TIKA-2073 - Tika Language Detect Project should include Bundle (bob: rev 
f8092d3bd9fb85d5c87983de562643632e7ddd7d)
* (add) 
tika-langdetect/src/main/java/org/apache/tika/langdetect/internal/Activator.java
* (edit) tika-langdetect/pom.xml


> Tika 2.0 - Tika Language Detect Project should include Bundle Activator and 
> packaging consistent with other modules
> ---
>
> Key: TIKA-2073
> URL: https://issues.apache.org/jira/browse/TIKA-2073
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently tika-langdetect does not register the LanguageDetectors as bundles 
> and includes dependencies that are not OSGi friendly so they can be embedded.





[jira] [Commented] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479187#comment-15479187
 ] 

Hudson commented on TIKA-2075:
--

FAILURE: Integrated in Jenkins build tika-2.x-windows #43 (See 
[https://builds.apache.org/job/tika-2.x-windows/43/])
TIKA-2075 - Expose Additional TikaService methods (bob: rev 
f112c88fbd54f6043eec6a79cb11145d0f56d4a5)
* (edit) tika-core/src/main/java/org/apache/tika/osgi/TikaService.java
* (edit) 
tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceImpl.java


> Tika 2.0  - Expose Additional TikaService methods
> -
>
> Key: TIKA-2075
> URL: https://issues.apache.org/jira/browse/TIKA-2075
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> TikaService should also expose direct access to wrapped member variables such 
> as ServiceLoader, Parser, and Detector.





tika-2.x-windows - Build # 43 - Still Failing

2016-09-09 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #43)

Status: Still Failing

Check console output at https://builds.apache.org/job/tika-2.x-windows/43/ to 
view the results.

[jira] [Resolved] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading

2016-09-09 Thread Bob Paulin (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bob Paulin resolved TIKA-2074.
--
Resolution: Fixed

> Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
> 
>
> Key: TIKA-2074
> URL: https://issues.apache.org/jira/browse/TIKA-2074
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently the ServiceLoader depends on the classes loaded by the static 
> ServiceLoader to instantiate classes within the TikaConfig.  This should be 
> extended to include dynamic classes as well to function in a dynamic OSGi 
> environment.
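One way to picture the extension described above is a lookup that consults dynamically registered classes as well as a fixed class loader. The sketch below is an assumption-laden illustration only: DynamicClassLookupSketch and its methods are hypothetical and do not mirror the actual org.apache.tika.config.ServiceLoader code, and the order of lookup (dynamic registrations first, then the static loader) is a choice made for the example.

```java
import java.util.Map;
import java.util.Optional;
import java.util.concurrent.ConcurrentHashMap;

final class DynamicClassLookupSketch {

    // Classes announced at runtime (for example by OSGi bundles), keyed by name.
    private final Map<String, Class<?>> dynamicClasses = new ConcurrentHashMap<>();

    // Class loader used for the classic, static lookup.
    private final ClassLoader staticLoader;

    DynamicClassLookupSketch(ClassLoader staticLoader) {
        this.staticLoader = staticLoader;
    }

    // Called when a dynamic provider appears.
    void addDynamic(Class<?> type) {
        dynamicClasses.put(type.getName(), type);
    }

    // Called when a dynamic provider goes away.
    void removeDynamic(Class<?> type) {
        dynamicClasses.remove(type.getName());
    }

    // Resolve a class name: dynamic registrations first, then the static loader.
    Optional<Class<?>> resolve(String className) {
        Class<?> dynamic = dynamicClasses.get(className);
        if (dynamic != null) {
            return Optional.<Class<?>>of(dynamic);
        }
        try {
            return Optional.<Class<?>>of(Class.forName(className, false, staticLoader));
        } catch (ClassNotFoundException e) {
            return Optional.empty();
        }
    }
}
```

A config reader that instantiates classes by name could call resolve() instead of going straight to the class loader, which is what lets it work when providers come and go at runtime.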





[jira] [Resolved] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Bob Paulin (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bob Paulin resolved TIKA-2070.
--
Resolution: Fixed

> Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service 
> Loader
> -
>
> Key: TIKA-2070
> URL: https://issues.apache.org/jira/browse/TIKA-2070
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently only Parser and Detector classes are added to the dynamic 
> ServiceLoader list.  We should extend this to include the EncodingDetector 
> and LanguageDetector  





[jira] [Updated] (TIKA-2075) Tika 2.0 - Expose Additional TikaService methods

2016-09-09 Thread Bob Paulin (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bob Paulin updated TIKA-2075:
-
Summary: Tika 2.0  - Expose Additional TikaService methods  (was: Tika 2.0  
- Expose Additonal TikaService methods)

> Tika 2.0  - Expose Additional TikaService methods
> -
>
> Key: TIKA-2075
> URL: https://issues.apache.org/jira/browse/TIKA-2075
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> TikaService should also expose direct access to wrapped member variables such 
> as ServiceLoader, Parser, and Detector.





[jira] [Resolved] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService

2016-09-09 Thread Bob Paulin (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bob Paulin resolved TIKA-2072.
--
Resolution: Fixed

> Tika 2.0 - Create TikaServiceFactory for creating TikaService
> -
>
> Key: TIKA-2072
> URL: https://issues.apache.org/jira/browse/TIKA-2072
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> In order to create TikaService objects with different configs we should have 
> a factory available.
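As a rough illustration of the factory idea above, the following sketch creates independent service objects from different configurations. All names here (ConfigSketch, ServiceSketch, ServiceFactorySketch, and the "maxStringLength" key) are hypothetical and chosen for the example; they are not the TikaServiceFactory, TikaService or TikaConfig APIs.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

// Hypothetical immutable config holder; Tika's own TikaConfig is not used here.
final class ConfigSketch {
    private final Map<String, String> settings;

    ConfigSketch(Map<String, String> settings) {
        // Defensive copy so later changes to the caller's map have no effect.
        this.settings = Collections.unmodifiableMap(new HashMap<>(settings));
    }

    String get(String key, String fallback) {
        String value = settings.get(key);
        return value != null ? value : fallback;
    }
}

// Hypothetical service that simply remembers the config it was built with.
final class ServiceSketch {
    private final ConfigSketch config;

    ServiceSketch(ConfigSketch config) {
        this.config = config;
    }

    ConfigSketch config() {
        return config;
    }
}

final class ServiceFactorySketch {

    // Service built from an empty (default) configuration.
    ServiceSketch createDefault() {
        return create(new ConfigSketch(Collections.<String, String>emptyMap()));
    }

    // Service built from a caller-supplied configuration.
    ServiceSketch create(ConfigSketch config) {
        return new ServiceSketch(config);
    }
}
```

With a factory like this, two callers can hold services with different settings at the same time instead of sharing a single pre-configured instance.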





[jira] [Created] (TIKA-2075) Tika 2.0 - Expose Additonal TikaService methods

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2075:


 Summary: Tika 2.0  - Expose Additonal TikaService methods
 Key: TIKA-2075
 URL: https://issues.apache.org/jira/browse/TIKA-2075
 Project: Tika
  Issue Type: Improvement
Affects Versions: 2.0
Reporter: Bob Paulin
Assignee: Bob Paulin


TikaService should also expose direct access to wrapped member variables such 
as ServiceLoader, Parser, and Detector.





[jira] [Commented] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479148#comment-15479148
 ] 

Hudson commented on TIKA-2074:
--

SUCCESS: Integrated in Jenkins build tika-2.x #138 (See 
[https://builds.apache.org/job/tika-2.x/138/])
TIKA-2074 - ServiceLoader can use Class files loaded via dynamic loading (bob: 
rev b73cd8ce867402491d73f7289422eb08de62b565)
* (edit) tika-core/src/main/java/org/apache/tika/config/ServiceLoader.java
* (add) tika-core/src/test/java/org/apache/tika/config/ServiceLoaderTest.java


> Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
> 
>
> Key: TIKA-2074
> URL: https://issues.apache.org/jira/browse/TIKA-2074
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently the ServiceLoader depends on the classes loaded by the static 
> ServiceLoader to instantiate classes within the TikaConfig.  This should be 
> extended to include dynamic classes as well to function in a dynamic OSGi 
> environment.





[jira] [Commented] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479147#comment-15479147
 ] 

Hudson commented on TIKA-2070:
--

SUCCESS: Integrated in Jenkins build tika-2.x #138 (See 
[https://builds.apache.org/job/tika-2.x/138/])
TIKA-2070 - Add Encoding Detector and Language Detectors to Dynamic (bob: rev 
d57a85274940a92079220fd1a87896e86278cc3f)
* (edit) 
tika-parser-modules/tika-parser-database-module/src/main/java/org/apache/tika/module/database/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-web-module/src/main/java/org/apache/tika/module/web/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/module/office/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-scientific-module/src/main/java/org/apache/tika/module/scientific/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-cad-module/src/main/java/org/apache/tika/module/cad/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/module/multimedia/internal/Activator.java
* (edit) 
tika-core/src/main/java/org/apache/tika/language/detect/LanguageDetector.java
* (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java
* (edit) 
tika-parser-modules/tika-parser-code-module/src/main/java/org/apache/tika/module/code/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/module/text/internal/Activator.java
* (edit) 
tika-parser-bundles/tika-parser-vorbis-bundle/src/main/java/org/apache/tika/module/vorbis/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-advanced-module/src/main/java/org/apache/tika/module/advanced/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-crypto-module/src/main/java/org/apache/tika/module/crypto/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-ebook-module/src/main/java/org/apache/tika/module/ebook/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-package-module/src/main/java/org/apache/tika/module/pkg/internal/Activator.java
* (edit) 
tika-core/src/main/java/org/apache/tika/language/detect/LanguageHandler.java
* (edit) 
tika-core/src/main/java/org/apache/tika/osgi/TikaAbstractBundleActivator.java
* (edit) 
tika-parser-modules/tika-parser-journal-module/src/main/java/org/apache/tika/module/journal/internal/Activator.java


> Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service 
> Loader
> -
>
> Key: TIKA-2070
> URL: https://issues.apache.org/jira/browse/TIKA-2070
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently only Parser and Detector classes are added to the dynamic 
> ServiceLoader list.  We should extend this to include the EncodingDetector 
> and LanguageDetector  





[jira] [Commented] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479146#comment-15479146
 ] 

Hudson commented on TIKA-2072:
--

SUCCESS: Integrated in Jenkins build tika-2.x #138 (See 
[https://builds.apache.org/job/tika-2.x/138/])
TIKA-2072 - Create TikaServiceFactory for creating TikaService (bob: rev 
587dcb7725a6556a08618a41c7c1b5ff893e3721)
* (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java
* (add) tika-core/src/main/java/org/apache/tika/osgi/TikaServiceFactory.java
* (add) 
tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceFactoryImpl.java


> Tika 2.0 - Create TikaServiceFactory for creating TikaService
> -
>
> Key: TIKA-2072
> URL: https://issues.apache.org/jira/browse/TIKA-2072
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> In order to create TikaService objects with different configs we should have 
> a factory available.





[jira] [Commented] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479110#comment-15479110
 ] 

Hudson commented on TIKA-2072:
--

FAILURE: Integrated in Jenkins build tika-2.x-windows #42 (See 
[https://builds.apache.org/job/tika-2.x-windows/42/])
TIKA-2072 - Create TikaServiceFactory for creating TikaService (bob: rev 
587dcb7725a6556a08618a41c7c1b5ff893e3721)
* (add) tika-core/src/main/java/org/apache/tika/osgi/TikaServiceFactory.java
* (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java
* (add) 
tika-core/src/main/java/org/apache/tika/osgi/internal/TikaServiceFactoryImpl.java


> Tika 2.0 - Create TikaServiceFactory for creating TikaService
> -
>
> Key: TIKA-2072
> URL: https://issues.apache.org/jira/browse/TIKA-2072
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> In order to create TikaService objects with different configs we should have 
> a factory available.





[jira] [Commented] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479113#comment-15479113
 ] 

Hudson commented on TIKA-2074:
--

FAILURE: Integrated in Jenkins build tika-2.x-windows #42 (See 
[https://builds.apache.org/job/tika-2.x-windows/42/])
TIKA-2074 - ServiceLoader can use Class files loaded via dynamic (bob: rev 
b73cd8ce867402491d73f7289422eb08de62b565)
* (add) tika-core/src/test/java/org/apache/tika/config/ServiceLoaderTest.java
* (edit) tika-core/src/main/java/org/apache/tika/config/ServiceLoader.java


> Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading
> 
>
> Key: TIKA-2074
> URL: https://issues.apache.org/jira/browse/TIKA-2074
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently the ServiceLoader depends on the classes loaded by the static 
> ServiceLoader to instantiate classes within the TikaConfig.  This should be 
> extended to include dynamic classes as well to function in a dynamic OSGi 
> environment.





[jira] [Commented] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15479112#comment-15479112
 ] 

Hudson commented on TIKA-2070:
--

FAILURE: Integrated in Jenkins build tika-2.x-windows #42 (See 
[https://builds.apache.org/job/tika-2.x-windows/42/])
TIKA-2070 - Add Encoding Detector and Language Detectors to Dynamic (bob: rev 
d57a85274940a92079220fd1a87896e86278cc3f)
* (edit) 
tika-parser-modules/tika-parser-advanced-module/src/main/java/org/apache/tika/module/advanced/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-code-module/src/main/java/org/apache/tika/module/code/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-crypto-module/src/main/java/org/apache/tika/module/crypto/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-cad-module/src/main/java/org/apache/tika/module/cad/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-database-module/src/main/java/org/apache/tika/module/database/internal/Activator.java
* (edit) 
tika-parser-bundles/tika-parser-vorbis-bundle/src/main/java/org/apache/tika/module/vorbis/internal/Activator.java
* (edit) 
tika-core/src/main/java/org/apache/tika/language/detect/LanguageHandler.java
* (edit) 
tika-core/src/main/java/org/apache/tika/language/detect/LanguageDetector.java
* (edit) 
tika-parser-modules/tika-parser-scientific-module/src/main/java/org/apache/tika/module/scientific/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-office-module/src/main/java/org/apache/tika/module/office/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-multimedia-module/src/main/java/org/apache/tika/module/multimedia/internal/Activator.java
* (edit) tika-core/src/main/java/org/apache/tika/config/TikaActivator.java
* (edit) 
tika-parser-modules/tika-parser-journal-module/src/main/java/org/apache/tika/module/journal/internal/Activator.java
* (edit) 
tika-core/src/main/java/org/apache/tika/osgi/TikaAbstractBundleActivator.java
* (edit) 
tika-parser-modules/tika-parser-text-module/src/main/java/org/apache/tika/module/text/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-web-module/src/main/java/org/apache/tika/module/web/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-ebook-module/src/main/java/org/apache/tika/module/ebook/internal/Activator.java
* (edit) 
tika-parser-modules/tika-parser-package-module/src/main/java/org/apache/tika/module/pkg/internal/Activator.java


> Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service 
> Loader
> -
>
> Key: TIKA-2070
> URL: https://issues.apache.org/jira/browse/TIKA-2070
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently only Parser and Detector classes are added to the dynamic 
> ServiceLoader list.  We should extend this to include the EncodingDetector 
> and LanguageDetector  





tika-2.x-windows - Build # 42 - Still Failing

2016-09-09 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #42)

Status: Still Failing

Check console output at https://builds.apache.org/job/tika-2.x-windows/42/ to 
view the results.

[jira] [Created] (TIKA-2074) Tika 2.0 - Allow ServiceLoader to use Class files loaded via dynamic loading

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2074:


 Summary: Tika 2.0 - Allow ServiceLoader to use Class files loaded 
via dynamic loading
 Key: TIKA-2074
 URL: https://issues.apache.org/jira/browse/TIKA-2074
 Project: Tika
  Issue Type: Improvement
Affects Versions: 2.0
Reporter: Bob Paulin
Assignee: Bob Paulin


Currently the ServiceLoader depends on the classes loaded by the static 
ServiceLoader to instantiate classes within the TikaConfig.  This should be 
extended to include dynamic classes as well to function in a dynamic OSGi 
environment.





[jira] [Created] (TIKA-2073) Tika 2.0 - Tika Language Detect Project should include Bundle Activator and packaging consistent with other modules

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2073:


 Summary: Tika 2.0 - Tika Language Detect Project should include 
Bundle Activator and packaging consistent with other modules
 Key: TIKA-2073
 URL: https://issues.apache.org/jira/browse/TIKA-2073
 Project: Tika
  Issue Type: Improvement
Affects Versions: 2.0
Reporter: Bob Paulin
Assignee: Bob Paulin


Currently tika-langdetect does not register the LanguageDetectors as bundles 
and includes dependencies that are not OSGi friendly so they can be embedded.





[jira] [Created] (TIKA-2072) Tika 2.0 - Create TikaServiceFactory for creating TikaService

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2072:


 Summary: Tika 2.0 - Create TikaServiceFactory for creating 
TikaService
 Key: TIKA-2072
 URL: https://issues.apache.org/jira/browse/TIKA-2072
 Project: Tika
  Issue Type: Improvement
Affects Versions: 2.0
Reporter: Bob Paulin
Assignee: Bob Paulin


In order to create TikaService objects with different configs we should have a 
factory available.





[jira] [Updated] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers from dynamic ServiceLoader Parsers

2016-09-09 Thread Bob Paulin (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bob Paulin updated TIKA-2071:
-
Issue Type: Bug  (was: Improvement)

> Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers 
> from dynamic ServiceLoader Parsers
> ---
>
> Key: TIKA-2071
> URL: https://issues.apache.org/jira/browse/TIKA-2071
> Project: Tika
>  Issue Type: Bug
>Reporter: Bob Paulin
>Assignee: Bob Paulin
> Fix For: 2.0
>
>
> The DefaultParser and CompositeParser do not filter dynamic services using 
> the excludedParser List.  The exclude list should be applied here as well.





[jira] [Updated] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Bob Paulin (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bob Paulin updated TIKA-2070:
-
Affects Version/s: 2.0

> Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service 
> Loader
> -
>
> Key: TIKA-2070
> URL: https://issues.apache.org/jira/browse/TIKA-2070
> Project: Tika
>  Issue Type: Improvement
>Affects Versions: 2.0
>Reporter: Bob Paulin
>Assignee: Bob Paulin
>
> Currently only Parser and Detector classes are added to the dynamic 
> ServiceLoader list.  We should extend this to include the EncodingDetector 
> and LanguageDetector  





[jira] [Created] (TIKA-2071) Tika 2.0 - DefaultParser and CompositeParser do not filter excludedParsers from dynamic ServiceLoader Parsers

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2071:


 Summary: Tika 2.0 - DefaultParser and CompositeParser do not 
filter excludedParsers from dynamic ServiceLoader Parsers
 Key: TIKA-2071
 URL: https://issues.apache.org/jira/browse/TIKA-2071
 Project: Tika
  Issue Type: Improvement
Reporter: Bob Paulin
Assignee: Bob Paulin
 Fix For: 2.0


The DefaultParser and CompositeParser do not filter dynamic services using the 
excludedParser List.  The exclude list should be applied here as well.





[jira] [Created] (TIKA-2070) Tika 2.0 - Add Encoding Detector and Language Detectors to Dynamic Service Loader

2016-09-09 Thread Bob Paulin (JIRA)
Bob Paulin created TIKA-2070:


 Summary: Tika 2.0 - Add Encoding Detector and Language Detectors 
to Dynamic Service Loader
 Key: TIKA-2070
 URL: https://issues.apache.org/jira/browse/TIKA-2070
 Project: Tika
  Issue Type: Improvement
Reporter: Bob Paulin
Assignee: Bob Paulin


Currently only Parser and Detector classes are added to the dynamic 
ServiceLoader list.  We should extend this to include the EncodingDetector and 
LanguageDetector  





[jira] [Comment Edited] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477188#comment-15477188
 ] 

Tim Allison edited comment on TIKA-2069 at 9/9/16 4:19 PM:
---

Thank you!  

This question is for [~jeffswindle] and fellow Tika devs (esp. [~rgauss] and 
[~gagravarr]), should we:
1) add macro text as metadata items (e.g. msoffice:macro)
2) inline them in the content via  elements?
3) treat them as embedded documents (mime type would be?)

I'd prefer option 1 or 3.  Option 1 is probably simpler for end users; but 
option 3 would allow us to capture metadata about the macro.

-[~jeffswindle], the title of this issue is for msoffice...is it ok to limit 
this to ooxml?  Do you need this for the older doc and xls?- Already handled by 
POI at no extra cost. :)


was (Author: talli...@mitre.org):
Thank you!  

This question is for [~jeffswindle] and fellow Tika devs (esp. [~rgauss]), 
should we add macros as metadata items or inline them in the content via  
elements?

I'd prefer a metadata item for each macro, but could go either way.

[~jeffswindle], the title of this issue is for msoffice...is it ok to limit 
this to ooxml?  Do you need this for the older doc and xls?

> Extract Macro text from Microsoft Office documents
> --
>
> Key: TIKA-2069
> URL: https://issues.apache.org/jira/browse/TIKA-2069
> Project: Tika
>  Issue Type: Improvement
>  Components: detector, parser
>Affects Versions: 1.13
> Environment: RHEL 5.x, Apache Tomcat
>Reporter: Jeff Swindle
>  Labels: features
> Attachments: excel-macro.PNG, test-macro-doc.docm, 
> test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, 
> xlsmacro.xlsm.tika-app-output.txt
>
>
> Tika supports macro-enabled Microsoft Office documents by extracting metadata 
> and contents; however, macros within the document do not appear in the 
> metadata or content output.
> The macro text should be extracted as well.
> Info regarding macro extraction: http://www.decalage.info/vba_tools





[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477438#comment-15477438
 ] 

Tim Allison commented on TIKA-2069:
---

Once we upgrade to POI 3.15-beta3, this _should_ be fairly straightforward, 
thanks to the work of others on POI.  We may want to copy/modify the "find the 
vba.bin file" logic at the Tika level for OOXML files, so that we can pass an 
npoifs into VBAMacroReader from an open OOXML/zip file.
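
The "find the vba.bin file" step mentioned above could look roughly like the following when done with plain java.util.zip. This is only a sketch under stated assumptions: VbaPartFinderSketch is a hypothetical class, and it assumes the macro storage lives in a zip entry whose name ends in "vbaProject.bin" (as in word/vbaProject.bin or xl/vbaProject.bin). Handing the resulting bytes to POI's VBAMacroReader is deliberately left out, since that depends on the POI version in use.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

final class VbaPartFinderSketch {

    // Assumed suffix of the macro storage part inside an OOXML package.
    static final String VBA_PART_SUFFIX = "vbaProject.bin";

    // Returns the bytes of the first matching zip entry, or null if the
    // package contains no such entry.
    static byte[] findVbaPart(InputStream ooxml) throws IOException {
        try (ZipInputStream zip = new ZipInputStream(ooxml)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!entry.isDirectory() && entry.getName().endsWith(VBA_PART_SUFFIX)) {
                    return readCurrentEntry(zip);
                }
            }
        }
        return null;
    }

    // Reads the remainder of the current zip entry into memory.
    private static byte[] readCurrentEntry(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[8192];
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
        }
        return out.toByteArray();
    }
}
```

Matching on the suffix rather than a fixed path keeps the same code usable for Word, Excel and PowerPoint packages, which store the part under different directories.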

> Extract Macro text from Microsoft Office documents
> --
>
> Key: TIKA-2069
> URL: https://issues.apache.org/jira/browse/TIKA-2069
> Project: Tika
>  Issue Type: Improvement
>  Components: detector, parser
>Affects Versions: 1.13
> Environment: RHEL 5.x, Apache Tomcat
>Reporter: Jeff Swindle
>  Labels: features
> Attachments: excel-macro.PNG, test-macro-doc.docm, 
> test-macro-doc.docm-tika-app-output.txt, word-macro.PNG, xlsmacro.xlsm, 
> xlsmacro.xlsm.tika-app-output.txt
>
>
> Tika supports macro-enabled Microsoft Office documents by extracting metadata 
> and contents; however, macros within the document do not appear in the 
> metadata or content output.
> The macro text should be extracted as well.
> Info regarding macro extraction: http://www.decalage.info/vba_tools





[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477415#comment-15477415
 ] 

Tim Allison commented on TIKA-2069:
---

Thanks to [~blagerw...@gmail.com], [~gagravarr] and [~onealj], among others, it 
looks like this is now handled nicely by POI as of 
[bug-52949|https://bz.apache.org/bugzilla/show_bug.cgi?id=52949].



[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Jeff Swindle (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477255#comment-15477255
 ] 

Jeff Swindle commented on TIKA-2069:


OOXML would be great.
Please don't limit it to Word and Excel; we need PowerPoint as well.



[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477188#comment-15477188
 ] 

Tim Allison commented on TIKA-2069:
---

Thank you!  

This question is for [~jeffswindle] and fellow Tika devs (esp. [~rgauss]): 
should we add macros as metadata items or inline them in the content via  
elements?

I'd prefer a metadata item for each macro, but could go either way.

[~jeffswindle], the title of this issue refers to MS Office in general. Is it 
OK to limit this to OOXML, or do you also need this for the older doc and xls 
formats?



[jira] [Commented] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Jeff Swindle (JIRA)

[ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15477176#comment-15477176
 ] 

Jeff Swindle commented on TIKA-2069:


The desire is for Tika to extract macro text from Microsoft Office files, as it 
does metadata and content.
The need is to search for specific signatures that may be present in macros; if 
present, they should be removed prior to distributing the document. Tika would 
facilitate the search.
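
A minimal sketch of the kind of search described above, assuming the macro 
text has already been extracted into a String. The class name, method name and 
sample signatures are hypothetical; matching is case-insensitive because VBA 
identifiers are not case-sensitive.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper for illustration; not part of Tika.
class MacroSignatureScan {

    /** Returns the signatures that occur anywhere in the macro text, ignoring case. */
    static List<String> matches(String macroText, List<String> signatures) {
        String haystack = macroText.toLowerCase(Locale.ROOT);
        List<String> hits = new ArrayList<>();
        for (String signature : signatures) {
            if (haystack.contains(signature.toLowerCase(Locale.ROOT))) {
                hits.add(signature);
            }
        }
        return hits;
    }
}
```

A document whose macros produce a non-empty result would be held back for 
review before distribution.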



[jira] [Updated] (TIKA-2069) Extract Macro text from Microsoft Office documents

2016-09-09 Thread Jeff Swindle (JIRA)

 [ 
https://issues.apache.org/jira/browse/TIKA-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeff Swindle updated TIKA-2069:
---
Attachment: excel-macro.PNG
word-macro.PNG
xlsmacro.xlsm.tika-app-output.txt
xlsmacro.xlsm
test-macro-doc.docm-tika-app-output.txt
test-macro-doc.docm

Word file containing macros, the output from tika-app-1.13, and a screenshot of 
the macro within the Word file.
Excel file containing macros, the output from tika-app-1.13, and a screenshot of 
the macros within the Excel file.
