[jira] [Updated] (SLING-2924) Full text extraction issue with Tika v1.0 under OSGi environment

2013-07-09 Thread Robert Munteanu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SLING-2924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Munteanu updated SLING-2924:
---

  Component/s: (was: JCR)
   Launchpad
Fix Version/s: Launchpad Builder 7

> Full text extraction issue with Tika v1.0 under OSGi environment
> 
>
> Key: SLING-2924
> URL: https://issues.apache.org/jira/browse/SLING-2924
> Project: Sling
>  Issue Type: Bug
>  Components: Launchpad
>Reporter: Anjan
>Assignee: Robert Munteanu
>  Labels: tika,text-extraction
> Fix For: Launchpad Builder 7
>
>
> The latest stable build (I checked out revision 1487628) of Sling is using 
> Jackrabbit version 2.4.2 and it uses Tika version 1.0 for extracting 
> metatdata and text for indexing purpose.  Jackrabbit v2.4.2 deployed as a 
> separate web application extracts metadata and text from the uploaded 
> documents perfectly fine, but when deployed in Sling (OSGi environment), full 
> text extraction doesn't work.
> Updating the Tika dependency to Version 1.2 in Sling resolved the above issue.
> Secondly, if the indexes are deleted from the repository and the server is 
> restarted, indexes are not rebuilt for the existing documents.  The Tika 
> bundles were not ready by the time Jackrabbit starts to rebuild the indexes 
> during the Sling server start up.  Updating the startlevel from 15 to 10 for 
> the Tika bundles helps to resolve the issue.
> The changes related to above fixes are in 
> /launchpad/builder/src/main/bundles/list.xml file.
> Currently Tika bundles are at start level 15 as shown below:
> 
> ..
> 
> org.apache.tika
> tika-core
> 1.0
> 
> 
> org.apache.tika
> tika-bundle
> 1.0
> 
> ..
> 
> Moved the above bundles to start level 10 and also the version is changed to 
> 1.2
> 
> ..
> 
> org.apache.tika
> tika-core
> 1.2
> 
> 
> org.apache.tika
> tika-bundle
> 1.2
> 
> ..
> 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (SLING-2924) Full text extraction issue with Tika v1.0 under OSGi environment

2013-07-09 Thread Robert Munteanu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SLING-2924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Munteanu updated SLING-2924:
---

Assignee: Robert Munteanu

> Full text extraction issue with Tika v1.0 under OSGi environment
> 
>
> Key: SLING-2924
> URL: https://issues.apache.org/jira/browse/SLING-2924
> Project: Sling
>  Issue Type: Bug
>  Components: JCR
>Reporter: Anjan
>Assignee: Robert Munteanu
>  Labels: tika,text-extraction
>
> The latest stable build (I checked out revision 1487628) of Sling is using 
> Jackrabbit version 2.4.2 and it uses Tika version 1.0 for extracting 
> metatdata and text for indexing purpose.  Jackrabbit v2.4.2 deployed as a 
> separate web application extracts metadata and text from the uploaded 
> documents perfectly fine, but when deployed in Sling (OSGi environment), full 
> text extraction doesn't work.
> Updating the Tika dependency to Version 1.2 in Sling resolved the above issue.
> Secondly, if the indexes are deleted from the repository and the server is 
> restarted, indexes are not rebuilt for the existing documents.  The Tika 
> bundles were not ready by the time Jackrabbit starts to rebuild the indexes 
> during the Sling server start up.  Updating the startlevel from 15 to 10 for 
> the Tika bundles helps to resolve the issue.
> The changes related to above fixes are in 
> /launchpad/builder/src/main/bundles/list.xml file.
> Currently Tika bundles are at start level 15 as shown below:
> 
> ..
> 
> org.apache.tika
> tika-core
> 1.0
> 
> 
> org.apache.tika
> tika-bundle
> 1.0
> 
> ..
> 
> Moved the above bundles to start level 10 and also the version is changed to 
> 1.2
> 
> ..
> 
> org.apache.tika
> tika-core
> 1.2
> 
> 
> org.apache.tika
> tika-bundle
> 1.2
> 
> ..
> 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira