[jira] [Updated] (SLING-2924) Full text extraction issue with Tika v1.0 under OSGi environment
[ https://issues.apache.org/jira/browse/SLING-2924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Munteanu updated SLING-2924: --- Component/s: (was: JCR) Launchpad Fix Version/s: Launchpad Builder 7 > Full text extraction issue with Tika v1.0 under OSGi environment > > > Key: SLING-2924 > URL: https://issues.apache.org/jira/browse/SLING-2924 > Project: Sling > Issue Type: Bug > Components: Launchpad >Reporter: Anjan >Assignee: Robert Munteanu > Labels: tika,text-extraction > Fix For: Launchpad Builder 7 > > > The latest stable build (I checked out revision 1487628) of Sling is using > Jackrabbit version 2.4.2 and it uses Tika version 1.0 for extracting > metatdata and text for indexing purpose. Jackrabbit v2.4.2 deployed as a > separate web application extracts metadata and text from the uploaded > documents perfectly fine, but when deployed in Sling (OSGi environment), full > text extraction doesn't work. > Updating the Tika dependency to Version 1.2 in Sling resolved the above issue. > Secondly, if the indexes are deleted from the repository and the server is > restarted, indexes are not rebuilt for the existing documents. The Tika > bundles were not ready by the time Jackrabbit starts to rebuild the indexes > during the Sling server start up. Updating the startlevel from 15 to 10 for > the Tika bundles helps to resolve the issue. > The changes related to above fixes are in > /launchpad/builder/src/main/bundles/list.xml file. > Currently Tika bundles are at start level 15 as shown below: > > .. > > org.apache.tika > tika-core > 1.0 > > > org.apache.tika > tika-bundle > 1.0 > > .. > > Moved the above bundles to start level 10 and also the version is changed to > 1.2 > > .. > > org.apache.tika > tika-core > 1.2 > > > org.apache.tika > tika-bundle > 1.2 > > .. > -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (SLING-2924) Full text extraction issue with Tika v1.0 under OSGi environment
[ https://issues.apache.org/jira/browse/SLING-2924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Munteanu updated SLING-2924: --- Assignee: Robert Munteanu > Full text extraction issue with Tika v1.0 under OSGi environment > > > Key: SLING-2924 > URL: https://issues.apache.org/jira/browse/SLING-2924 > Project: Sling > Issue Type: Bug > Components: JCR >Reporter: Anjan >Assignee: Robert Munteanu > Labels: tika,text-extraction > > The latest stable build (I checked out revision 1487628) of Sling is using > Jackrabbit version 2.4.2 and it uses Tika version 1.0 for extracting > metatdata and text for indexing purpose. Jackrabbit v2.4.2 deployed as a > separate web application extracts metadata and text from the uploaded > documents perfectly fine, but when deployed in Sling (OSGi environment), full > text extraction doesn't work. > Updating the Tika dependency to Version 1.2 in Sling resolved the above issue. > Secondly, if the indexes are deleted from the repository and the server is > restarted, indexes are not rebuilt for the existing documents. The Tika > bundles were not ready by the time Jackrabbit starts to rebuild the indexes > during the Sling server start up. Updating the startlevel from 15 to 10 for > the Tika bundles helps to resolve the issue. > The changes related to above fixes are in > /launchpad/builder/src/main/bundles/list.xml file. > Currently Tika bundles are at start level 15 as shown below: > > .. > > org.apache.tika > tika-core > 1.0 > > > org.apache.tika > tika-bundle > 1.0 > > .. > > Moved the above bundles to start level 10 and also the version is changed to > 1.2 > > .. > > org.apache.tika > tika-core > 1.2 > > > org.apache.tika > tika-bundle > 1.2 > > .. > -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira