Hi Anjan,

I've recently been experimenting with storing PDF captures of web pages in Sling and was hoping they would be automatically full-text indexed so I could run queries using 'CONTAINS'. I'm also using a fairly recent build of Sling 7 (mine might be a month old). I thought that the text would be extracted into a 'text' property, but I am not seeing this, and of course the query then does not work...

Here's a screen grab of the properties of my uploaded PDF's jcr:content node (hope the list allows images)...

I would also love to hear from someone more experienced whether this is expected behavior for Sling 7...
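For reference, here is a minimal sketch of the kind of 'CONTAINS' query I mean. It only builds a JCR-SQL2 statement string; the class name `FullTextQuery`, the helper `build`, and the choice of `nt:resource` as the node type are my own assumptions for illustration, not something Sling provides. In a real bundle the string would be passed to `QueryManager.createQuery(statement, Query.JCR_SQL2)`.

```java
// Hypothetical helper (not part of Sling or Jackrabbit): builds a JCR-SQL2
// full-text statement of the form
//   SELECT * FROM [<nodeType>] AS r WHERE CONTAINS(r.*, '<term>')
final class FullTextQuery {

    private FullTextQuery() {
    }

    // Returns a JCR-SQL2 statement that full-text searches all properties
    // of nodes of the given type for the given search term.
    static String build(String nodeType, String term) {
        // Single quotes inside a SQL2 string literal are escaped by doubling.
        String escaped = term.replace("'", "''");
        return "SELECT * FROM [" + nodeType + "] AS r WHERE CONTAINS(r.*, '" + escaped + "')";
    }

    public static void main(String[] args) {
        // nt:resource is assumed here because uploaded files keep their
        // binary under a jcr:content node of that type.
        System.out.println(build("nt:resource", "sling"));
    }
}
```

With working indexing I would expect this query to return the jcr:content node of any PDF whose body contains the term; in my setup it returns nothing.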
On Thu, Jun 13, 2013 at 12:57 AM, anjan <[email protected]> wrote:

> I did a lot of debugging without much success. When I use the Sling 6 Web
> Application
> <http://mirror.metrocast.net/apache//sling/org.apache.sling.launchpad-6.war>,
> full-text indexing works fine. But Sling 6 uses Apache Tika 0.6
> (I believe Jackrabbit internally uses Tika for metadata and text
> extraction). Secondly, in Sling 6 all of Tika (Core and Parsers) is
> bundled as a single OSGi bundle. In the latest build of Sling, Tika 1.0
> is used, and Tika Core and Tika Parsers are deployed as separate OSGi
> bundles. 'Search' is an important feature and it is not working. Please
> reply if anyone has noticed this issue.
>
> --
> View this message in context:
> http://apache-sling.73963.n3.nabble.com/Full-text-indexing-is-not-happening-tp4024383p4024535.html
> Sent from the Sling - Users mailing list archive at Nabble.com.
