Hi Anjan,

I've recently been experimenting with storing pdf captures of web pages in
sling and was hoping they would be automatically full-text indexed so I
could do queries using 'CONTAINS'.  I'm also using a fairly recent build of
Sling 7 (mine might be a month old).  I thought that the text would be
extracted into a 'text' property, but I am not seeing this and of course
then the query does not work...  Here's a screen grab of the properties of
my upload pdf's jcr:content node (hope the list allows images)...  I would
also love to hear from someone more experienced if this is expected
behavior or not for Sling 7....



On Thu, Jun 13, 2013 at 12:57 AM, anjan <[email protected]> wrote:

> I did lot of debugging without much success.  When I use  Sling 6 Web
> Application
> <
> http://mirror.metrocast.net/apache//sling/org.apache.sling.launchpad-6.war
> >
> , the full text indexing is working fine.  But in Sling 6, Apache Tika 0.6
> is used (I believe Jackrabbit internally uses Tika for metadata and text
> extraction).  Secondly, the entire Tika is bundled as a single OSGI bundle
> (Core and Parsers) in Sling 6.  But in the latest build of Sling Tika 1.0
> is
> used and Tika Core and Tikar Parsers are deployed as separate OSGI bundles.
> 'Search' is an important feature and it's is not working.  Please reply if
> anyone has noticed this issue.
>
>
>
> --
> View this message in context:
> http://apache-sling.73963.n3.nabble.com/Full-text-indexing-is-not-happening-tp4024383p4024535.html
> Sent from the Sling - Users mailing list archive at Nabble.com.
>

Reply via email to