Sorry, but not only can you not capture that specific <div>, but you cannot capture ANY <div>. Really. For some mysterious reasoning, Tika silently eats <div> HTML parsing events. Plenty of other HTML tags can be captured, but not <div>.

Both the Solr Wiki for Solr Cell and the new/Lucid Apache Solr Reference Guide mislead people with examples that clearly can never run as expected with real data.

-- Jack Krupansky
-----Original Message----- From: eShard
Sent: Tuesday, June 25, 2013 1:17 PM
To: solr-user@lucene.apache.org
Subject: Is there a way to capture div tag by id?

let's say I have a div with id="myDiv"
Is there a way to set up the solr upate/extract handler to capture just that
particular div?



--
View this message in context: http://lucene.472066.n3.nabble.com/Is-there-a-way-to-capture-div-tag-by-id-tp4073120.html Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to