Sorry, but not only can you not capture that specific <div>, you cannot
capture ANY <div>. Really. For some mysterious reason, Tika silently eats
<div> HTML parsing events. Plenty of other HTML tags can be captured, but
not <div>.
Both the Solr Wiki for Solr Cell and the new/Lucid Apache Solr Reference
Guide mislead people with examples that clearly can never run as expected
with real data.
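One possible workaround, which is not a Solr Cell or Tika feature, is to pull the text of the <div> out on the client side and index that text through the regular update handler instead. The following is only a minimal sketch under that assumption, using Python's standard html.parser module. The names DivByIdExtractor and extract_div_text are hypothetical, made up for illustration.

```python
from html.parser import HTMLParser


class DivByIdExtractor(HTMLParser):
    """Hypothetical helper: collects the text inside the first <div>
    whose id attribute equals target_id, including nested <div>s."""

    def __init__(self, target_id):
        super().__init__(convert_charrefs=True)
        self.target_id = target_id
        self.depth = 0       # <div> nesting depth inside the target; 0 = outside
        self.done = False    # set once the target <div> has been closed
        self.chunks = []     # text fragments seen inside the target

    def handle_starttag(self, tag, attrs):
        # Only <div> tags affect nesting, so unclosed <p>, <br>, etc. are harmless.
        if tag != "div" or self.done:
            return
        if self.depth > 0:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag == "div" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data)


def extract_div_text(html, target_id):
    """Return the whitespace-normalized text of the matching <div>, or ""."""
    parser = DivByIdExtractor(target_id)
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


if __name__ == "__main__":
    page = '<div id="other">skip</div><div id="myDiv">Hello <b>world</b></div>'
    print(extract_div_text(page, "myDiv"))
```

The extracted string could then be sent to Solr as an ordinary field value in a normal update request, bypassing the extract handler for that piece of content.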
-- Jack Krupansky
-----Original Message-----
From: eShard
Sent: Tuesday, June 25, 2013 1:17 PM
To: solr-user@lucene.apache.org
Subject: Is there a way to capture div tag by id?
Let's say I have a div with id="myDiv".
Is there a way to set up the Solr update/extract handler to capture just that
particular div?