Thanks, but I still need to "store" text at any rate in order to get the highlighted snippets for the search results list. This isn't a problem. The issue is how to obtain correct offsets or other mechanisms for being able to display the original HTML text plus term highlighting when navigating to an individual search result.

Scott.

On 1/10/10 12:53 PM, Neil Lunn wrote:
On Fri, 2010-10-01 at 12:00 +1000, Scott Yeadon wrote:
Hi,

The problem is that the article text is HTML and Solr appears to strip
the HTML by default.
I think what you need to look at is how the fields are defined by
default in your schema. If Data sent as HTML is being added to the
standard html-text type and stored then the html is stripped and words
indexed by default. If you want to store the raw html then maybe you
should be doing that and not storing the stripped version, just indexing
it.


Reply via email to