Re: PDF Highlighting using PDF Highlight File

Dawn Zoë Raison Thu, 12 May 2011 08:05:56 -0700


On 12/05/2011 15:47, Wulf Berschin wrote:

I think support for highlighting documents would be a very welcomefeature. Highlighting HTML documents is already possible with theorg.apache.solr.analysis.HTMLStripCharFilter and a NullFragmenter, butther seems to be nothing for highlighting PDF files...

It would be very useful. That said being able to highlight words in anypictorial representation of a document would be a huge bonus.

As starting point I quarried out fromorg.apache.lucene.search.highlight.Highlighter the class below whichjust returns the Tokens contributing to the hit.

I use a similar (home brew) solution to extract the hit terms, and thenpass them to the Adobe PDF viewer plugin as a search term via the PDF URL.



--

Rgds.
*Dawn Raison*
Technical Director, Digitorial Ltd.

E:[email protected] W:http://www.digitorial.co.uk
M: 07956 609 618                T: 01428 729 431
Reg: 04644583, England&  Wales
Church Villas Ecchinswell, Newbury, RG20  4TT

This email and any attached files are for the exclusive use of theaddressee and may contain privileged and/or confidential information. Ifyou receive this email in error you should not disclose the contents toany other person nor take copies but should delete it immediately.Digitorial Ltd makes no warranty as to the accuracy or completeness ofthis email and accepts no liability for its contents or use. Anyopinions expressed in this email are those of the author and do notnecessarily reflect the opinions of Digitorial Ltd.

Re: PDF Highlighting using PDF Highlight File

Reply via email to