The simplest solution is always the best - when storing the page, do not
break up sentences. So a page will be all the sentences that occur on
it. If a sentence starts on one page and finishes on the next it will be
included in both pages in the index.

Hope this helps

Mike

www.ardentia.com the home of NetSearch
-----Original Message-----
From: Mile Rosu [mailto:[EMAIL PROTECTED] 
Sent: 11 July 2006 15:55
To: java-user@lucene.apache.org
Subject: Searching for a phrase which spans on 2 pages

Hello,

I am working on an application similar to google books which allows 
searching on documents which represent a scanned page. Of course, one 
might search for a phrase starting at the end of one page and ending at 
the beginning of the next one. In this case I do not know how I might 
treat this. Both pages should be returned as hit results.
Do you have any idea on how this situation might be handled?

Thank you,
Mile Rosu

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to