On Tue, Aug 4, 2009 at 8:31 AM, Shai Erera<ser...@gmail.com> wrote: > Hi Darren, > > The question was, how given a string "aboutus" in a document, you can return > that document as a result to the query "about us" (note the space). So we're > mostly discussing how to detect and then break the word "aboutus" to two > words.
When traversing Japanese text you have a use a similar algorithm to searching a maze (keep left and retrace your steps). It's possible to go a long way along sentence before you find the tokens you've already picked out are invalid. Rough example... thereallibrary there allibrary there all i brary (fail) the reallibrary the real library --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org