Congratulations on the book. I ordered my copy the other day via
regular post and am eagerly awaiting it. It looks like it will make
lucene available to a much wider audience.
Based on the table of contents, I wanted to toss out a couple of ideas
for your next book or articles.
1. I didn't see
When comparing RAMDirectory and FSDirectory it is important to mention
what OS you are using. When using linux it will cache the most recent
disk access in memory. Here is a good article that describes its
strategy: http://forums.gentoo.org/viewtopic.php?t=175419
The 2% difference you are seeing
I investigated how the algorithm implemented in this spell checker
compares with my simple implementation of a spell checker.
First here is what my implementation looks like:
//Each word becomes a single Lucene Document
//To find suggestions:
FuzzyQuery fquery = new FuzzyQuery(new Term("word"
That is exactly right. It is searching the ASCII. To solve it I pad
my price using a method like this:
/**
* Pads the Price so that all prices are the same number of characters and
* can be compared lexigraphically.
* @param price
* @return
*/
public static String formatPriceA
and PDFs. It may
be a good reference to figure out how to create an open source Google
Desktop Search.
Jonathan Hager
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]