I expected this would be a subject that would bring out input from a lot of
people. Right now this list numbers 65 people, and only a few messages have
flashed back and forth.

I'll sum up the requirements we have so far. Are there any others? Are
there some on the list that we *don't* really need? Is it safe for me to
assume this is the requirements we want in our specification?

Andrew's list:
*  phrase searching
*  fuzzy searching (basically as it is now)
*  use of "+" or "-" as prefix to search words (ala altavista)
*  use of "near" as a method to determine relations between search words
*  cross platform (unix, nt)
*  ability to search only in specific areas of documents (title, headers, etc)
*  better relevance ranking
*  faster results generation for searches returning many hits

Mine:
*  Collections of databases
*  Parallel indexing and searching (no need for alternate files or htmerge)
*  Multithreading support (some sort of locking for writes)
*  Removing duplicate documents
*  Referer links (e.g. AltaVista-style link:)
*  Search for "more like" or "similar to" (a la Excite)
*  On-the-fly editing of search factors (without needing to rebuild the db)

(I also forgot to include)
*  Flexible backend (use Berkeley DB, *SQL, Oracle, etc.)
*  Internationalization (e.g. Chinese support, probably through Unicode)

-Geoff


------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to