On Wed, Jul 27, 2005 at 10:07:57AM -0500, Andy Lester wrote:
> On Tue, Jul 26, 2005 at 06:38:00PM -0700, Michael G Schwern ([EMAIL PROTECTED]) wrote:
> > More importantly, make your source data as exact as possible so the maximum
> > amount of information is available to the user.  Case is information.
> 
> My only concern was false information, such as X<Operators> instead of
> X<operators>.

This is a valid concern.

I'd see it as:

  Proper nouns and terms where cApItAlIsaTiOn is significant really should be
  put into X<> as is.

  Everything else should be lower case.

  Likewise singular unless the plural is specifically needed.

  No abbreviations. (Or, where a term is abbreviated, use exactly one
  abbreviation for it, consistently everywhere.)

This lets the search tool decide whether it wants to do case-insensitive
matching, rather than giving it no choice.
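
To illustrate why keeping case in the source leaves the choice with the tool,
here is a minimal sketch in Python. The function name, the flag, and the
sample entries are all made up for illustration; no existing search tool is
assumed to work this way.

```python
def find_entries(entries, query, ignore_case=True):
    """Return the index entries that match query.

    The entries are stored exactly as written in X<>, so the caller
    (the search tool) picks the case policy at lookup time.
    """
    if ignore_case:
        wanted = query.casefold()
        return [e for e in entries if e.casefold() == wanted]
    return [e for e in entries if e == query]


# Hypothetical index terms, stored as they would appear inside X<>.
entries = ["operator", "UNIVERSAL", "local", "IO"]

loose = find_entries(entries, "universal")                      # case ignored
strict = find_entries(entries, "universal", ignore_case=False)  # case kept
```

If the entries had all been lowercased at the source, the strict lookup would
have nothing to distinguish, which is the "no choice" case.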

However, I suspect that with a small amount of interactive search and replace
we can correct anything needed by a later change of policy on normalisation.
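
As a rough sketch of what such a search and replace might look like, the
Python below lowercases the contents of simple X<...> entries unless the term
is on a keep list. The keep list and the function name are hypothetical, it
only handles case (not plurals or abbreviations), and it deliberately leaves
alone the doubled-bracket form X<< ... >> and any entry with nested codes.

```python
import re

# Hypothetical list of terms whose capitalisation is significant.
CASE_SIGNIFICANT = {"UNIVERSAL", "IO", "Unicode"}

# Matches only the simple single-bracket form with no nested < or >.
X_ENTRY = re.compile(r"X<([^<>]*)>")


def normalise_index_entries(pod_text, keep=CASE_SIGNIFICANT):
    """Lowercase X<> terms, except those listed in keep."""
    def fix(match):
        term = match.group(1)
        if term in keep:
            return match.group(0)  # leave case-significant terms as is
        return f"X<{term.lower()}>"
    return X_ENTRY.sub(fix, pod_text)


sample = "X<Operators> X<UNIVERSAL> X<< Left Alone >>"
result = normalise_index_entries(sample)
```

In practice one would want to review each change interactively rather than
apply it blindly, since only a human can say whether a given capital matters.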


So this is a bit of a bikeshed issue.

Nicholas Clark
