Gerry Snyder <[EMAIL PROTECTED]> wrote:

> Chris Jones wrote:
> Hi all,

> I have a very simple schema.  I need to assign a unique identifier
> to a large collection of strings, each at most 80-bytes, although
> typically shorter.

Would it help to hash the strings and store each hash in the DB alongside
its string, checking the hash instead of the string for duplication?
Distinct strings can still collide on the same hash, depending on what
hashing algorithm you use, but those cases should be relatively easy to
find afterwards.  Hashing could take a while, but it is linear in the total
length of the input.
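
A rough sketch of that idea in Python, using the standard sqlite3 and
hashlib modules.  The table layout, the names open_db, short_hash and
get_id, and the choice of a truncated SHA-1 are all assumptions made for
illustration, not anything from the original schema.  It also differs
slightly from the suggestion above: on a hash match it compares the stored
string right away, so collisions are resolved at insert time rather than
cleaned up afterwards.

```python
import hashlib
import sqlite3


def open_db(path=":memory:"):
    """Create the (hypothetical) table plus a non-unique index on the hash."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS strings ("
        " id INTEGER PRIMARY KEY,"
        " hash INTEGER NOT NULL,"
        " value TEXT NOT NULL)"
    )
    conn.execute("CREATE INDEX IF NOT EXISTS strings_hash ON strings(hash)")
    return conn


def short_hash(s):
    """First 8 bytes of SHA-1 as a signed 64-bit int (fits an SQLite INTEGER)."""
    digest = hashlib.sha1(s.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big", signed=True)


def get_id(conn, s):
    """Return the id for s, inserting it if it is not already stored."""
    h = short_hash(s)
    rows = conn.execute("SELECT id, value FROM strings WHERE hash = ?", (h,))
    for row_id, value in rows:
        # Same hash does not guarantee same string, so confirm the match.
        if value == s:
            return row_id
    cur = conn.execute("INSERT INTO strings (hash, value) VALUES (?, ?)", (h, s))
    return cur.lastrowid
```

The index lookup is on a fixed-size integer rather than on the string
itself, which is the point of the suggestion; the string comparison only
runs for the few rows that share a hash.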


