Hi, I'm currently working on a search engine for a website. I created a MySQL table to run the searches against; it contains the words of the website's articles:
CREATE TABLE docs_words (
    crc32_word int(11) NOT NULL default '0',
    id_doc int(11) NOT NULL default '0'
);

As you can see, the index stores the CRC32 checksum of each word rather than the word itself. So my question is: how high is the probability that two different words have the same CRC32 checksum? Thanks for any responses.
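
To put rough numbers on the question, here is a minimal sketch using the standard birthday approximation. It assumes CRC32 values are spread uniformly over all 2^32 possible checksums, which is an idealization and not something measured on real article text. The function names and the vocabulary sizes in the loop are made up for illustration.

```python
import math

# Number of distinct values a 32-bit checksum can take.
CRC32_SPACE = 2 ** 32


def expected_colliding_pairs(n_words: int, space: int = CRC32_SPACE) -> float:
    """Expected number of word pairs sharing a checksum (uniformity assumed)."""
    return n_words * (n_words - 1) / (2 * space)


def collision_probability(n_words: int, space: int = CRC32_SPACE) -> float:
    """Birthday approximation: P(at least one collision) ~ 1 - exp(-n(n-1)/(2*space))."""
    # -expm1(-x) equals 1 - exp(-x) but stays accurate when x is tiny.
    return -math.expm1(-expected_colliding_pairs(n_words, space))


# Hypothetical vocabulary sizes, chosen only to show how fast the risk grows.
for n in (1_000, 10_000, 100_000, 1_000_000):
    print(f"{n:>9} distinct words: "
          f"P(collision) ~ {collision_probability(n):.4f}, "
          f"expected colliding pairs ~ {expected_colliding_pairs(n):.2f}")
```

For any single pair of different words the chance is about 1 in 2^32, but across a whole vocabulary it adds up quickly: under this assumption, roughly 77,000 distinct words already give about a 50% chance of at least one collision somewhere in the table.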