We are indexing/searching for  long strings, containing separators. htdig 
breaks these up into tokens
and then does a boolean  AND on them.

We were testing how many tokens are tolerated before htdig is unable to 
distinguish
between two strings.  Tests show that if the string is broken into 14 tokens, 
it 
gives a unique result, but if the 15th token differs, then it does not 
recognise this
difference. 

Can anyone comment on the coded token length of htdig (this  for  V 3.1.6).
Does perchance 3.2 differ in this regard?

Apologies if this is a developer question! 
-- 

Henry Rzepa. 
+44 (020) 7594 5774 (Voice); +44 (0870) 132 3747 (eFax); [EMAIL PROTECTED] 
(iChat)
 http://www.ch.ic.ac.uk/rzepa/ Dept. Chemistry, Imperial College London, SW7  
2AZ, UK. 

(Voracious anti-spam filter in operation for received email.
If expected reply not received, please phone/fax). 




-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now. 
http://productguide.itmanagersjournal.com/
_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general

Reply via email to