Steve Onnis wrote: > What I want to do it implement like a "Recently Updated" type of function > into an app but I don't want to go by the Date Modified stamp because they > might just be fixing a typo or something in there which doesn't really class > as a major update.
I am wondering if a tool we wrote as part of a SPAM checking engine could be put to use here. It used the triad idea (out of IBM IIRC) but working on entire words rather than characters. It parsed a string into sets of three words: 1,2,3;2,3,4;3,4,5; etc , put each triple word set into a structure and in the case of the SPAM engine it then compared those triplets with an existing structure with values for each triplet to get a score of goodness or badness. If you did that type of process on one string and then repeated it on the other then you will be able to compare the structures and pick out word sets that are different. Then you can look at the differences and score the amount of change in each case, sum them up for a grand difference score and with some suitable scaling is close to what you are looking for is it not? I am happy to share the code. Its all custom tags as its part of our mail server which is running CF5 and it runs fast, it has to :-) -- Yours, Kym Kovan mbcomms --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "cfaussie" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/cfaussie?hl=en -~----------~----~----~----~------~----~------~--~---
