Hi, I have a large corpus of text documents and need to find the probability of occurrence of a phrase such as "I am" in that corpus. How do I go about computing it? I can easily count how many times the phrase "I am" occurs across all the sentences in all the documents, but what do I divide that count by? Thanks
With Regards, Abhishek S
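
To make the question concrete, here is a minimal sketch of two common choices of denominator. Neither comes from the original post, so treat both as assumptions. The first divides the phrase count by the number of n-word windows in the corpus (n = 2 for "I am"), giving a relative-frequency estimate of the phrase itself. The second divides by the number of windows that start with the phrase's first n-1 words, giving an estimate of P("am" | "I"). The helper names `tokenize` and `count_phrase`, the lowercase regex tokenizer, and the rule that windows never cross document boundaries are all illustrative assumptions.

```python
import re


def tokenize(text):
    # Assumption: lowercase the text and keep runs of letters/apostrophes as words.
    return re.findall(r"[a-z']+", text.lower())


def count_phrase(documents, phrase):
    """Return (hits, windows, prefix_hits) for `phrase` over `documents`.

    hits        -- windows equal to the whole phrase
    windows     -- all n-word windows (n = number of words in the phrase)
    prefix_hits -- windows whose first n-1 words match the phrase's prefix
    """
    target = tokenize(phrase)
    n = len(target)
    hits = windows = prefix_hits = 0
    for doc in documents:
        tokens = tokenize(doc)
        # Windows are taken inside one document only (an assumption).
        for i in range(len(tokens) - n + 1):
            window = tokens[i:i + n]
            windows += 1
            if window[:-1] == target[:-1]:
                prefix_hits += 1
                if window == target:
                    hits += 1
    return hits, windows, prefix_hits


if __name__ == "__main__":
    docs = ["I am here. I am late.", "Am I early? I think I am."]
    hits, windows, prefix_hits = count_phrase(docs, "I am")
    # Relative frequency of the phrase among all two-word windows.
    print("P(phrase) ~", hits / windows if windows else 0.0)
    # Probability of the last word given the preceding word(s).
    print("P(last | prefix) ~", hits / prefix_hits if prefix_hits else 0.0)
```

Which denominator is appropriate depends on what "probability of the phrase" is meant to capture. The window-based ratio answers "how likely is a randomly chosen two-word position to be this phrase", while the prefix-based ratio answers "how likely is 'am' to follow 'I'". For a phrase that never occurs, both estimates are zero, which is why smoothing is often applied in practice.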