Hi Abhishek,
On Dec 3, 11:48 pm, Abhishek <[EMAIL PROTECTED]> wrote: > Hi, > > I just came across an idea wherein they wanted to find out how > frequently a particular phrase occurs in a set of documents. > So they refer to the probability of that phrase coming in a set of > documents put together. > I was just wondering how they find the probability of the phrase in > the whole set of documents. > > With Regards, > Abhishek S > > On Dec 3, 9:51 pm, "James Fang" <[EMAIL PROTECTED]> wrote: > > > > > No problem man. > > I wonder what's the user senario of your probability.One document or > > multiple documents are not the matter, because u can actually combine their > > stastistics together. > > > Best Regards, > > James Fang > > > -----Original Mail----- > > 发件人: algogeeks@googlegroups.com [mailto:[EMAIL PROTECTED] 代表 > > Abhishek > > 发送时间: 2007年12月3日 17:17 > > 收件人: Algorithm Geeks > > 主题: [algogeeks] Re: Probability of a phrase in a text document? > > > Thanks James. I was thinking on the same lines too. > > I guess I have some homework to be done on this regard :) > > > With Regards, > > Abhishek S > > > On Dec 3, 1:27 pm, "James Fang" <[EMAIL PROTECTED]> wrote: > > > > Hi Abhishek, > > > > You need to build up your metric for "probability" first. > > > For e.g., > > > 1. keywords occurrence/total words count > > > 2. Keywords occurrence/total sentences > > > 3. the number of files who contain keyword / total files number > > > > Best Regards, > > > James Fang > > > > -----邮件原件----- > > > 发件人: algogeeks@googlegroups.com [mailto:[EMAIL PROTECTED] 代 > > 表 > > > Abhishek > > > 发送时间: 2007年12月3日 16:10 > > > 收件人: Algorithm Geeks > > > 主题: [algogeeks] Probability of a phrase in a text document? > > > > Hi, > > > If I have a large corpus of text documents and I need to find the > > > probability of occurence of a phrase like "I am" in the given set of > > > text documents, how do I go about finding the value? > > > I can very well search how many time does the phrase "I am" occurs in > > > the whole set of text documents including all the sentences, but what > > > do i divide the count by? > > > Thanks > > > > With Regards, > > > Abhishek S- Hide quoted text - > > - Show quoted text - --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Algorithm Geeks" group. To post to this group, send email to algogeeks@googlegroups.com To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/algogeeks -~----------~----~----~----~------~----~------~--~---