[algogeeks] Re: Probability of a phrase in a text document?

2007-12-08 Thread Venkatraman S
On Dec 8, 2007 11:15 AM, James Fang [EMAIL PROTECTED] wrote: How do u handle this situation with the phrase count/ total phrase count probability? Is the probability of the only phrase 100% while it occurs rarely in the whole document? Amazon's SIPs :) -- Venkat Blog @

[algogeeks] Re: Probability of a phrase in a text document?

2007-12-04 Thread Shobhit Sinha
Hi Abhishek, On Dec 3, 11:48 pm, Abhishek [EMAIL PROTECTED] wrote: Hi, I just came across an idea wherein they wanted to find out how frequently a particular phrase occurs in a set of documents. So they refer to the probability of that phrase coming in a set of documents put together.

[algogeeks] Re: Probability of a phrase in a text document?

2007-12-04 Thread Shobhit Sinha
Hi Abhishek, Regarding your question about finding probability of the occurence of a phrase. I guess its not as straight as dividing the total no of occurences with total no of words/keywords/sentences. In my view the question is about finding the probability of a particular 'phrase' out of

[algogeeks] Re: Probability of a phrase in a text document?

2007-12-03 Thread James Fang
Hi Abhishek, You need to build up your metric for probability first. For e.g., 1. keywords occurrence/total words count 2. Keywords occurrence/total sentences 3. the number of files who contain keyword / total files number Best Regards, James Fang -邮件原件- 发件人:

[algogeeks] Re: Probability of a phrase in a text document?

2007-12-03 Thread Abhishek
Thanks James. I was thinking on the same lines too. I guess I have some homework to be done on this regard :) With Regards, Abhishek S On Dec 3, 1:27 pm, James Fang [EMAIL PROTECTED] wrote: Hi Abhishek, You need to build up your metric for probability first. For e.g., 1.

[algogeeks] Re: Probability of a phrase in a text document?

2007-12-03 Thread James Fang
No problem man. I wonder what's the user senario of your probability.One document or multiple documents are not the matter, because u can actually combine their stastistics together. Best Regards, James Fang -Original Mail- 发件人: algogeeks@googlegroups.com [mailto:[EMAIL PROTECTED]

[algogeeks] Re: Probability of a phrase in a text document?

2007-12-03 Thread Abhishek
Hi, I just came across an idea wherein they wanted to find out how frequently a particular phrase occurs in a set of documents. So they refer to the probability of that phrase coming in a set of documents put together. I was just wondering how they find the probability of the phrase in the