On Dec 8, 2007 11:15 AM, James Fang [EMAIL PROTECTED] wrote:
How do you handle this situation with the phrase count / total phrase count
probability? Is the probability of the only phrase 100% even though it occurs
rarely in the whole document?
Amazon's SIPs :)
--
Venkat
Hi Abhishek,
On Dec 3, 11:48 pm, Abhishek [EMAIL PROTECTED] wrote:
Hi,
I just came across an idea wherein they wanted to find out how
frequently a particular phrase occurs in a set of documents.
So they refer to the probability of that phrase coming in a set of
documents put together.
Hi Abhishek,
Regarding your question about finding the probability of the occurrence
of a phrase:
I guess it's not as straightforward as dividing the total number of occurrences
by the total number of words/keywords/sentences.
In my view the question is about finding the probability of a
particular 'phrase' out of
Hi Abhishek,
You need to build up your metric for probability first.
For example:
1. Keyword occurrences / total word count
2. Keyword occurrences / total number of sentences
3. Number of files that contain the keyword / total number of files
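
The three candidate metrics above can be sketched roughly as follows. This is only an illustration under simple assumptions: words are runs of word characters, sentences end at '.', '!' or '?', matching is case-insensitive, and the function names (count_phrase, phrase_metrics) are made up for this example.

```python
import re

def count_phrase(text, phrase):
    """Count non-overlapping, case-insensitive occurrences of phrase in text.

    Assumes the phrase starts and ends with a word character, so that the
    word-boundary anchors behave as expected.
    """
    pattern = r"\b" + re.escape(phrase) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))

def phrase_metrics(docs, phrase):
    """Return the three ratios from the list above for a list of document strings."""
    per_doc = [count_phrase(d, phrase) for d in docs]
    total_occurrences = sum(per_doc)
    # Naive tokenisation: a word is any run of word characters.
    total_words = sum(len(re.findall(r"\w+", d)) for d in docs)
    # Naive sentence split: break on '.', '!' or '?', drop empty pieces.
    total_sentences = sum(
        len([s for s in re.split(r"[.!?]+", d) if s.strip()]) for d in docs
    )
    docs_with_phrase = sum(1 for c in per_doc if c > 0)
    return {
        "per_word": total_occurrences / total_words if total_words else 0.0,
        "per_sentence": total_occurrences / total_sentences if total_sentences else 0.0,
        "doc_fraction": docs_with_phrase / len(docs) if docs else 0.0,
    }

if __name__ == "__main__":
    sample = ["The quick fox. The quick dog.", "A slow fox."]
    print(phrase_metrics(sample, "quick fox"))
```

Which ratio is the right "probability" depends on what the number is used for; the three can differ a lot for the same phrase.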
Best Regards,
James Fang
-Original Mail-
From:
Thanks James. I was thinking on the same lines too.
I guess I have some homework to do in this regard :)
With Regards,
Abhishek S
On Dec 3, 1:27 pm, James Fang [EMAIL PROTECTED] wrote:
Hi Abhishek,
You need to build up your metric for probability first.
For example:
1.
No problem, man.
I wonder what the usage scenario for your probability is. Whether it is one
document or multiple documents does not matter, because you can simply combine
their statistics.
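
As a rough sketch of combining statistics across documents: if each document is summarised by a pair (phrase occurrences, total words), the pooled ratio is the sum of occurrences over the sum of words. The function name pool_ratio and the tuple layout are assumptions made for this example only.

```python
def pool_ratio(per_doc_stats):
    """Pool (phrase_occurrences, total_words) pairs into one overall ratio."""
    stats = list(per_doc_stats)  # allow generators; we iterate twice
    occurrences = sum(o for o, _ in stats)
    words = sum(w for _, w in stats)
    return occurrences / words if words else 0.0

def mean_of_ratios(per_doc_stats):
    """Average the per-document ratios instead (shown only for comparison)."""
    ratios = [o / w for o, w in per_doc_stats if w]
    return sum(ratios) / len(ratios) if ratios else 0.0

if __name__ == "__main__":
    stats = [(1, 6), (0, 3)]  # two documents: 1 hit in 6 words, 0 hits in 3 words
    print(pool_ratio(stats), mean_of_ratios(stats))
```

Note that pooling the counts is not the same as averaging the per-document ratios: pooling weights each document by its length, while the plain average treats every document equally.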
Best Regards,
James Fang
-Original Mail-
From: algogeeks@googlegroups.com [mailto:[EMAIL PROTECTED]
Hi,
I just came across an idea wherein they wanted to find out how
frequently a particular phrase occurs in a set of documents.
So they refer to the probability of that phrase coming in a set of
documents put together.
I was just wondering how they find the probability of the phrase in
the