What is your dictionary size? A lot of things depend on it. When we calculate
the final probability
On 22 Aug 2014 14:27, "Wei Li" wrote:
> Hi All:
>
> I have successfully compiled Mahout 0.9 on Hadoop and submitted the
> LDA CVB model; most of the parameters are set to default values and the
> --
Hi, I am able to run LDA, but I wanted to know how to measure the accuracy of
the topic model, or any other quality measurement.
Thanks in advance,
Vaibhav.
s that to tfidf.calculate.
>
> Anyone know what I'm doing wrong?
>
>
> Best,
>
> jce
>
--
Thanks and Regards,
Vaibhav Srivastava
Email-id: vaibhavcs...@gmail.com
Mobile no.: 9552543029
>
> java.lang.IllegalArgumentException: Label not found: 471685156584292353
>
> I know that this is due to the fact that the document ID is recognized as a
> label, but I don't know how to resolve that. It would be great if you could
> provide me with a similar example, because I can't fi
that, so I don't know what to put in the key of the sequence file.
> >>
> >> I've searched online for examples, but the only ones that I've found
> >> use the split command on the original dataset and then test on part of
> >> that, which isn't my case.
> >>
> >>
> >> Any ideas for developing a better sentiment analysis are welcome. Thanks
> >> in advance for the help.
> >>
> >>
>
>
Hi,
If you want to create a test set and do not need to measure accuracy, you can
make an instance of the classifier, load your model into that classifier, and
then find the best score for each document.
Look at the naive Bayes test code.
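As a rough sketch of the "find the best score" step only: the snippet below is plain Python, not Mahout code. The label names, their order, and the score values are made up for illustration, and best_label is a hypothetical helper. In Mahout the per-label scores would come from the classifier after the model is loaded.

```python
def best_label(scores, labels):
    """Return (label, score) for the highest-scoring label.

    scores: one score per label for a single document (assumed input).
    labels: label names in the same order as the scores (assumed input).
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    # Index of the largest score; no accuracy is computed here,
    # so no true label is needed for the document.
    best_index = max(range(len(scores)), key=lambda i: scores[i])
    return labels[best_index], scores[best_index]


# Made-up example values, not output from a real model.
labels = ["negative", "neutral", "positive"]
scores = [-42.7, -40.1, -38.9]

label, score = best_label(scores, labels)
print(label, score)
```

Since only the predicted label is needed, the document's key does not have to carry a real label for this step.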
Hope this helps. Thanks.
On 29 Jul 2014 12:53, "Luca Filipponi" wrot