Hi All, I am new to *Machine Learning* and *NLP*.
I want to categorize a line/sentence into a pre-defined category using the OpenNLP toolkit. For that, I am following the developer documentation as well as the link below:

http://madhawagunasekara.blogspot.in/2014/11/nlp-categorizer.html

I understand data.txt (the training data set mentioned at the link above), but I can't understand what the file *en-doccat.bin* is or what it contains. Please help me so that I can implement this.

Thanks,
Amit
