On 08/23/2012 03:30 PM, andrea maestroni wrote:
so i must add some line to the train file? or adding other file?
there are some example for the file and for the classification?
The problem here is that the default training does a feature
cutoff of 5. So a feature must be seen at least 5 times to be included
in the training. With just two training samples you do not get to 5,
it should not crash if you set the cutoff to 0.
But in the end the model will really be able to predict anything with
just two training samples. Usually you want to train with at least a few
hundred
or thousands of samples.
You need to add more lines to the training file. Each line is one
document, starting
with the category, just like in the sample you experimented with.
Jörn