Can you give us some hints about what you are doing?

Which version of Mahout, which platform, and which components you are using
would be a good start.

What did you need to do?

How did you observe this (i.e. what did you run, on what data)?

Why is this a problem?

Speaking in general terms without this information, all I can say is that as
a document gets longer, it is easier to classify and so certainty should
generally increase.

In the extreme case, take a document with one word: "window"

What category should that document be in?  Computers (open a new window)?
Building (how to fix a broken window)?  Politics (opening a window on the
east)?  Orbital dynamics (launch window)?

On the other hand, if the document is 1000 words, you should be able to
determine the topic quite accurately.
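
To make that concrete, here is a toy naive Bayes sketch in Python. It is not
Mahout's code and I am not claiming Mahout computes its score this way; the
two categories, the word probabilities and the UNSEEN fallback are all
invented for illustration. It only shows the general effect: a raw score that
sums per-word log probabilities grows in magnitude with document length,
while the normalized posterior moves from a coin flip toward certainty as
more words arrive.

```python
import math

# Hypothetical per-category word probabilities (invented for illustration).
WORD_PROBS = {
    "computers": {"window": 0.05, "open": 0.05, "browser": 0.10, "glass": 0.01},
    "building":  {"window": 0.05, "open": 0.03, "browser": 0.01, "glass": 0.10},
}
UNSEEN = 0.001  # assumed probability for any word missing from the table


def log_scores(words):
    """Sum of log word probabilities per category (equal priors assumed)."""
    return {
        cat: sum(math.log(probs.get(w, UNSEEN)) for w in words)
        for cat, probs in WORD_PROBS.items()
    }


def posteriors(words):
    """Normalize the raw log scores into probabilities that sum to 1."""
    scores = log_scores(words)
    top = max(scores.values())  # subtract the max to avoid underflow
    weights = {cat: math.exp(s - top) for cat, s in scores.items()}
    total = sum(weights.values())
    return {cat: w / total for cat, w in weights.items()}


short_doc = ["window"]
long_doc = ["window", "open", "browser", "browser"]

for doc in (short_doc, long_doc):
    print(len(doc), log_scores(doc), posteriors(doc))
```

With these made-up numbers the one-word document is an even split between
the two categories, and the four-word document lands firmly in "computers".
If you want to compare documents of different lengths, the normalized
posterior (or the raw score divided by the word count) is usually a better
thing to look at than the raw sum.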

On Wed, Apr 27, 2011 at 5:37 AM, mohammed.farrag <
mohammed.far...@pearlox.com> wrote:

> What does the score of classification mean ?
> I notice that score is directly proportional with the number of  given
> words, which will make it difficult to determine in what degree the file is
> related to a specific category ?
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Scoring-issue-tp2870234p2870234.html
> Sent from the Mahout User List mailing list archive at Nabble.com.
>
