hunter.hammond....@gmail.com wrote:

> This is a knn algorithm for articles that I have gotten. Then determines
> which category it belongs to. I am not getting very good results :/

[snip too much code;)]

- Shouldn't the word frequency vectors be normalized? I don't see that in
  your code. Without that the length of the text may overshade its contents.

- There are probably words that are completely irrelevant. Getting
  rid of these should improve the signal-to-noise ratio.


-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to