Yep, If I understand you correctly, you just need to call the transform method on your new data using the fitted TfidfVectorizer.

Regards,
Philipp

Am 13.04.2013 19:13, schrieb Alex Kopp:
Suppose I used tfidfvectorizer to create features, trained a classifier, did cross-validation, etc.. Let's say I am happy with the result and I want to use my classifier with new data. When I am converting my new (unlabeled) data to a feature vector, don't I need the IDF from the original tfidf vectorizer to calculate the tfidf of the words in my new (unlabeled) data point? If so, is there an easy way to do this?

Thanks


------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter


_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to