Yep, If I understand you correctly, you just need to call the transform
method on your new data using the fitted TfidfVectorizer.
Regards,
Philipp
Am 13.04.2013 19:13, schrieb Alex Kopp:
Suppose I used tfidfvectorizer to create features, trained a
classifier, did cross-validation, etc.. Let's say I am happy with the
result and I want to use my classifier with new data. When I am
converting my new (unlabeled) data to a feature vector, don't I need
the IDF from the original tfidf vectorizer to calculate the tfidf of
the words in my new (unlabeled) data point? If so, is there an easy
way to do this?
Thanks
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general