Hi All,

Recently, when i was solving some ML problem, I came accross an
attribute which has Ordinal Values . Eg:

Student ID    |    Subjects
========================================
1            |    ['Math']
2            |    ['Math','Python']
3            |    ['C']
4            |    ['Python','Statistics']
========================================

Here, attribute Subjects is a list which contains list of subjects the
student is interested in. We have sklearn.preprocessing.OneHotEncoder
which encodes a single Categorical variable by creating multiple
columns.
Similarily, I want to propose different encoder that encodes this type
of list and creates new columns , one column for each subject. Allowed
values are 1/0 which specifies whether student is interested in this
subject or not. I'm new to Open Source contribution. Can someone tell
me If there is an existing feature that handles this type of data or
If I can start working on this feature. Any response would be
appreciated.

Thanks
Prudvi RajKumar
_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

Reply via email to