Hi All, Recently, when i was solving some ML problem, I came accross an attribute which has Ordinal Values . Eg:
Student ID | Subjects ======================================== 1 | ['Math'] 2 | ['Math','Python'] 3 | ['C'] 4 | ['Python','Statistics'] ======================================== Here, attribute Subjects is a list which contains list of subjects the student is interested in. We have sklearn.preprocessing.OneHotEncoder which encodes a single Categorical variable by creating multiple columns. Similarily, I want to propose different encoder that encodes this type of list and creates new columns , one column for each subject. Allowed values are 1/0 which specifies whether student is interested in this subject or not. I'm new to Open Source contribution. Can someone tell me If there is an existing feature that handles this type of data or If I can start working on this feature. Any response would be appreciated. Thanks Prudvi RajKumar _______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn