I have a data-set where each data-point has 112 factors. I want to remove the factors which are not relevant, and say reduce to 20 factors out of these 112 and then do clustering of data-points using these 20 factors.
How do I do these and how do I figure out which of the 20 factors are useful for analysis. I see SVD and PCA implementations, but I am not sure if these give which elements are removed and which are remaining. Can someone please help me understand what to do here thanks, -Rohit