Hello, I am new to Mahout. I want to know how to get the list of features that were extracted from the corpus by seq2sparse, along with the total number of features.
My problem is that when I view the clustering output using clusterdumper, I get only dense vectors for each point that belongs to a cluster, but I want the sparse vector for each point. Are the vectors output by the clustering algorithm stored as dense vectors, or is clusterdumper converting them to dense vectors? If the clustering algorithm generates sparse vectors, I can use them directly; otherwise I will have to convert the vectors from dense to sparse, for which I need the information mentioned in the paragraph above.

Your suggestions are welcome.

Thanks,
Ashvini