Thank you for the response! I will try this out and let you know how it goes! ________________________________________ From: Suneel Marthi [suneel_mar...@yahoo.com] Sent: Friday, January 31, 2014 8:17 AM To: user@mahout.apache.org Subject: Re: Using Mahout to cluster a large CSV file
Use Mahout's CSVVectorIterator.java to read ur input CSV file and generate vectors. You pass in a java.io.Reader to your CSV file and it generates Dense Vectors (from CSV). U could then feed the generated vectors into KMeans clustering. On Friday, January 31, 2014 7:55 AM, "Allen, Ronald L." <allen...@ornl.gov> wrote: Hi all, Has anyone had any success using Mahout kmeans to cluster a data in a single large CSV file? If so, how did you do it? Thanks, Ronnie