OK, that's why mahout need a file descriptor, and wha't the difference between CSV and UCI?
On Wed, Jul 6, 2011 at 6:28 PM, Sean Owen <sro...@gmail.com> wrote: > Yes, but, my point is that it doesn't quite make sense to do such a thing in > MapReduce. Only one mapper will see the header, but, presumably all mappers > need that info. If it's a bit of metadata, pass it in the Configuration > object as a String. If it's a lot, put it in the DistributedCache (or on > HDFS and pass the location for mappers to read). > > On Wed, Jul 6, 2011 at 11:23 AM, Xiaobo Gu <guxiaobo1...@gmail.com> wrote: > >> Hi Sean, >> >> Thanks for your reply first, so we must wirte specific code to >> handle the CSV header if we have it in the file, right? >> >> Xiaobu Gu >> >> >