trainlogistic and runlogistic

2011/5/7, Ted Dunning <ted.dunn...@gmail.com>:
> Huh?
>
> What program are you talking about?
>
> On Fri, May 6, 2011 at 9:36 PM, Xiaobo Gu <guxiaobo1...@gmail.com> wrote:
>
>> >> > 2. In production mode, don't use CSV; you will find that most of the
>> >> > time is spent parsing the CSV data and hashing it into features. You
>> >> > might encode the features as vectors and serialize them to the file
>> >> > system with MapReduce to reduce the cost of data parsing.
>> >>
>> >> Currently we are not familiar with Vectors. Is there a standard way
>> >> (command line) to encode CSV files into Vectors and serialize them to
>> >> the file system?
>> >
>> > There isn't a good command line for this, largely because it is difficult
>> > to describe how to convert each CSV field. There are some beginnings of
>> > efforts on this, but the results are still limited.
>> >
>> >> And what do you mean by "file system": the local file system or HDFS,
>> >> since you mentioned MapReduce?
>>
>> How can I specify an HDFS URI for the --input option?
>
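
For readers following the quoted advice, here is a rough sketch of the idea of hashing CSV fields into a fixed-size vector once and storing the result in binary form, so later passes skip CSV parsing. It is plain Python and does not use Mahout's API or file formats. The vector size, the function names, the binary layout, and the choice to treat every field as categorical are all assumptions made for illustration.

```python
import csv
import io
import struct
import zlib

NUM_FEATURES = 16  # hypothetical vector size, chosen only for this sketch


def encode_row(row, num_features=NUM_FEATURES):
    """Hash each 'column=value' pair into one slot of a fixed-size vector."""
    vec = [0.0] * num_features
    for name, value in row.items():
        # crc32 is stable across runs, unlike Python's built-in hash() for str
        index = zlib.crc32(f"{name}={value}".encode("utf-8")) % num_features
        vec[index] += 1.0
    return vec


def csv_to_vectors(csv_text, num_features=NUM_FEATURES):
    """Parse CSV text (with a header row) once and encode every row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [encode_row(row, num_features) for row in reader]


def write_vectors(vectors, stream):
    """Write a (count, width) header followed by packed little-endian doubles."""
    width = len(vectors[0]) if vectors else 0
    stream.write(struct.pack("<II", len(vectors), width))
    for vec in vectors:
        stream.write(struct.pack(f"<{width}d", *vec))


def read_vectors(stream):
    """Read back vectors written by write_vectors, with no CSV parsing."""
    count, width = struct.unpack("<II", stream.read(8))
    return [
        list(struct.unpack(f"<{width}d", stream.read(8 * width)))
        for _ in range(count)
    ]
```

The point of the sketch is only the split between a one-time encoding pass and cheap repeated reads. Numeric fields would normally be encoded by value instead of as categories, and a real job would write to whatever storage the training program reads from.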
-- Sent from my mobile device