Huh?

What program are you talking about?

On Fri, May 6, 2011 at 9:36 PM, Xiaobo Gu <guxiaobo1...@gmail.com> wrote:

> >> > 2. In production mode, don't use CSV; you will find that most of the time
> >> > is spent parsing the CSV data and hashing it into features. You might encode
> >> > the features as vectors and serialize them to the file system with MapReduce
> >> > to reduce the cost of data parsing.
> >>
> >> Currently we are not familiar with Vectors. Is there a standard way
> >> (command line) to encode CSV files into Vectors and serialize them to the
> >> file system?
> >>
> >
> > There isn't a good command line for this, largely because it is difficult to
> > describe how to convert each CSV field.  There are some beginnings of efforts
> > on this, but the results are still limited.
> >
> >
> >> And what do you mean by "file system": the local file system or HDFS?
> >> I ask because you mentioned MapReduce.
>
> How can I specify an HDFS URI for the --input option?
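
On the quoted point about hashing CSV fields into features, here is a minimal
sketch of the general idea in plain Python. It is not Mahout's encoder API;
the function names (hash_row, encode_csv), the vector size, and the sample
data are all made up for illustration, and CRC32 is just a convenient stable
hash chosen for the example.

```python
import csv
import io
import zlib


def hash_row(fields, num_features=16):
    """Hash each (column index, value) pair of one CSV row into a fixed-size vector."""
    vec = [0.0] * num_features
    for col, value in enumerate(fields):
        # crc32 is stable across runs, unlike Python's built-in hash() on strings.
        idx = zlib.crc32(f"{col}={value}".encode("utf-8")) % num_features
        vec[idx] += 1.0  # colliding features simply add up in the same slot
    return vec


def encode_csv(text, num_features=16):
    """Parse CSV text once and return one hashed vector per row."""
    reader = csv.reader(io.StringIO(text))
    return [hash_row(row, num_features) for row in reader]


# Hypothetical sample data: two rows, three fields each.
vectors = encode_csv("red,small,1\nblue,large,0\n")
```

The point of doing this ahead of time is that the parsing and hashing cost is
paid once; later passes can read the stored vectors directly instead of
re-parsing the CSV text.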
